Taro Logo
4

OpenAI audio demo is super cool

Profile picture
Rahul Pandey (Tech Lead/Manager at Meta, Pinterest, Kosei)6 days ago

Just released from OpenAI: https://www.linkedin.com/feed/update/urn:li:activity:7195838342729486337/

A few observations:

  • I love how natural the voice sounds, almost playful
  • Incredible that the model is able to take in audio and visual inputs
  • Feels like we're getting closer to an AI-companion
60
3

Discussion

(3 comments)
  • 4
    Profile picture
    Founding ML Engineer @ Lancey (YC S22)
    6 days ago

    3 uses that jump to me:

    • A GPT4 copilot that can watch your screen and help you pair program/debug
    • Real time mechanic to help you correct your car or something
    • Real time fitness coach to help you correct your form while training

    https://www.youtube.com/watch?v=wfAYBdaGVxs
    Here they show a real time interview coach 🤯. Would be an amazing tool for system design and DSA prep

    • 0
      Profile picture
      Senior Leadership @ Meta | Mentor | Coach | Tech Advisor
      5 days ago

      I definitely think this could be the perfect agent for Smart Glasses like the Meta Ray-bans. Having both audio and video input and getting real-time data for any situation, whether that is pleasure, i.e. travelling, doing something professionally, writing docs or working with tools.

      So many opportunities!

    • 0
      Profile picture
      SWE @ Govt
      5 days ago

      @Alex my thoughts exactly, I did a demo with a few phones and the meta glasses, and even though it was a bit sus - this form of communication with AIs is the way forward!

      I won't show the demo until I can trick it into calling my alt FB account "Chaat Geepeatee" though 😂

OpenAI is an American artificial intelligence (AI) research laboratory, which is behind ChatGPT. OpenAI conducts research on artificial intelligence with the declared intention of developing "safe and beneficial" artificial general intelligence, which it defines as "highly autonomous systems that outperform humans at most economically valuable work".
OpenAI4 questions