{"id":242269,"date":"2026-02-16T22:46:17","date_gmt":"2026-02-16T22:46:17","guid":{"rendered":"https:\/\/evertise.net\/?p=131146"},"modified":"2026-02-16T22:46:17","modified_gmt":"2026-02-16T22:46:17","slug":"tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai","status":"publish","type":"post","link":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/","title":{"rendered":"Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational AI"},"content":{"rendered":"<p><img fetchpriority=\"high\" decoding=\"async\" class=\"aligncenter size-full wp-image-131147\" src=\"https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34.png\" alt=\"\" width=\"1060\" height=\"590\" srcset=\"https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34.png 1060w, https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34-300x167.png 300w, https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34-1024x570.png 1024w, https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34-768x427.png 768w\" sizes=\"(max-width: 1060px) 100vw, 1060px\" \/><\/p>\n<p><span style=\"font-weight: 400\">SAN FRANCISCO, CA &#8211; <\/span><a href=\"https:\/\/www.tavus.io\/\"  rel=\"noopener\"><span style=\"font-weight: 400\">Tavus<\/span><\/a><span style=\"font-weight: 400\">, the human computing company building lifelike AI humans that can see, hear, and respond in real time, <\/span><a href=\"https:\/\/www.tavus.io\/post\/raven-1-bringing-emotional-intelligence-to-artificial-intelligence\"  rel=\"noopener\"><span style=\"font-weight: 400\">launched Raven-1 into GA today<\/span><\/a><span style=\"font-weight: 400\">, a multimodal perception system that enables AI to understand emotion, intent, and context the way humans do.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Raven-1 captures and interprets audio and visual signals together, enabling AI systems to understand 
not just what users say, but how they say it and what that combination actually means. The model is now generally available across all Tavus conversations and APIs.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Conversational AI has made rapid progress in language generation and speech synthesis, yet understanding remains a persistent gap. Most systems process speech by converting it into transcripts, a transformation that strips away tone, pacing, hesitation, and expression: everything that makes communication colorful and meaningful. Without those signals, AI is forced to guess at intent, and those guesses break down exactly when they matter most. The sarcastic &#8220;great&#8221; becomes indistinguishable from the genuine one.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Raven-1 takes a different approach. Instead of analyzing audio and visual signals in isolation, it fuses them into a unified representation of the user&#8217;s state, intent, and context, producing natural language descriptions that downstream language models can reason over directly.<\/span><\/p>\n<h2><b>A New Model for Conversational Perception<\/b><\/h2>\n<p><span style=\"font-weight: 400\">Raven-1 is a multimodal perception system built for real-time conversation in the Tavus Conversational Video Interface (CVI). 
Rather than outputting rigid categorical labels like &#8220;happy&#8221; or &#8220;sad,&#8221; Raven-1 describes what it perceives the way a person would, producing interpretable natural language descriptions of emotional state and intent at sentence-level granularity.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Key capabilities include:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Audio-visual fusion that integrates tone, prosody, facial expression, posture, and gaze into unified real-time context<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Natural language outputs aligned directly with LLMs, requiring no translation layer<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Temporal modeling that tracks how emotional and attentional states evolve throughout a conversation<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Sub-100ms audio perception latency with combined pipeline latency under 600ms<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Custom tool calling support for developer-defined events such as emotional thresholds, attention shifts, or user laughter<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400\">Raven-1 functions as a perception layer that works alongside <\/span><a href=\"https:\/\/www.tavus.io\/post\/sparrow-1-human-level-conversational-timing-in-real-time-voice\"  rel=\"noopener\"><span style=\"font-weight: 400\">Sparrow-1, Tavus\u2019 recently<\/span><\/a> <a href=\"https:\/\/www.tavus.io\/post\/sparrow-1-human-level-conversational-timing-in-real-time-voice\"  rel=\"noopener\"><span style=\"font-weight: 400\">launched conversational timing model<\/span><\/a><span style=\"font-weight: 400\">, and Phoenix-4, creating a closed loop where perception informs response and response reshapes the moment.<\/span><\/p>\n<h2><b>Why Multimodal Perception 
Matters<\/b><\/h2>\n<p><span style=\"font-weight: 400\">Traditional emotion detection systems suffer from fundamental limitations. They flatten nuance into rigid categories, assume emotional consistency across entire utterances, and treat audio and visual signals independently. Human emotion is fluid, layered, and contextual. A single moment can carry frustration and hope at once.<\/span><\/p>\n<p><span style=\"font-weight: 400\">When someone says &#8220;Yeah, I&#8217;m fine&#8221; while avoiding eye contact and speaking in a flat monotone, transcription-based systems take them at their word. Raven-1 captures the full picture: tone, expression, posture, and the incongruence between words and signals that often carries the most important meaning.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Industry research indicates that up to 75 percent of medical diagnoses are derived from patient communication and history-taking rather than lab tests or physical exams. For high-stakes use cases like healthcare, therapy, coaching, and interviews, perception-aware AI ensures this signal is not lost.<\/span><\/p>\n<h2><b>Built for Real-Time Conversations<\/b><\/h2>\n<p><span style=\"font-weight: 400\">Raven-1 was designed from the ground up for real-time operation. The audio perception pipeline produces rich descriptions in under 100ms. Combined with the visual pipeline, the system maintains context that is never more than a few hundred milliseconds stale.<\/span><\/p>\n<p><span style=\"font-weight: 400\">The system excels on short, ambiguous, emotionally loaded inputs, exactly the moments where traditional systems fail. A single-word response like &#8220;sure&#8221; or &#8220;fine&#8221; carries radically different meanings depending on how it&#8217;s delivered. 
Raven-1 captures that signal and makes it available to response generation.<\/span><\/p>\n<h2><b>Availability<\/b><\/h2>\n<p><span style=\"font-weight: 400\">Raven-1 is generally available today across all Tavus conversations and APIs. The model works automatically out of the box, with perception layer access exposed through Tavus APIs for custom tool calls and programmatic logic.<\/span><\/p>\n<p><span style=\"font-weight: 400\">To see Raven-1 in action, visit the demo at <\/span><a href=\"https:\/\/raven.tavuslabs.org\/\"  rel=\"noopener\"><span style=\"font-weight: 400\">https:\/\/raven.tavuslabs.org<\/span><\/a><\/p>\n<h2><b>About Tavus<\/b><\/h2>\n<p><span style=\"font-weight: 400\">Tavus is a San Francisco-based AI research company pioneering human computing, the next era of computing built around adaptive and emotionally intelligent AI humans. Tavus develops foundational models that enable machines to see, hear, respond, and act in ways that feel natural to people.<\/span><span style=\"font-weight: 400\"><br \/>\n<\/span><\/p>\n<p><span style=\"font-weight: 400\">In addition to <\/span><a href=\"https:\/\/docs.tavus.io\/sections\/introduction\"  rel=\"noopener\"><span style=\"font-weight: 400\">APIs for developers and business<\/span><\/a><span style=\"font-weight: 400\">, Tavus offers PALs, a consumer platform for AI agents that might become a friend, intern, or both.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Learn more at <\/span><a href=\"http:\/\/tavus.io\/\"  rel=\"noopener\"><span style=\"font-weight: 400\">tavus.io<\/span><\/a><span style=\"font-weight: 400\">\u00a0<\/span><\/p>\n<p><b>For Contact:<\/b><\/p>\n<p><span style=\"font-weight: 400\">Leigh Disher <\/span><a href=\"mailto:leigh@gmkcommunications.com\"><span style=\"font-weight: 400\">leigh@gmkcommunications.com<\/span><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>SAN FRANCISCO, CA \u2013 Tavus, the human computing company building lifelike AI humans that can see, hear, 
and respond in real time, launched Raven-1 into GA today, a multimodal perception system that enables AI to understand emotion, intent, and context the way humans do. Raven-1 captures and interprets audio and visual signals together, enabling AI [\u2026] <a href=\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/\" class=\"more-link\">Continue Reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":271,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[57],"tags":[],"class_list":["post-242269","post","type-post","status-publish","format-standard","hentry","category-ips"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.9 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational AI - Business<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational AI - Business\" \/>\n<meta property=\"og:description\" content=\"SAN FRANCISCO, CA \u2013 Tavus, the human computing company building lifelike AI humans that can see, hear, and respond in real time, launched Raven-1 into GA today, a multimodal perception system that enables AI to understand emotion, intent, and context the way humans do. 
Raven-1 captures and interprets audio and visual signals together, enabling AI [\u2026] Continue Reading &rarr;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/\" \/>\n<meta property=\"og:site_name\" content=\"Business\" \/>\n<meta property=\"article:published_time\" content=\"2026-02-16T22:46:17+00:00\" \/>\n<meta name=\"author\" content=\"Evertise\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Evertise\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/\",\"url\":\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/\",\"name\":\"Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational AI - 
Business\",\"isPartOf\":{\"@id\":\"https:\/\/ipsnews.net\/business\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34.png\",\"datePublished\":\"2026-02-16T22:46:17+00:00\",\"author\":{\"@id\":\"https:\/\/ipsnews.net\/business\/#\/schema\/person\/02176def5777c27b30102772b94615ca\"},\"breadcrumb\":{\"@id\":\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#primaryimage\",\"url\":\"https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34.png\",\"contentUrl\":\"https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34.png\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/ipsnews.net\/business\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational 
AI\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/ipsnews.net\/business\/#website\",\"url\":\"https:\/\/ipsnews.net\/business\/\",\"name\":\"Business\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/ipsnews.net\/business\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/ipsnews.net\/business\/#\/schema\/person\/02176def5777c27b30102772b94615ca\",\"name\":\"Evertise\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/ipsnews.net\/business\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/d79ec50bebdc68a4ebc6cfc341e0920ba7b507bde39945491ca6dec05d097ed7?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/d79ec50bebdc68a4ebc6cfc341e0920ba7b507bde39945491ca6dec05d097ed7?s=96&d=mm&r=g\",\"caption\":\"Evertise\"},\"sameAs\":[\"http:\/\/evertise.net\"],\"url\":\"https:\/\/ipsnews.net\/business\/author\/evertise\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational AI - Business","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/","og_locale":"en_US","og_type":"article","og_title":"Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational AI - Business","og_description":"SAN FRANCISCO, CA \u2013 Tavus, the human computing company building lifelike AI humans that can see, hear, and respond in real time, launched Raven-1 into GA today, a multimodal perception system that enables AI to understand emotion, intent, and context the way humans do. Raven-1 captures and interprets audio and visual signals together, enabling AI [\u2026] Continue Reading &rarr;","og_url":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/","og_site_name":"Business","article_published_time":"2026-02-16T22:46:17+00:00","author":"Evertise","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Evertise","Est. 
reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/","url":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/","name":"Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational AI - Business","isPartOf":{"@id":"https:\/\/ipsnews.net\/business\/#website"},"primaryImageOfPage":{"@id":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#primaryimage"},"image":{"@id":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#primaryimage"},"thumbnailUrl":"https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34.png","datePublished":"2026-02-16T22:46:17+00:00","author":{"@id":"https:\/\/ipsnews.net\/business\/#\/schema\/person\/02176def5777c27b30102772b94615ca"},"breadcrumb":{"@id":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perception-to-real-time-conversational-ai\/#primaryimage","url":"https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34.png","contentUrl":"https:\/\/evertise.net\/wp-content\/uploads\/2026\/02\/34.png"},{"@type":"BreadcrumbList","@id":"https:\/\/ipsnews.net\/business\/2026\/02\/16\/tavus-introduces-raven-1-bringing-multimodal-perceptio
n-to-real-time-conversational-ai\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/ipsnews.net\/business\/"},{"@type":"ListItem","position":2,"name":"Tavus Introduces Raven-1, Bringing Multimodal Perception to Real-Time Conversational AI"}]},{"@type":"WebSite","@id":"https:\/\/ipsnews.net\/business\/#website","url":"https:\/\/ipsnews.net\/business\/","name":"Business","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/ipsnews.net\/business\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/ipsnews.net\/business\/#\/schema\/person\/02176def5777c27b30102772b94615ca","name":"Evertise","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/ipsnews.net\/business\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/d79ec50bebdc68a4ebc6cfc341e0920ba7b507bde39945491ca6dec05d097ed7?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/d79ec50bebdc68a4ebc6cfc341e0920ba7b507bde39945491ca6dec05d097ed7?s=96&d=mm&r=g","caption":"Evertise"},"sameAs":["http:\/\/evertise.net"],"url":"https:\/\/ipsnews.net\/business\/author\/evertise\/"}]}},"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/posts\/242269","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/users\/271"}],"replies":[{"embeddable":true,"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/comments?post=242269"}],"version-history":[{"count":2,"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/posts\/242269\/revisions"}],"predecessor-vers
ion":[{"id":242281,"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/posts\/242269\/revisions\/242281"}],"wp:attachment":[{"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/media?parent=242269"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/categories?post=242269"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/ipsnews.net\/business\/wp-json\/wp\/v2\/tags?post=242269"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}