Beet.TV

Twelve Labs’ Soyoung Lee: Video Foundation Models Finally Enable ‘Holistic’ Contextual Understanding

RANCHO PALOS VERDES, CALIF. — There’s a lot more to analyzing streaming video than meets the eye. And while all kinds of software promise to reveal which ads are working in real time, the need for genuine understanding of the content is where multimodal video foundation models like those from Twelve Labs come in.

“Video is just really complex. There’s sound, there’s visual, there’s language dialogue, there’s also time. And everything in the human world is very nuanced,” Soyoung Lee, co-founder and head of GTM at Twelve Labs, told Beet.TV contributor David Kaplan at the Beet Retreat LA. “You really need to understand all of these different data modalities together and be able to capture it in a holistic way that almost replicates the way that the human mind works.”

Previous approaches relied on transcribing audio or extracting objects from individual frames without understanding how actions unfold over time, limiting contextual accuracy for advertising and content applications.

Video requires native models

Large language model-based approaches that analyze video frames struggle because video differs fundamentally from text data formats, requiring models built from the ground up to handle moving images.

“When we speak or write, every word that we spit out is done intentionally,” Lee said. “For video, it’s very different where not every frame is useful, right? Not every moment or context aggregates into meaning.”

AI must continuously watch videos at scale to identify meaningful seconds, frames, and moments that formulate memory and true context rather than treating every frame equally.

Video embeddings unify metadata

The advertising industry uses text embeddings to unify metadata taxonomies across platforms and stakeholders, and video embeddings now enable similar standardization for brand creative and addressable publisher content.

“You can actually start to unify all of the contextual information across the industry and have the data speak in the same language that becomes semantically accessible for any productization or service offering that can be created downstream,” Lee noted.

These multimodal video embeddings power semantic search, classification, and insight generation across applications.
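As a rough illustration of how embeddings can power semantic search, the sketch below ranks a small catalog by cosine similarity to a query vector. It is a minimal sketch, not Twelve Labs' method or API: the three-dimensional vectors, the clip names, and the `semantic_search` helper are all hypothetical, and real video embeddings would come from a model and have far more dimensions.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def semantic_search(query_vec, catalog, top_k=2):
    """Rank catalog entries by similarity to the query embedding (hypothetical helper)."""
    scored = [(cosine_similarity(query_vec, vec), name) for name, vec in catalog.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [name for _, name in scored[:top_k]]


# Hypothetical toy embeddings; assumed values chosen only for illustration.
catalog = {
    "cooking_show_ep1": [0.9, 0.1, 0.0],
    "news_weather_segment": [0.1, 0.8, 0.2],
    "travel_food_tour": [0.7, 0.2, 0.3],
}

# Stand-in for the embedding of a query such as "food preparation scenes".
query = [1.0, 0.0, 0.1]

results = semantic_search(query, catalog)
print(results)  # ['cooking_show_ep1', 'travel_food_tour']
```

Because brand creative and publisher content would share one vector space in this setup, the same similarity score could in principle serve search, classification, and matching ads to content.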

Episode-level targeting

Rich contextual descriptions at the episode level enable advertisers to buy based on content context rather than on genre categories or behavioral targeting, which raises privacy concerns.

“For years, contextual advertising has been a hot topic, but in order to really access episode level targeting as opposed to behavioral targeting, there’s always a question of privacy,” Lee said.

Publishers benefit through customized platform experiences including enhanced content discovery, personalized recommendations based on viewer moment-level preferences, and optimized trailers that maintain engagement.

Brand safety is a driving force

Understanding true video context reveals that not all news content carries risk, expanding viable inventory for advertisers who previously avoided entire categories.

“Brand safety is the number one fastest use case that drives adoption of truly understanding context. News is interesting because not all news is risky. If you can actually understand the context of what’s there, there’s actually a lot more inventory that can exist that’s viable,” Lee said.

Publishers also analyze creative performance by cohort to identify commonalities in successful ads, providing actionable insights that weren’t previously possible at scale.

Delivering immediate impact

Publishers see the fastest results from platform experience optimization, which keeps users engaged through better recommendations, enriched discovery algorithms, and trailers generated from finished content.

“The most impactful ones have been how you optimize the platform experience and optimize that experience for your user. That’s everyone’s challenge of keeping the user engaged,” Lee said.

Creative analysis helps advertisers understand cultural elements and performance attributes to optimize future assets, with all use cases powered by the same underlying video understanding technology that analyzes temporal context across multiple modalities.

“The ability for an AI to continuously watch a video and many videos at scale and to be able to understand what are the meaningful seconds, frames, moments that need to be put together in order to formulate a memory and true context has to be a model that’s built from the ground up to tackle video,” Lee said.

By David Kaplan on December 9, 2025 / December 15, 2025
Beet Retreat LA 2025 presented by Adobe, OpenX & StackAdapt
Tagged: metadata, digital video, data, audience targeting, streaming, Soyoung Lee, Twelve Labs, Beet Retreat LA

Copyright © 2026 Beet Media, LLC. All Rights Reserved. Beet.TV is published by Beet Media, LLC.