Skip to content

Beet.TV

BRSJ 2026 – Promo
BeetTV on Facebook BeetTV on Twitter BeetTV on LinkedIn BeetTV on YouTube Subscribe to BeetTV RSS

Twelve Labs’ Soyoung Lee: Video Foundation Models Finally Enable ‘Holistic’ Contextual Understanding

Commerce Media Moves Up Funnel as Brands Test ‘Off-Platform’ Signals
Digital Envoy’s Charlie Johnson: LocID Anchors Identity On Locations Instead Of IPs
AI Copywriting Bots Join With Humans to Go Beyond Garbage: Omnicom’s Stern
Brand, Performance Budgets Merge as Marketers Chase Unified KPIs: Assembly’s Jill Kelly
WBD Sees Ads Entering the ‘Age of Relevance’, Shoppable TV
Best Buy’s Jen Jensen: Why Consumer Electronics Shoppers Represent ‘Unique Customer Landscape’
Accurate Forecasting Is Key to Unlocking Premium CTV: IPG’s Jean Fitzpatrick
Liquid Death Media Chief Pours Cold Water on CTV Conversion Hype
TelevisaUnivision’s Bethany Hillman: Linear Dollars have Fully Arrived to CTV
AI is Streamlining Identity, Media Planning and Future of FAST: Spark Foundry’s Steve Birnbaum
Retail Media Poised to Reshape Every Major Ad Channel: Analyst Andrew Lipsman
AI and the New Streaming Bargain, with KERV.ai’s Brad Quinn
Roku’s Jeff Katz: Amazon Partnership Reaches ‘80% of U.S. CTV Footprint’
CTV Growth is Fueling Both Opportunity and Complexity: OpenX’s Stacey Bohrer
Kinective Media’s Alissa Spiwak: Starlink will Enable Real-Time Ad Optimization at 30,000 Feet
Advertisers are Coming Back to News, With Better Tools: Ad Fontes Media’s Vanessa Otero
Scene-Level Targeting Brings AI’s Contextual Revolution To CTV
Mobile Gaming Drives Performance for Programmatic Ads: Zynga’s Corentin Leydis
Elevating Context, Commerce Media and the Power of Signals: Uber Advertising’s Kristi Argyilan
‘Boring Work’ Comes Before AI-Optimized TV: OpenAP’s Levy

RANCHO PALOS VERDES, CALIF. — There’s a lot more to analyzing streaming video than what meets the eye. And while all kinds of software promises to plumb the depths of knowing what ads are working in real-time, the need to apply a genuine understanding is where multimodal video foundation models like that of Twelve Labs comes in. 

“Video is just really complex. There’s sound, there’s visual, there’s language dialogue, there’s also time. And everything in the human world is very nuanced,” Soyoung Lee, co-founder and head of GTM at Twelve Labs, told Beet.TV contributor David Kaplan at the Beet Retreat LA. “You really need to understand all of these different data modalities together and be able to capture it in a holistic way that almost replicates the way that the human mind works.”

Previous approaches relied on transcribing audio or extracting objects from individual frames without understanding how actions unfold over time, limiting contextual accuracy for advertising and content applications.

Video requires native models

Large language model-based approaches that analyze video frames struggle because video differs fundamentally from text data formats, requiring models built from the ground up to handle moving images.

“When we speak or write, every word that we spit out is done intentionally,” Lee said. For video, it’s very different where not every frame is useful, right? Not every moment or context aggregates into meaning.”

AI must continuously watch videos at scale to identify meaningful seconds, frames, and moments that formulate memory and true context rather than treating every frame equally.

Video embeddings unify metadata

The advertising industry uses text embeddings to unify metadata taxonomies across platforms and stakeholders, and video embeddings now enable similar standardization for brand creative and addressable publisher content.

“You can actually start to unify all of the contextual information across the industry and have the data speak in the same language that becomes semantically accessible for any productization or service offering that can be created downstream,” Lee noted.

These multimodal video embeddings power semantic search, classification, and insight generation across applications.

Episode-level targeting

Rich contextual descriptions at episode level enable advertisers to buy based on content context rather than genre categories or behavioral targeting that raises privacy concerns.

“For years, contextual advertising has been a hot topic, but in order to really access episode level targeting as opposed to behavioral targeting, there’s always a question of privacy,” Lee said.

Publishers benefit through customized platform experiences including enhanced content discovery, personalized recommendations based on viewer moment-level preferences, and optimized trailers that maintain engagement.

Brand safety is a driving force

Understanding true video context reveals that not all news content carries risk, expanding viable inventory for advertisers who previously avoided entire categories.

“Brand safety is the number one fastest use case that drives adoption of truly understanding context. News is interesting because not all news is risky. If you can actually understand the context of what’s there, there’s actually a lot more inventory that can exist that’s viable,” Lee said.

Publishers also analyze creative performance by cohort to identify commonalities in successful ads, providing actionable insights that weren’t previously possible at scale.

Delivering immediate impact

Publishers see fastest results through platform experience optimization that keeps users engaged via better recommendations, enriched discovery algorithms, and trailers generated from finished content.

“The most impactful ones have been how you optimize the platform experience and optimize that experience for your user. That’s everyone’s challenge of keeping the user engaged,” Lee said.

Creative analysis helps advertisers understand cultural elements and performance attributes to optimize future assets, with all use cases powered by the same underlying video understanding technology that analyzes temporal context across multiple modalities.

“The ability for an AI to continuously watch a video and many videos at scale and to be able to understand what are the meaningful seconds, frames, moments that need to be put together in order to formulate a memory and true context has to be a model that’s built from the ground up to tackle video,” Lee said.

Share this post:
LinkedIn Facebook X WhatsApp Email Embed

Share: Twelve Labs’ Soyoung Lee: Video Foundation Models Finally Enable ‘Holistic’ Contextual Understanding

Direct Link:

Embed Code:

By David Kaplan on December 9, 2025December 15, 2025    @
Beet Retreat LA 2025 presented by Adobe, OpenX & StackAdaptTagged metadata, digital video, data, audience targeting, streaming, Soyoung Lee, Twelve Labs, Beet Retreat LA

Recent Videos

Agentic AI Is Rewriting Business Models for Ad Agencies: S4 Capital’s Sir Martin Sorrell

LAS VEGAS –Sir Martin Sorrell, executive chairman of S4 Capital, has a blunt assessment of advertising’s future. Speaking with Beet.TV contributor David Kaplan at CES 2026, Sorrell said agentic AI “fundamentally challenges the model.” Strategic and creative under attack Sorrell argued that agentic systems threaten the traditional agency stack. “Strategic and creative is under attack,” […]

 

Why Boring Brands Fail in the $8.6 Billion Gaming Ad Market

LAS VEGAS — Advertisers are pouring billions into virtual worlds, with U.S. game ad revenues projected to climb nearly 7% to reach $8.59 billion this year. But, while nearly half of gamers purchase items based on in-game ads, a majority still view these interruptions as a distraction from the action. The problem? A lack of […]

 

Live Sports Advertising Edges Toward Agentic Automation: NBCU’s Ryan McConville

LAS VEGAS – At CES 2026, Comcast’s NBCUniversal rolled out an idea that sounds futuristic but solves an old headache: buying live sports ads without losing your sanity. Ryan McConville, chief product officer and EVP of ad products at NBCUniversal, said the company is testing agentic AI to automate live sports buying across streaming and […]

 

In-Store Media Is Third Pillar of Retail Advertising: Albertsons’ Brian Monahan

LAS VEGAS – At CES 2026, Albertsons Media Collective outlined how it is tightening measurement, expanding in-store media and using AI to link advertising more directly to sales outcomes. Senior Vice President Brian Monahan said the network is focused on shared growth across brands, merchants and shoppers. “We seek to live up to our name […]

 

Cross-Platform Cloud Gaming Comes With PHYND’s Magnite-Powered Ads

LAS VEGAS — The gaming industry has long been the epitome of fragmentation – mobile gamers on tiny screens, console enthusiasts locked into expensive hardware, and PC players tethered to their desks. Now a new platform aims to unite them all on the largest screen in the house. PHYND is betting that the smart TV […]

 

Agentic Commerce Adds New Lane to Shopping, Not a Detour: Criteo CEO Michael Komasinski

LAS VEGAS – Agentic commerce is expanding how consumers shop rather than replacing existing channels, says Michael Komasinski, chief executive of adtech firm Criteo. Speaking with Beet.TV contributor David Kaplan at CES 2026, Komasinski said AI-powered assistants should be viewed as an incremental layer that complements search, social and retail media, not a substitute for […]

 

Unified Measurement Across TV and Social Is Finally Arriving: iSpot’s Julie Van Ullen

LAS VEGAS – Advertisers sounded a familiar concern at CES 2026: They still struggle to measure television and video spending across platforms in one unified view. That challenge has persisted for years, Julie Van Ullen, president, CRO of iSpot, said in this interview with Lisa Granatstein, editorial director of Beet.TV. The biggest gap has been […]

 

AI Agents are Starting to Buy TV Ads for Each Other

The complexity of purchasing premium video across linear and streaming endpoints has long plagued the industry, creating friction that often sends budgets flowing toward simpler social platforms. As artificial intelligence matures from generative creative tools into autonomous transaction capabilities, the machinery of media trading is beginning to mimic the speed of high-frequency finance. But while […]

 

Inside Reddit’s AI-Powered Push into Self-Serve Media Buying: Stephen Riad

LAS VEGAS – Agentic AI and self-serve tools took center stage at CES in Las Vegas last week, marking a significant shift from assisting advertisers to actually executing media campaigns. Joining me in the bustling Beet.TV studio at the Vdara Hotel, I caught up with Reddit’s Stephen Riad, who was just promoted to EVP of […]

 

WPP Media’s Luis Martinez: Retail Media Reaches ‘Tipping Point’ Beyond Transactions to Brand Building

LONDON — Retail media is evolving from a transaction-focused performance channel toward brand building as companies leverage data and signals to shape customer connections across multiple touchpoints. “Companies can actually use the data in two ways. One is obviously the transaction that will always be there, will remain strong, but also brand building,” Luis Martinez, […]

 

Interactive Video Ads Outperform Standard Formats Across the Funnel: Amazon’s Maggie Zhang

LAS VEGAS – Amazon Ads and Publicis Media unveiled new research on interactive video advertising at CES 2026. The joint thought leadership study focuses on interactive formats in streaming TV. “Yes, we’re really excited to announce and release the Joint Thought Leadership Study with Publicis Media today at CES,” Maggie Zhang, head of global video […]

 

Streaming, Social, Retail Media Redraw Europe’s TV Ad Market: PubMatic’s Andy Jones

LONDON – Linear television viewing is declining across major European markets, driving a corresponding drop in advertising spend, says Andy Jones, VP of broadcast and connected television, EMEA, at PubMatic. The shift is consistent across regions and no longer disputed by buyers or sellers, he said in this interview with Beet.TV contributor Robert Andrews at […]

 

Commerce Media Moves Up Funnel as Brands Test ‘Off-Platform’ Signals

RANCHO PALOS VERDES, CALIF – Retail media networks are evolving beyond their own websites, as platforms increasingly leverage transaction data to target consumers across the open web and connected television. U.S. commerce media ad spend projected to reach $69.69 billion in 2025, a 21.8% increase over the previous year, according to data from Emarketer. For […]

 

Live TV, AI Search, Retail Media are Set to Converge: Omnicom’s Megan Pagliuca at CES 2026

LAS VEGAS – At CES 2026, Megan Pagliuca, Chief Product Officer at Omnicom Media Group, outlined how the agency is responding to fundamental shifts in how consumers discover information, shop and engage with media. Tightly linking live content, AI-powered search signals, influencer marketing and retail media into a single, measurable system is essential. Speaking with […]

 

Commerce Data Pushes Retail Media Beyond the Lower Funnel: Criteo’s Nicole Kivel

LONDON – Retail media is no longer confined to on-site placements or last-click conversions. Instead, commerce data is increasingly reshaping how brands plan and activate media across the funnel and across channels, says Nicole Kivel, managing director of Northern Europe at Criteo. Speaking with Beet.TV contributor Robert Andrews at the Beet.TV and WPP Media Leadership […]

 

Advertisers Want it All: Mass Audience Reach Plus Advanced Targeting in CTV

LONDON — Advertisers want it all. They crave the mass audience reach that television has always delivered, but they also demand the precise targeting capabilities that digital advertising has trained them to expect. So, as connected TV matures into a dominant force in the media landscape, the industry is racing to satisfy both appetites simultaneously. […]

 

With AVA, Havas Pursues Context-First Planning for CTV Brand-Building

LAS VEGAS — The connected TV advertising market has exploded in scale, but some in the industry say it’s still searching for the fabled combination of right-moment, right-message. That’s the philosophy driving a renewed focus on “context-first planning”, which moves beyond demographic proxies. “We are big believers in context-first planning, and that’s important because it […]

 

We’ve Lost a Great One: CBS’s Radha Subramanyam. Gone at 55

We heard the sad news today that our friend Radha has passed. She had been battling brain cancer for some time. I wanted to say how sad I am, how much we will miss her and how grateful we are for her contributions to the industry. Our deepest condolences to her family. We so appreciated […]

 

TiVo Ad’s Chris Kleinschmidt: CTV Home Screens Are Billboards With Better Targeting

LONDON — Connected TV home screens function as digital billboards that capture viewer attention before content consumption begins, offering targeting capabilities unavailable in traditional out-of-home advertising. “It is very similar to out of home in the sense that it is a billboard. For lack of a better word, it is a billboard on the homepage […]

 

Operative’s Tatta: AI can Solve the ‘4D Chess’ of Media Fragmentation

LONDON — As streaming and social platforms muscle their way into the mainstream, the media landscape has fractured into a kaleidoscope of disconnected inventory, with broadcasters, trying to figure out how to sell it across a sprawling ecosystem. The result is often a manual, time-consuming struggle to unify ad sales and present a coherent offering […]

 

Attention Metrics Emerge as the Unifier in a Fragmented Media World

LONDON — The advertising industry has spent years grappling with measurement challenges as media channels proliferate. Now, a new approach is gaining traction: using attention as the common currency that can bridge the gap between disparate platforms. Rather than simply counting whether consumers have seen an ad, marketers are increasingly focused on understanding how deeply […]

 
More Videos
  • About
  • Team
  • Calendar
  • Products
  • Contact Us
  • Privacy Policy
Copyright © 2026 Beet Media, LLC. All Rights Reserved. Beet.TV is published by Beet Media, LLC.