GEO InsightsDec 20, 2025

Cosmetics: The Weight of Customer Reviews in the GEO Era

How AI engines use customer reviews and authority sources to recommend cosmetic products. A deep dive into GEO for the beauty industry.

1. Introduction: The Fundamental Paradigm Shift in Beauty Discovery

The cosmetics industry, historically anchored in visual seduction and brand storytelling, now finds itself at the epicenter of an unprecedented technological upheaval. For two decades, SEO (Search Engine Optimization) constituted the backbone of digital visibility, a technical game aimed at positioning URLs at the top of search engine results pages (SERPs). However, the advent of Large Language Models (LLMs) and generative response engines—such as OpenAI's ChatGPT, Google AI Overviews (SGE), Perplexity, and Claude—has inaugurated a new era: that of Generative Engine Optimization (GEO).
This report analyzes in depth this critical transition, postulating that the battle for visibility is no longer about keywords or backlinks, but about a brand's ability to integrate into the semantic fabric of artificial intelligence. Unlike the classic search engine that acts as a librarian pointing to resources, the generative engine acts as an expert consultant, synthesizing a unique answer from billions of data points.
In this new ecosystem, the brand's voice is diluted, even marginalized, in favor of a complex triangulation between user reviews, dermatological validations, and media consensus.
The transition in search behavior is palpable and rapid. Data indicates that in the United States, nearly 50% of queries now trigger AI-generated responses via Google AI Overviews, a figure expected to grow exponentially.
More alarming for traditional strategies, organic click-through rate (CTR) is undergoing massive erosion, dropping by nearly 60% for brands not optimized for these "zero-click answers". Users no longer seek to visit a site; they seek an immediate, contextual, and reliable answer.

Architecture Comparison: Legacy vs AI Stack

LEGACY STACK

Classic Indexing

Crawler (Googlebot)
HTML Parsing & Extraction
Inverted Index
Map: Keyword -> URL
Ranking (PageRank)
Sort by Links & Keywords
AI STACK

Retrieval Augmented Gen.

Tokenizer
Text -> Vector Conversion
Vector Store
Semantic Search (Dense Retrieval)
LLM Context Window
Synthesis & Generation

The shift from deterministic indexing to probabilistic inference.

The direct consequence for cosmetics brands is a loss of control over the narrative. Where the brand's website was once the final destination and source of truth, it is now just one source among many, ingested and often contested by the model. AI favors corroboration: if the brand claims a product is moisturizing but Reddit threads and dermatologist reviews on YouTube indicate otherwise, AI will reflect this dissonance or exclude the product from its recommendations.
This report will demonstrate how trust mechanisms have shifted from technical signals (site speed, meta tags) to semantic and third-party authority signals (sentiment, expertise, consensus).

2. AI Recommendation Mechanics: RAG, Entities, and Trust

To understand how to influence responses, it is imperative to dissect the technical workings of current recommendation engines. Most commercial systems, including Bing's shopping features or Google's summaries, use a Retrieval-Augmented Generation (RAG) architecture.

2.1 The RAG Process in the Beauty Context

RAG combines the linguistic power of an LLM (like GPT-4) with the precision of an updated search database. When a user asks a complex question like "What is the best anti-aging cream for sensitive skin on retinoids?", the system doesn't just invent an answer. It executes a series of ultra-fast operations:
Retrieval: The engine scans the web for relevant documents. Unlike SEO which looks for keywords, GEO looks for concepts and entities related to "sensitive skin" and "retinoids".
Generation (Synthesis): The model reads the retrieved content (reviews, blog posts, product pages) and synthesizes an answer. This is the critical stage where sorting occurs. The model is programmed to evaluate the reliability of contradictory sources.
In this process, the source of information becomes as important as the information itself. Studies show that AI platforms assign different trust weights depending on data origin. Commercial sites (.com) represent 80.41% of overall citations, but non-profit (.org) and educational (.edu) domains benefit from an implicit authority bonus, especially for dermatological health questions.

2.2 The Concept of "Co-Citation" and the Knowledge Graph

The concept of co-citation is central to GEO. AI builds its understanding of the world through a Knowledge Graph, linking entities (brands, ingredients, skin problems) to each other. If the brand "CeraVe" is frequently cited near terms like "dermatologist", "skin barrier", and "ceramides" on authority sites like WebMD, Allure, and Reddit, AI strengthens this neural link.
Conversely, a brand that only appears on its own site and low-quality affiliate blogs will fail to anchor its entity in the global knowledge graph. Semrush's GEO study emphasizes that unlinked brand mentions now carry significant weight. A simple text mention in a Vogue editorial article can have more GEO visibility impact than a technical link from a directory, because it feeds the model's semantic context.

2.3 The "Mention-Source Divide": Being Seen vs Being Believed

A subtle but crucial distinction emerges from recent data: the gap between mention and source citation (Mention-Source Divide). A brand can be mentioned in AI-generated text ("Users often appreciate The Ordinary products...") without its website being cited as a footnote source.
The AI Visibility Index 2025 report reveals that fewer than one in five brands manages to be both frequently mentioned in text and cited as an authoritative source.
MetricDefinitionStrategic Implication
AI MentionPresence of brand name in generated text response.Awareness (Top of Mind). Indicates AI "knows" the brand via its training corpus.
Source CitationClickable link or reference at bottom of response pointing to a specific URL.Traffic and Authority. Indicates AI considers this URL as reliable proof of the stated information.
Mention/Citation RatioFrequency of mentions compared to citations.A high ratio of mentions without citations suggests a popular brand whose own content lacks technical or structural authority.
Brands dominating this space, like L'Oréal or Estée Lauder, benefit from a massive history of mentions in web literature, giving them positive inertia in probabilistic models. However, newer brands like Rhode Skin or Rare Beauty manage to break through thanks to massive social mentions and third-party reviews, compensating for their lack of history with superior content velocity and freshness.

3. The Hegemony of Third-Party Sources: The Redistribution of Authority

If the brand's website loses its luster, who takes over? Analysis of citation data across Perplexity, ChatGPT, and Google SGE paints a radically transformed media landscape, dominated by three giants: communities (Reddit), encyclopedias (Wikipedia), and specialized press.

3.1 Reddit: The Truth Engine of Beauty

It is impossible to overstate Reddit's importance in the current GEO ecosystem. Statistics are clear: on Perplexity, an AI search engine favored by early adopters and precise information seekers, Reddit represents about 46.5% of primary citations. On Google AI Overviews, it captures 21% of citations, surpassing even YouTube and Wikipedia.
Why this AI obsession with Reddit? The answer lies in the nature of the data:
Vernacular Authenticity: Discussions on subreddits like r/SkincareAddiction or r/AsianBeauty are rich in contextual details ("it worked for my cystic acne but dried out my cheeks"). This level of nuance is exactly what models seek to answer complex queries ("long-tail queries").
Q&A Structure: Reddit's very structure (Question -> Multiple Answers -> Validation Votes) mimics the reinforcement learning process of AI (RLHF). Reddit threads are ready-to-use training data for learning human consensus.
Freshness: Unlike a static blog post, a Reddit thread evolves in real-time. For product launches or crises (botched reformulations), Reddit is often the first source to capture the signal.
For a cosmetics brand, this means that a "bad buzz" on Reddit is no longer just a PR problem; it's a structural SEO problem. If Reddit users systematically describe a product as a "scam" or "ineffective", AI will integrate this probabilistic association, contaminating future recommendations.

Dominant Authority Sources by AI Platform

Reddit
YouTube
Wikipedia
46.5%
2%
12.5%
Perplexity
21%
19%
0.6%
Google AI Overviews
11.3%
47.9%
ChatGPT
Share of voice

Distribution of the most cited domains. Note the dominance of Wikipedia and Reddit over direct e-commerce sites.

SOURCE: REDDIT, PROFOUND, SEJ

3.2 The Press and "Listicles": The Structure of Consensus

If Reddit provides raw experience, specialized press provides structure. Articles like "Top 10 Best Foundations 2025" published by authorities like Vogue, Byrdie, or Allure are essential for GEO.
In the SEO past, these articles served to acquire links (backlinks). In the GEO present, they serve to categorize and validate. When a model must answer a generic query ("best matte lipstick"), it doesn't test products. It performs a meta-analysis of existing lists on the web.
Recurrence is key: a brand appearing in 8 out of 10 lists from high-authority domains has an almost total probability of being cited by AI. Data shows that brands in the top quartile of web mentions get more than 10 times more citations in AI summaries than those in the next quartile. AI delegates, in a way, its judgment to these recognized human curators.

3.3 Dermatological Authority and the "YMYL" Factor

In the specific skincare sector, medical authority surpasses all others. Google's algorithms and OpenAI's safety guidelines classify skincare advice in the "YMYL" (Your Money, Your Life) category, requiring a higher level of proof.
Brands like CeraVe, La Roche-Posay, or EltaMD dominate AI recommendations for acne or eczema because their knowledge graph is saturated with connections to medical terms and doctor citations.
Conversely, "entertainment" influencers are losing ground to scientific "Skinfluencers" like Lab Muffin Beauty Science or Dr. Dray. These content creators, often with chemistry or dermatology degrees, produce dense, factual, and sourced content that is "AI-friendly". Their videos and blogs are frequently used as truth sources to debunk myths (e.g., "vitamin C doesn't work"). A brand favorably cited by Dr. Dray benefits from immediate authority transfer in the algorithm's eyes.

4. Sentiment Analysis: The Emotional Heart of the Algorithm

AI's most sophisticated contribution to SEO is its ability to understand sentiment beyond simple binary metrics (Like/Dislike). Semantic sentiment analysis has become an invisible but determining ranking factor.

4.1 From Star Rating to Semantic Nuance

Traditionally, a product with a 4.8/5 star average was considered superior. For AI, this figure is insufficient. Current models ingest the full text of reviews to extract aspect-based sentiments.
For example, for a Laura Mercier setting powder, AI doesn't just retain "it's a good product". It identifies semantic clusters: "excellent oil control", "can cake on dry areas", "high price", "invisible finish".
This granularity allows AI to answer very precise queries: "Setting powder for dry skin that doesn't cake". If reviews, even globally positive, often mention "cakes on dry areas", the product will be excluded from this specific answer. This is the triumph of relevance over raw popularity.

4.2 Polarity and Contradiction Management

The cosmetics sector is rich in polarizing products (e.g., Retinol, which is effective but irritating). How does AI handle these contradictions?
Computer science research on LLMs shows that models attempt to resolve information conflicts by evaluating relative source credibility and information recency.
If the brand's site promises "zero irritation" but 30% of recent Reddit reviews mention "burning" and "redness", AI will penalize the brand for this dissonance. The model may either display a warning ("Although the brand claims absolute gentleness, many users report irritation"), or simply downrank the product for lack of reliability.
Alignment between the marketing promise (Brand Voice) and consumer reality (Consumer Voice) thus becomes a technical imperative for GEO. "Marketing BS" is algorithmically filtered.

Competitive Landscape Analysis

Relative positioning by Volume and Sentiment

Positive
Neutral
Negative
0
20
40
60
80
100
0
20
40
60
80
100

Niche

(Loved / Low Vol)

Leader

(High Vol / Sent)

Risk

(Viral / Negative)

Sentiment Score (0-100)
Mention Volume (Index 0-100)

This chart cross-references conversation volume (X-Axis) with sentiment quality (Y-Axis). Bubble size represents AI Visibility. Note how "Brand C" (red) has high volume but negative sentiment, which hurts its recommendation by algorithms.

5. Share of Model (SoM): The New Currency of Performance

While market share and share of voice have long served as the compass for marketing directors, the AI era imposes a new king metric: Share of Model (SoM). This measure quantifies a brand's presence and quality of representation within generative models.

5.1 Defining and Measuring Cosmetics SoM

Share of Model is not limited to counting how many times a brand is cited. It evaluates the probability that a brand will be recommended as a solution to a given problem.
The conceptual SoM formula could be written as:
SoM=Relevant Brand MentionsTotal Category Mentions×Sentiment Score×Source AuthoritySoM = \frac{\text{Relevant Brand Mentions}}{\text{Total Category Mentions}} \times \text{Sentiment Score} \times \text{Source Authority}
For a cosmetics brand, measuring SoM involves simulating thousands of user prompts (e.g., "oily skin routine", "best waterproof mascara", "clean retinol alternative") and analyzing outputs from different AIs (ChatGPT, Gemini, Perplexity).
It's a dynamic metric. Unlike a Google index that can remain stable for weeks, an LLM's response can vary by context, location, and model updates. Analyses show significant volatility: about 57% of brands disappear and reappear from one session to another, highlighting the importance of multichannel presence to stabilize visibility.

5.2 Case Study: Gentle Monster and Semantic Optimization

The example of luxury eyewear brand Gentle Monster (though in accessories, the logic applies perfectly to beauty) is instructive. Using "Share of Model" analysis tools, the brand identified how AI perceived its products (avant-garde, artistic, but sometimes unclear on function).
By adjusting its Performance Max campaigns and content to align paid search themes with AI's natural semantic associations, the brand achieved a 39% increase in ROAS (Return on Ad Spend).
This demonstrates that GEO is not just an organic visibility tool, but a media efficiency lever. Understanding the AI's "brain" allows better ad targeting, as advertising platforms (Google Ads, Meta) use the same underlying semantic understanding technologies.

6. The Technical Imperative: Structured Data and Transparency

Beyond content and sentiment, GEO relies on rigorous technical infrastructure. AI is a voracious machine for structured data. The easier information is to "parse" (syntactically analyze), the more likely it is to be used.

6.1 Ingredient Transparency and "Machine-Readability"

In the beauty sector, the ingredient is the atomic unit of truth. The most successful GEO brands, like The Ordinary or COSRX, adopt radical transparency. They don't just say "contains vitamin C", they specify "10% L-Ascorbic Acid".
Why is this crucial for AI? Because users ask technical questions ("Which serum contains at least 10% vitamin C and has an acidic pH?"). If this data is locked in a promotional image (not readable by all bots) or a PDF, AI cannot answer with certainty. It will then turn to third-party databases like INCI Decoder or SkinCarisma, which structure this data.
To maximize their SoM, brands must tag their product pages with detailed Schema.org schemas, including complete ingredient lists, precautions, and target skin types.

6.2 Creator Content as Training Data

A fascinating evolution is the use of creator content (transcribed TikToks, YouTube descriptions) as direct training data for models. Multimodal AIs can "watch" a demonstration video to understand a cream's texture.
Brands must therefore consider their influencer briefs from a new angle: data. It's no longer just about asking an influencer to be "fun", but encouraging them to use precise descriptive vocabulary ("velvety finish", "quick absorption", "no greasy residue"). These keywords, spoken in the video and automatically transcribed, become product attributes associated with the brand in the AI's knowledge graph. It's a form of vocal and visual SEO.

Visibility Calculator: Share of Model (SoM)

AI Visibility Score (0-100)

72%

ChatGPT

Market leader

55%

Bing (Copilot)

Integrated with search

41%

Perplexity

Answer engine

Mention Trend (Last 6 months)

ChatGPT
Bing
Perplexity
100
75
50
25
0
Month 1Month 2Month 3Month 4Month 5Month 6

Conceptual dashboard for tracking brand visibility in LLM models.

Methodological data sources:Hallam,Semrush,Averi.ai

Share of Model Trend (Quarterly)

My Brand
Competitor A
0%10%20%24%Jan22%15%Fev21%16%Mar20%18.5%

Simulation of a GEO tracking dashboard. Note the importance of 'Share of Model' compared to competitors and the sentiment breakdown by channel.

DATA SOURCES: HALLAM, ZIGPOLL, AVENUE Z

7. Strategic Implications: The New Beauty Playbook

Facing this total reconfiguration, brands can no longer simply adapt their old SEO strategies. A complete overhaul of the digital approach is necessary.

7.1 From Keyword Optimization to Entity Management

The priority is no longer to rank a page for the keyword "best anti-wrinkle", but to ensure that the entity "Brand X" is associated with the attribute "effective anti-wrinkle" in AI's mind.
This requires an aggressive Digital PR strategy. Touch points on high-authority third-party sites must be multiplied. A mention in a New York Times feature article or validation by a dermatologist association has more GEO value than a hundred blog articles on the brand's site.

7.2 The Importance of "Zero-Party Data"

Since organic traffic to the website will decrease (AI giving the answer directly), brands must maximize the value of each visitor who actually arrives on the site. Capturing proprietary data (Zero-Party Data) via skin diagnostic quizzes, loyalty programs, or virtual try-on apps becomes vital.
These tools don't just serve to convert; they generate unique data that AI cannot find elsewhere. By exposing part of this data (e.g., anonymized skin problem trends) via reports or white papers, the brand can become an authority source cited by AIs, creating a virtuous referencing loop.

8. Conclusion and 2025 Outlook

The cosmetics industry has entered the era of Algorithmic Beauty. Visibility can no longer simply be bought; it must be earned through consistency, transparency, and external validation.
GEO sanctions mediocrity and empty marketing. It rewards products that keep their promises, because they are the only ones that survive the ruthless filter of large-scale sentiment analysis and community validation on Reddit.
For 2025, winning brands will be those that accept losing some control over their direct narrative to better influence indirect conversation. They will transform their product sheets into structured databases, their press relations into semantic co-citation campaigns, and their customer service into an intelligence source to feed algorithms.
In this world where AI is the new trust intermediary, product truth—validated by the crowd and experts—is the only asset that doesn't depreciate.

Actionable Recommendations for Brands

ActionDescriptionGEO Impact
Off-Site Sentiment AuditUse AI tools to scan Reddit and YouTube. Identify gaps between your marketing speech and user speech. Correct the product or the speech.Brand/Consumer Voice Alignment
Wiki & Data StrategyEnsure your brand has a factual and neutral presence on Wikipedia and Wikidata. Structure your product data (Schema.org) for machine readability.Machine-Readability
Expert PartnershipsInvest heavily in relationships with dermatologists and cosmetic chemist content creators. Their word is the "Gold Standard" for Google and OpenAI's YMYL algorithms.E-E-A-T & YMYL Compliance
SoM AdoptionIntegrate "Share of Model" as a key KPI in your marketing dashboards, alongside ROI and CPC.AI Performance Measurement

Sources and References

GEO is your next opportunity

Don't let AI decide your visibility. Take control with LUCID.