The rise of large language models (LLMs) has fundamentally shifted how we access information online, giving birth to a new generation of search tools: generative engines (GEs). These innovative systems move beyond simply listing websites, instead synthesizing information from multiple sources to generate comprehensive and personalized answers to user queries. While generative engines like BingChat, Google’s SGE, and perplexity.ai offer clear benefits for users and developers, this paradigm shift presents a significant challenge for website and content creators who risk losing visibility and organic traffic.

The core issue is that generative engines operate as black boxes, making it difficult for creators to understand and influence how their content is utilized and displayed. Recognizing this growing challenge, researchers at Cornell University have introduced GENERATIVE ENGINE OPTIMIZATION (GEO), a creator-centric framework designed to enhance the visibility of web content within GE responses. GEO provides a black-box optimization approach, enabling content creators to tailor their websites to increase the likelihood of being cited and highlighted by these emerging search technologies. As Figure 1 illustrates, the application of GEO can significantly improve a website’s presence in a generative engine’s response.
The Evolution from Traditional Search to Generative Engines
Three decades ago, traditional search engines revolutionized information access. However, their utility was primarily limited to providing a ranked list of potentially relevant websites. The recent advancements in large language models have paved the way for generative engines, which combine the power of conventional search with the nuanced understanding and generation capabilities of AI. As depicted in Figure 2, generative engines typically function by first retrieving relevant documents based on a user’s query and then employing large neural models to synthesize a response grounded in these sources, often with inline citations for verification.
The Visibility Challenge for Content Creators
The advantages of generative engines for users (faster, more accurate information retrieval) and developers (increased user satisfaction and potential revenue) are clear. However, generative engines disrupt the traditional online ecosystem by often eliminating the need for users to click through to individual websites. By providing direct, synthesized answers, GEs can lead to a substantial decrease in organic traffic, severely impacting the visibility of content creators who rely on online presence for their livelihood. The opaque nature of these systems further compounds the problem, leaving creators with little control over how their work is presented.
Introducing Generative Engine Optimization (GEO)
To address this critical challenge, this work introduces GENERATIVE ENGINE OPTIMIZATION (GEO), a black-box framework empowering content creators to adapt to this new search landscape with greater confidence. GEO analyzes a source website and generates an optimized version by adjusting its presentation, text style, and content to improve its chances of being favorably included in generative engine responses.
Defining and Measuring Visibility in Generative Engines

Unlike traditional search engines where visibility is often measured by average ranking on a results page, generative engines present a more complex landscape. Their responses are rich and structured, embedding website citations in various lengths, positions, and styles. This necessitates new, tailored visibility metrics that go beyond simple rankings. Cornell’s GEO framework proposes a comprehensive set of such metrics, considering factors like the relevance and influence of a citation to the query, evaluated through both objective and subjective measures. The framework also allows content creators to define their own customized visibility metrics relevant to their specific goals.
Figure 2: Overview of Generative Engines. Generative Engines primarily consist of a set of generative models and a search engine to retrieve relevant documents. Generative Engines take user queries as input and through a series of steps generate a final response that is grounded in the retrieved sources with inline attributions through the response.
GEO in Action: Demonstrating Significant Visibility Gains
Through rigorous evaluation, Cornell University demonstrated the effectiveness of their GEO methods, achieving up to a 40% boost in visibility across a diverse range of queries. Their findings highlight specific strategies that significantly improve source visibility, such as incorporating direct citations, relevant quotations, and supporting statistics. These results underscore the potential of GEO in providing content creators with actionable techniques to enhance their presence in the rapidly evolving world of generative search.
Conclusion
The emergence of generative engines marks a significant turning point in how information is discovered and consumed online. While offering substantial benefits, these systems pose a unique challenge to the creator economy by potentially diminishing website traffic and visibility. The GENERATIVE ENGINE OPTIMIZATION (GEO) framework represents a crucial step towards addressing this challenge, providing content creators with the tools and strategies needed to navigate this new paradigm. By focusing on optimizing content for generative engine citations, GEO empowers creators to maintain and even enhance their visibility in the age of AI-powered search. The continued development and adoption of frameworks like GEO will be essential to ensuring a thriving and diverse online ecosystem where both users and content creators can benefit.
Sources:
https://arxiv.org/abs/2311.09735 Cornell University