๋ฉ”์ธ ์ฝ˜ํ…์ธ ๋กœ ๊ฑด๋„ˆ๋›ฐ๊ธฐAWS Startups
์ฝ˜ํ…์ธ  ์–ธ์–ด
ํ˜„์žฌ ๋ชจ๋“  ์ฝ˜ํ…์ธ ๊ฐ€ ๋ฒˆ์—ญ๋˜์ง€๋Š” ์•Š์Šต๋‹ˆ๋‹ค.

Reimagining search: Perplexity drives productivity with generative AI-powered answer engine

์ด ์ฝ˜ํ…์ธ ๋Š” ์–ด๋– ์…จ๋‚˜์š”?

Search engines have become an indispensable part of our lives. We rely on them to find answers to pressing questions, like โ€œwho invented pizza?โ€ and โ€œwhat are the origins of the Joker card?โ€ But they are also a powerful tool for conducting research, scanning the job market, accessing educational resources, and many other tasks that significantly impact our personal and professional lives.

Approximately 402.74 million terabytes of data are created every dayโ€”thatโ€™s roughly the equivalent of 137.5 billion hoursโ€™ worth of video. With so much information available to us, the ability to quickly find the right answers is essential for casual and professional users alike. But are traditional search engines still the fastest way to do that?

Some now believe that the growing influence of SEO has skewed how results are presented to usย , raised questions around trust, and created a sub-optimal experience. Together with Amazon Web Services, Perplexityย is redressing the balance. The companyโ€™s generative AI-powered solution, built on Amazon SageMaker, curates and synthesizes relevant information from trusted sources that is tailored to the userโ€™s query. Now with the release of its Enterprise Proย offering, Perplexity is helping professional teams win back time and unlock significant productivity gains.

Delivering answers not results

Over the last 20 years, weโ€™ve grown accustomed to a search experience defined by what Denis Yarats, Perplexity co-founder and CTO, refers to as โ€œan interface of ten blue links.โ€ A user enters a search query and is met with a list of links or documents from various sources. Itโ€™s then up to the user to click through to those results in the hopes of finding the information theyโ€™re looking for.

Perplexity changes that. โ€œItโ€™s the fastest way to get information on the internet,โ€ says Yarats. โ€œFundamentally, itโ€™s a tool that satisfies peopleโ€™s curiosity.โ€ Unlike traditional search engines, Perplexity provides users with a single synthesized answer to their queries, based on real-time information, and complete with citations from trusted outlets and publications. It also curates a list of relevant follow up questions if a user wants to dive deeper into a topic.ย  โ€œThis saves users a lot of time, and the answers we provide are very accurate, factually grounded, and trustworthy.โ€

15 million users in just 2 years

Perplexity was founded in 2022 by four experienced engineers: Aravind Srinivas, Denis Yarats, Johnny Ho, and Andy Konwinski. Prior to Perplexity, the quartet had been working at the edge of technological innovation and research: Srinivas worked as an AI researcher at OpenAI; Yarats was an AI research scientist at Meta; Ho had previously worked as an engineer at Quora; and Konwinski was among the founding team at Databricks.

Having initially experimented with algorithms capable of translating natural language into SQL, the team soon re-focused their efforts towards combining traditional search index with large language models. Fast-forward to today and Perplexity answers more than 250 million queries a month and is valued at over $1 billion USD.

Growth built on proven technology and trusted expertise

Perplexity has been working with AWS from day one. โ€œWhen we started out, we had very limited resources,โ€ says Yarats. To overcome these limitations, the company enrolled in the AWS Activate program, which provides startups with technical support, architecture guidance, and up to $100,000 USD in AWS credits to help them build, launch, and scale at speed. โ€œWe received a lot of compute credits that helped us to develop Perplexity at low infrastructure costs; it was an instrumental piece of our success story.โ€

He continues: โ€œPerplexity is fully based on AWS. We are using pretty much every technology that is available there, starting from DNS and web servers to GPU clusters, and storage services like Amazon Simple Storage Service (Amazon S3).โ€ As Perplexityโ€™s user base grows, AWS is helping to enable the company to scale its computational resources to meet demand on infrastructure backed by vast security services and features. When it comes to generative AI, AWS gives all customers control over their data across the entire AI lifecycle, including preparation, training, and inferencing.

โ€œGenerative AI is at our core. We use this technology not only to understand the user's questions better but also synthesize answers for them,โ€ says Yarats. โ€œAWS is a great partner for AI startups because they understand at the core how difficult it is to tackle generative AI.โ€ He continues: โ€œWe are developing at the forefront of this technology and stumble upon very difficult and novel questions that require specialized expertise. AWS provides us with that expertise.โ€

Proven methodologies and frameworks for success

The AWS Startupsย  team comprises over a thousand global expertsโ€”including ex-founders, CTOs, and investorsโ€”who understand the unique challenges faced by disruptors. Yarats explains: โ€œwe needed a trusted partner that could deliver secure, scalable and elastic infrastructureโ€”and thatโ€™s what AWS excels at. I vividly remember when we first started working with AWS, we had a Slack channel with around fifty people from their team ready to help us whenever we needed it.โ€

Marcos Boaglio, Senior Machine Learning Solutions Architect, AWS, adds: โ€œOne of our most important leadership principles is customer obsession. We start by connecting with the customer, understanding their needs and the problem they want to solve, and then we work backwards towards the solution.โ€ Yarats adds: โ€œAs a small startup you really need that kind of help as you have limited resources to tackle those problems. It's obvious that AWS cares deeply about innovation at its core, itโ€™s part of their mission.โ€

A key part of the support provided by AWS Startups is helping to ensure that disruptors like Perplexity canย innovate on secure, high-performing, resilient, and efficient infrastructure. Thatโ€™s where the AWS Well-Architected Toolโ€™s framework comes into play. โ€œAWS Well-Architected is a collection of best practices based on multiple key pillars including: Security, Cost optimization, and Operational excellence,โ€ says Boaglio. โ€œWe conduct periodic Well-Architectedย reviews with Perplexity to ensure that these best practices are being applied, uncover opportunities to reduce costs, and make technical recommendations.โ€

It's this level of support that has helped Perplexity maintain momentum and develop a professional-grade version of its toolโ€”Enterprise Pro.

Time-saving intelligence

Enterprise Pro expands on Perplexityโ€™s core capabilities and has already been adopted by the likes of Databricks, HP, Zoom, and the Cleveland Cavaliers. Itโ€™s designed to support professional users working on demanding tasks like academic research and data analysis, simulations, and code interpretation. โ€œThe average query on Perplexity is around ten words, which is much, much longer than traditional search engines,โ€ says Yarats.

โ€œOne of the key features generative AI enables is conversational search, allowing our users to follow up on their original queries, receive suggestions, and dive deeper,โ€ Yarats explains. This helps professional teams quickly find the answers theyโ€™re looking for and experience an uptick in productivity. Enterprise Pro can even provide users with helpful pointers on how to refine their queries.

For example, the Cleveland Cavaliers are using Enterprise Pro to close knowledge gaps, quickly on-board new staff and stakeholders, and boost daily productivity across multiple teams. On average, this is enabling employees to claim back over ten hours every week. That time can now be spent on more pressing business priorities instead of mundane, repetitive search tasks.

Reimagining whatโ€™s possible with generative AI

Perplexity Enterprise Pro leverages leading generative AI models. That includes its own large language models, developed using Amazon SageMaker. โ€œOne of the reasons we decided to go with AWS is because they provide the most advanced technologies to enable generative AI,โ€ says Yarats. โ€œWeโ€™ve been working closely with the Amazon Bedrockย  team to understand best practices to enable security, scale, and privacy for our users. We are also one of the first customers to use Amazon SageMaker HyperPod, which we are using to train and serve our models.โ€

SageMaker HyperPodย takes the heavy lifting out of optimizing machine learning (ML) infrastructure used for training foundation models (FMs). It enables customers to automatically split training workloads across thousands of accelerators, so workloads can be processed in parallel for improved model performance. Boaglio explains: โ€œItโ€™s a platform that helps customers train and fine-tune large language models. It essentially optimizes every single thing that you use for training your models.โ€

Yarats explains: โ€œSageMaker HyperPod allows us to easily train very large and long-running jobs without having to worry about things like hardware issues.โ€ If a hardware failure occurs during training, itโ€™s automatically detected, and the faulty instance is either repaired or replaced as neededโ€”no manual intervention required. This enables startups like Perplexity to reduce the time needed to fine-tune foundation models by up to 40%.

โ€œ90% of AWS products are created as a direct result of customer feedback,โ€ says Boaglio. โ€œSageMaker HyperPod is one of the clearest examples of this. It was created to solve a specific problem, a problem Perplexity was experiencing.โ€ Placing emerging technologies in the hands of innovative startups is a key part of how AWS develops its products. โ€œSageMaker Hyperpod is an evolving product, and the Perplexity team has been very active in the whole process, helping us test and providing feedback.โ€

Frictionless customer experience that powers productivity

โ€œPerplexity is an answer engine, not a search engine,โ€ says Yarats. Together with AWS, Perplexity is reimagining how businesses find answers, research, and collaborate. Gone are the days of ten blue links. With Enterprise Pro, professionals can access the information needed to do their jobs effectively at speeds that would have previously been impossible.

Going forward, Perplexity will continue to innovate at the forefront of generative AI. โ€œWhat excites me the most about Perplexity is that the team is always experimenting,โ€ says Boaglio. โ€œWhether itโ€™s new models, or tweaking different features to make their product even faster and more accurate. Itโ€™s impressive.โ€

โ€œIโ€™m very excited about developing a much more advanced version of Perplexity, where you can not only ask questions, but also give tasks and actions for the AI to perform,โ€ says Yarats. โ€œI believe AWS is going to be the perfect partner to reach that future, because it will require a lot of expertise, computing power, and cutting-edge infrastructureโ€”Iโ€™m confident AWS can deliver that.โ€

Denis Yarats

Denis Yarats

Denis Yarats ์”จ๋Š” 2022๋…„ ์ดํ›„ Perplexity AI์˜ ๊ณต๋™ ์„ค๋ฆฝ์ž ๊ฒธ CTO๋กœ ํ™œ๋™ํ•˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค. ๊ทธ๋Š” ๋‰ด์š• ๋Œ€ํ•™๊ต์—์„œ ์ธ๊ณต ์ง€๋Šฅ ์ „๊ณต์œผ๋กœ ๋ฐ•์‚ฌ ํ•™์œ„๋ฅผ ๋ฐ›์•˜์Šต๋‹ˆ๋‹ค. Denis ์”จ๋Š” Perplexity๋ฅผ ๊ณต๋™ ์„ค๋ฆฝํ•˜๊ธฐ ์ „, 2016๋…„ 6์›”๋ถ€ํ„ฐ 2022๋…„ 7์›”๊นŒ์ง€ Facebook AI ์—ฐ๊ตฌ ํŒ€์—์„œ AI ์—ฐ๊ตฌ ๊ณผํ•™์ž๋กœ ๊ทผ๋ฌดํ–ˆ์Šต๋‹ˆ๋‹ค. 2013๋…„ 9์›”๋ถ€ํ„ฐ 2016๋…„ 6์›”๊นŒ์ง€๋Š” Quora์—์„œ ๊ธฐ๊ณ„ ํ•™์Šต ์—”์ง€๋‹ˆ์–ด๋กœ ๊ทผ๋ฌดํ–ˆ์Šต๋‹ˆ๋‹ค. ๊ทธ๋Š” ๊ธฐ๊ณ„ ํ•™์Šต ํŒ€์˜ ๊ธฐ์ˆ  ์ฑ…์ž„์ž๋กœ ์ผํ•˜๋ฉด์„œ ํ”Œ๋žซํผ์˜ AI ๊ธฐ๋Šฅ์— ํฌ๊ฒŒ ๊ธฐ์—ฌํ–ˆ์Šต๋‹ˆ๋‹ค. ๊ทธ๋Š” ๊ด‘๋ฒ”์œ„ํ•œ ๊ฒฝํ—˜๊ณผ ์ „๋ฌธ ์ง€์‹์„ ๋ฐ”ํƒ•์œผ๋กœ AI ์‚ฐ์—… ๋‚ด ์ง€์†์ ์ธ ๋ฐœ์ „์˜ ํ•ต์‹ฌ ์ฃผ์—ญ์œผ๋กœ ์ž๋ฆฌ๋งค๊น€ํ–ˆ์Šต๋‹ˆ๋‹ค.

Marcos Boaglio

Marcos Boaglio

Marcos ์”จ๋Š” ๋ฏธ๊ตญ ํ”Œ๋กœ๋ฆฌ๋‹ค์—์„œ ํ™œ๋™ํ•˜๋Š” AWS ์„ ์ž„ ๊ธฐ๊ณ„ ํ•™์Šต ์†”๋ฃจ์…˜ ์•„ํ‚คํ…ํŠธ์ž…๋‹ˆ๋‹ค. ๊ทธ๋Š” ํ•ด๋‹น ์—ญํ• ์—์„œ ๋ฏธ๊ตญ ์ƒ์„ฑํ˜• AI ์Šคํƒ€ํŠธ์—… ์กฐ์ง์˜ ํด๋ผ์šฐ๋“œ ์ „๋žต์„ ์•ˆ๋‚ดํ•˜๊ณ  ์ง€์›ํ•˜์—ฌ ๊ณ ์œ„ํ—˜ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•˜๊ณ  ๊ธฐ๊ณ„ ํ•™์Šต ์›Œํฌ๋กœ๋“œ๋ฅผ ์ตœ์ ํ™”ํ•˜๋Š” ๋ฐฉ๋ฒ•์— ๋Œ€ํ•œ ์ง€์นจ์„ ์ œ๊ณตํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋Š” ํด๋ผ์šฐ๋“œ ์†”๋ฃจ์…˜ ๊ฐœ๋ฐœ, ๊ธฐ๊ณ„ ํ•™์Šต, ์†Œํ”„ํŠธ์›จ์–ด ๊ฐœ๋ฐœ, ๋ฐ์ดํ„ฐ ์„ผํ„ฐ ์ธํ”„๋ผ๋ฅผ ๋น„๋กฏํ•œ ๊ธฐ์ˆ  ๋ถ„์•ผ์—์„œ 25๋…„ ์ด์ƒ์˜ ๊ฒฝ๋ ฅ์„ ์Œ“์•˜์Šต๋‹ˆ๋‹ค.

์ด ์ฝ˜ํ…์ธ ๋Š” ์–ด๋– ์…จ๋‚˜์š”?