Perplexity, which offers an AI search product that it calls an “answer engine,” is a buzzy AI startup embroiled in scandal following accusations that it rips off content, doesn’t respect robots.txt files, and even plagiarizes articles.
The company, which has already received funding from the likes of Jeff Bezos and is in talks to raise hundreds of millions of dollars more, advertises on its website that “every answer” is “backed by citations from trusted news outlets, academic papers, and established blogs.”
However, plagiarism and paywall problems have made Perplexity a lightning rod for media industry frustrations as it attempts to overtake Google for the future of search on the internet.
Here’s our coverage of the ongoing developments.
Perplexity partners with Tripadvisor to source hotel info from real people
The AI search engine Perplexity is launching an integration with Tripadvisor that will add more information about hotels. Now, when you search for places to stay, Perplexity will present you with a neatly organized list of hotels, alongside summaries of why it chose them using information sourced from Tripadvisor.
In an example shared by Perplexity, a search for “hotels in Madrid for a business trip” yields a result for Hotel Regina, which the search engine says you should choose “if you want a centrally located hotel in Madrid with exceptional service and a rich breakfast offering.” It also displays its ratings and images from Tripadvisor as well as a list of perks, like “location,” “service,” and “cleanliness.”
Perplexity’s AI search engine can now buy products for you
Perplexity is rolling out a new feature that will let Pro subscribers purchase a product without leaving its AI search engine. When searching for a product using Perplexity, Pro members based in the US can now choose a “Buy with Pro” button that will automatically order the product using saved shipping and billing information.
Perplexity says all products purchased through Buy with Pro come with free shipping. For products that don’t support Buy with Pro, Perplexity will redirect users to the merchant’s website to complete their purchase.
Perplexity blasts media as ‘adversarial’ in response to copyright lawsuit
AI startup Perplexity, which offers an AI search engine, published a blog post today pushing back on News Corp’s lawsuit against the company.
Perplexity has recently come under significant scrutiny following accusations that it scraped content without permission, and News Corp, which is the parent company of the New York Post and The Wall Street Journal-owner Dow Jones, alleged that Perplexity’s search engine “copies on a massive scale.”
News Corp sues Perplexity for ripping off WSJ and New York Post
News Corp, the parent company of media outlets like The Wall Street Journal and the New York Post, is suing the AI search engine Perplexity for infringing copyrighted content. In a lawsuit filed on Monday, News Corp alleges Perplexity copies news articles, analyses, and opinions “on a massive scale.”
Perplexity is an AI startup that trains its AI search models using content from around the web, allowing it to respond to user queries with a summary of its sources. As outlined in the lawsuit, Perplexity bills itself as a platform that lets users “skip the links” to online articles, which News Corp alleges drives “customers and critical revenues away from those copyright holders.”
The New York Times warns AI search engine Perplexity to stop using its content
The New York Times has demanded that AI search engine startup Perplexity stop using content from its site in a cease and desist letter sent to the company, reports The Wall Street Journal. The Times, which is currently suing OpenAI and Microsoft over allegedly illegally training models on its content, says the startup has been using its content without permission, a claim made earlier this year by Forbes and Condé Nast.
The Journal included this passage from the letter:
- Cloudflare is offering to block crawlers scraping information for AI bots.
Tech giants are rewriting the rules on web scraping, blaming unnamed third parties for disregarding robots.txt, and seemingly claiming the right to reuse anything posted anywhere for AI.
Now, Cloudflare is telling customers on its CDN that it can find and block AI bots that try to get around the rules.
The upshot of this globally aggregated data is that we can immediately detect new scraping tools and their behavior without needing to manually fingerprint the bot, ensuring that customers stay protected from the newest waves of bot activity.
Perplexity’s ‘Pro Search’ AI upgrade makes it better at math and research
Perplexity has launched a major upgrade to its Pro Search AI tool, which it says “understands when a question requires planning, works through goals step-by-step, and synthesizes in-depth answers with greater efficiency.”
Examples on Perplexity’s website of what Pro Search can do include a query asking the best time to see the northern lights in Iceland or Finland. It breaks down its research process into three searches: the best times to see the northern lights in Iceland and Finland; the top viewing locations in Iceland; and the top viewing locations in Finland. It then provides a detailed answer addressing all aspects of the question, including when to view the northern lights in either country and where.
Perplexity’s grand theft AI
What, exactly, is Perplexity’s innovation?
In every hype cycle, certain patterns of deceit emerge. In the last crypto boom, it was “ponzinomics” and “rug pulls.” In self-driving cars, it was “just five years away!” In AI, it’s seeing just how much unethical shit you can get away with.
Perplexity, which is in ongoing talks to raise hundreds of millions of dollars, is trying to create a Google Search competitor. Perplexity isn’t trying to create a “search engine,” though — it wants to create an “answer engine.” The idea is that instead of combing through a bunch of results to answer your own question with a primary source, you’ll simply get an answer Perplexity has found for you. “Factfulness and accuracy is what we care about,” Perplexity CEO Aravind Srinivas told The Verge.
- AI is eating its own tail, Perplexity edition.
Uh oh!
In multiple scenarios, Perplexity relied on AI-generated blog posts, among other seemingly authentic sources, to provide health information. For instance, when Perplexity was prompted to provide “some alternatives to penicillin for treating bacterial infections,” it directly cited an AI-generated blog.
Reddit escalates its fight against AI bots
In the coming weeks, Reddit will start blocking most automated bots from accessing its public data. You’ll need to make a licensing deal, like Google and OpenAI have done, to use Reddit content for model training and other commercial purposes.
While this has technically been Reddit’s policy already, the company is now enforcing it by updating its robots.txt file, a core part of the web that dictates how web crawlers are allowed to access a site. “It’s a signal to those who don’t have an agreement with us that they shouldn’t be accessing Reddit data,” the company’s chief legal officer, Ben Lee, tells me. “It’s also a signal to bad actors that the word ‘allow’ in robots.txt doesn’t mean, and has never meant, that they can use the data however they want.”
- Perplexity CEO’s answers are weak.
Fast Company asked him why his AI search engine is ripping content from paywalled news outlets like Wired, and... hoo boy. He attempted to shift blame to “third-party web crawlers,” refused to identify which ones, said it was too “complicated” to just stop doing that, and suggested it’s not technically illegal to ignore robots.txt. Sure.
- Plagiarism machine plagiarizes article about its plagiarism.
Wired, June 19th: “Perplexity Is a Bullshit Machine.”
These links are paywalled, but that’s part of the point: it’s subscription journalism. Wired even blocks Perplexity in its robots.txt file, yet Perplexity is scraping stories anyhow. Might not be the only one, but that’s no excuse.
- Perplexity continues to piss off publishers.
Wired and Robb Knight, a developer at MacStories, found that the AI search engine seems to ignore requests not to scrape their websites. They both blocked Perplexity in their robots.txt file — a standard instruction document for web crawlers — and found that Perplexity still managed to access their content. They’re not the only ones annoyed.
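For readers who have not seen one, here is a rough sketch of what such a block looks like and how a well-behaved crawler would interpret it, using Python's standard `urllib.robotparser` module. The `PerplexityBot` user-agent token, the `SomeOtherBot` name, and the `example.com` URL are assumptions made for illustration and do not come from the Wired or MacStories reports.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt. The "PerplexityBot" token is an assumed crawler
# name used for illustration, not a rule quoted from any publisher's file.
ROBOTS_TXT = """\
User-agent: PerplexityBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())  # parse rules from text, no network call
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/story"  # placeholder URL
    print("PerplexityBot allowed:", is_allowed(ROBOTS_TXT, "PerplexityBot", url))
    print("SomeOtherBot allowed:", is_allowed(ROBOTS_TXT, "SomeOtherBot", url))
```

Note that this check only tells a crawler what the site has asked for. Nothing in robots.txt technically stops a bot that chooses to ignore it, which is the behavior the publishers are complaining about.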
Perplexity will research and write reports
AI search platform Perplexity is launching a new feature called Pages that will generate a customizable webpage based on user prompts. The new feature feels like a one-stop shop for making a school report since Perplexity does the research and writing for you.
Pages taps Perplexity’s AI search models to find information and then creates what I can loosely call a research presentation that can be published and shared with others. In a blog post, Perplexity says it designed Pages to help educators, researchers, and “hobbyists” share their knowledge.
Perplexity is ready to take on Google
It’s hard to have a conversation about AI startups these days without Perplexity coming up.
Nvidia CEO Jensen Huang professes to use the AI search engine “almost every day,” Shopify CEO Tobi Lütke says it has replaced Google for him, and I’ve heard Mark Zuckerberg is also a user. I’ve been testing Perplexity in place of Google for the past couple of months and have found it to be better for some searches, like ones where I’m looking for a very specific answer. But I’m not ready to completely switch.