{
“title”: “SerpApi Asks Court to Dismiss Reddit’s Scraping Lawsuit, Citing Copyright Misuse and Public Data Access”,
“content”: “
SerpApi, a company that provides API access to Google Search results, is urging a federal court to throw out a lawsuit filed by Reddit. The core of SerpApi’s argument is that Reddit is attempting to misuse copyright law and the Digital Millennium Copyright Act (DMCA) to restrict access to publicly available information and control content posted by its users. This legal maneuver comes after Reddit filed an amended complaint in February, which SerpApi argues still fails to meet the necessary legal thresholds for a valid claim.
\n\n
The Heart of the Dispute: Data Ownership and Access
\n\n
At the crux of this legal battle lies a fundamental disagreement over who owns and controls content that appears in search engine results. Reddit’s lawsuit, initially filed in October 2023, alleges that SerpApi and other data scraping services unlawfully accessed and reused Reddit content at scale by scraping it through Google Search. However, SerpApi, through its CEO Julien Khaleghy, has countered these claims, asserting that Reddit’s legal arguments are flawed and that the company is essentially trying to prevent legitimate access to public data.
\n\n
SerpApi’s motion to dismiss hinges on several key points:
\n\n
- \n
- Lack of Copyright Ownership: SerpApi contends that Reddit does not hold exclusive copyright ownership over the vast majority of the content at issue. According to SerpApi, Reddit’s own user agreement states that users retain ownership of their posts. Reddit, in this view, only possesses a non-exclusive license to use that content.
- Non-Copyrightable Content: The company argues that many of the specific examples cited in Reddit’s complaint – such as dates, addresses, or short text fragments – are not eligible for copyright protection. These are often considered factual information or de minimis snippets that fall outside the scope of copyright law.
- Accessing Public Search Results, Not Reddit Directly: A critical distinction made by SerpApi is that it accesses and processes information from Google Search results pages, not directly from Reddit’s servers or internal systems. This means, in their view, they are not circumventing any direct technical barriers Reddit might have in place on its own platform.
\n
\n
\n
\n\n
Khaleghy, in a public statement, emphasized that SerpApi retrieves the same information that any user could see by performing a search on Google. This position challenges the notion that such access constitutes an illegal act.
\n\n
Navigating the DMCA and ‘Circumvention’ Claims
\n\n
A significant part of Reddit’s legal strategy involves invoking the Digital Millennium Copyright Act (DMCA). Reddit claims that SerpApi violated the DMCA by circumventing technological measures that control access to copyrighted works. However, SerpApi vehemently disputes this characterization.
\n\n
SerpApi’s defense against the DMCA claims rests on the assertion that they do not engage in any form of illegal circumvention. Their argument includes:
\n\n
- \n
- No Breaking Encryption or Bypassing Authentication: SerpApi states that its methods do not involve breaking encryption codes or bypassing any login or authentication systems. They are accessing publicly visible web pages.
- Public Webpages Are Not ‘Circumvention’: The company argues that accessing publicly available webpages, even if those pages are generated dynamically by a search engine, does not constitute ‘circumvention’ under the DMCA. The DMCA is typically intended to prevent the bypassing of security measures designed to protect copyrighted material.
- Reddit’s Own Policies Contradict Claims: SerpApi points to Reddit’s own privacy policy and terms of service, which they argue acknowledge that public posts may appear in search engine results. This, they suggest, undermines Reddit’s claim that such visibility is an unauthorized access that violates the DMCA.
\n
\n
\n
\n\n
Essentially, SerpApi is arguing that Reddit is attempting to enforce copyright protections on content and access methods that it does not exclusively control and that are made available through public search engines, a practice that has long been a staple for various data analysis and SEO tools.
\n\n
Broader Implications for Data Scraping and AI
\n\n
This legal dispute between Reddit and SerpApi is not an isolated incident. It is part of a growing wave of litigation and debate surrounding the scraping of web content, particularly in the context of training artificial intelligence (AI) models. Companies are increasingly seeking to leverage vast amounts of data from the internet, and the legality and ethics of scraping this data are being fiercely contested.
\n\n
The legal fights have intensified in recent months, with several key developments:
\n\n
- \n
- October 22, 2023: Reddit files its initial lawsuit against SerpApi, Perplexity, Oxylabs, and AWMProxy, alleging systematic scraping of

Leave a Comment