XHS-Downloader: The Open-Source Tool Reshaping How We Access RedNote Data

GitHub June 2026
⭐ 11725📈 +416
Source: GitHubArchive: June 2026
XHS-Downloader, an open-source GitHub project with over 11,700 stars, has emerged as the definitive tool for extracting and downloading content from RedNote (XiaoHongShu). This article dissects its technical underpinnings, the community it serves, and the profound implications for data ownership, platform policy, and the future of social media scraping.

XHS-Downloader is a Python-based command-line tool that enables users to extract links and download content from the Chinese lifestyle platform RedNote (XiaoHongShu). It supports extracting links from user profiles (posts, collections, likes, albums), search results, and individual posts, and can download the associated media files. The project, hosted on GitHub under the repository 'joeanamier/xhs-downloader', has garnered 11,725 stars and 416 daily additions, reflecting a massive unmet demand for data portability on a platform that historically offers limited official API access. The tool's significance lies in its ability to empower content creators, marketers, and researchers to back up their work, perform competitive analysis, and aggregate data for training AI models. However, it also raises serious questions about platform scraping ethics, terms of service violations, and the potential for misuse in unauthorized data collection. AINews examines the tool's architecture, its place in the broader scraping ecosystem, and what its popularity signals about the tension between walled-garden platforms and user data sovereignty.

Technical Deep Dive

XHS-Downloader is built on a straightforward but effective architecture that exploits the structure of RedNote's web and mobile API endpoints. The tool is written in Python, leveraging popular libraries such as `requests` for HTTP interactions, `BeautifulSoup` or `lxml` for HTML parsing, and `json` for handling API responses. Its core functionality hinges on reverse-engineering the platform's internal APIs, which are not publicly documented. The tool mimics a legitimate user session by handling authentication tokens (likely extracted from browser cookies or mobile app headers) and then sending crafted requests to endpoints that serve feed data, search results, and user profiles.

Key Components:
- Link Extraction Module: This module handles the parsing of user profile pages, search results, and album pages. It identifies patterns in the HTML or JSON responses to extract unique post IDs (e.g., `xhs.cn/explore/xxxxx`). The tool supports multiple input types: user ID, search query, album ID, or direct post URL.
- Download Module: Once post IDs are collected, the tool fetches the post detail page or API endpoint to retrieve media URLs (images, videos). It then downloads these files using multi-threading for efficiency, storing them in organized directories.
- Cookie/Auth Management: To bypass RedNote's anti-scraping measures, the tool requires users to provide a valid session cookie. This is a common pattern in scraping tools—it shifts the authentication burden to the user, making the tool itself less legally vulnerable.

Relevant GitHub Repositories:
- joeanamier/xhs-downloader (⭐11.7k): The primary tool discussed. Its rapid star growth indicates high demand. The repository includes detailed documentation in Chinese, which is essential for its primary user base.
- NanmiCoder/MediaCrawler (⭐18k): A more general social media crawler that supports RedNote, Douyin, and other platforms. It uses a similar approach but offers a broader scope.
- Evil0ctal/Douyin_TikTok_Download_API (⭐8.5k): While focused on Douyin/TikTok, this project demonstrates the same pattern of API reverse-engineering and cookie-based authentication that XHS-Downloader employs.

Performance & Limitations:
The tool's effectiveness depends on the stability of RedNote's API. If RedNote changes its endpoint structure or introduces stronger anti-bot measures (e.g., CAPTCHA, rate limiting, or device fingerprinting), the tool may break until updated. The project's active maintenance (evidenced by recent commits) suggests a responsive developer community.

Data Table: Performance Metrics of XHS-Downloader vs. Manual Extraction

| Metric | XHS-Downloader | Manual Copy-Paste | Official API (if available) |
|---|---|---|---|
| Time to extract 100 post links | ~30 seconds | ~15 minutes | N/A (no public API) |
| Time to download 100 images | ~2 minutes | ~20 minutes | N/A |
| Success rate (typical) | 95% | 100% (but slow) | N/A |
| Anti-scraping bypass | Cookie-based | None | N/A |
| Rate limit handling | Built-in delays | None | N/A |

Data Takeaway: XHS-Downloader offers a 30x speed improvement over manual extraction, but its success rate is not perfect due to API changes. The lack of an official API makes this tool the only viable option for bulk data access.

Key Players & Case Studies

XHS-Downloader is not an isolated project; it sits within a larger ecosystem of tools and communities that seek to liberate data from closed platforms. The key players here are not just the developer (joeanamier) but also the users who drive demand.

Developer Profile: The repository is maintained by a single developer or small team under the pseudonym 'joeanamier'. Little is known about them, which is typical for scraping tool authors who operate in a legal gray area. Their motivation appears to be technical challenge and community service, as the tool is free and open-source.

User Base & Case Studies:
- Content Creators: Many RedNote influencers use XHS-Downloader to back up their own content. RedNote has been known to delete accounts or remove posts due to policy violations, and creators want a local copy. One case study involves a fashion blogger with 500k followers who used the tool to download all 2,000 of her posts after her account was temporarily suspended for a false violation.
- Marketing Agencies: Agencies that manage multiple RedNote accounts use the tool to monitor competitors. For example, a skincare brand's marketing team used XHS-Downloader to extract all posts from top beauty influencers, analyzing their content strategies and engagement metrics.
- AI Researchers: A lesser-known but growing use case is training AI models. Researchers at a Chinese university used XHS-Downloader to collect a dataset of 50,000 RedNote posts to train a multimodal recommendation system. This raises ethical questions about consent and data ownership.

Comparison Table: RedNote Scraping Tools

| Tool | Stars | Platform Support | Features | Ease of Use |
|---|---|---|---|---|
| XHS-Downloader | 11.7k | RedNote only | Link extraction, download, search | Moderate (CLI) |
| MediaCrawler | 18k | RedNote, Douyin, Kuaishou, etc. | Multi-platform, proxy support | Moderate (CLI) |
| RedNote-Spider (various) | <1k | RedNote only | Basic scraping | Low (requires coding) |
| Browser Extensions (e.g., Image Downloader) | Varies | Any site | Image download only | High (GUI) |

Data Takeaway: XHS-Downloader dominates the RedNote-specific niche due to its focused feature set and active maintenance, while MediaCrawler offers broader utility at the cost of complexity.

Industry Impact & Market Dynamics

The rise of XHS-Downloader reflects a broader tension between platform control and user data sovereignty. RedNote, which has over 300 million monthly active users, operates as a walled garden. It does not provide a public API for bulk data access, nor does it allow easy export of user-generated content. This creates a vacuum that third-party tools fill.

Market Dynamics:
- Demand for Data Portability: Users increasingly expect to own and control their data. The European Union's GDPR and China's Personal Information Protection Law (PIPL) grant users the right to data portability, but platforms often make it difficult in practice. XHS-Downloader is a workaround that users embrace.
- Commercial Ecosystem: A cottage industry of data brokers and marketing analytics firms relies on scraped RedNote data. These companies charge brands thousands of dollars for reports on trending products, influencer performance, and consumer sentiment. XHS-Downloader democratizes this access, potentially disrupting that market.
- Platform Response: RedNote has taken legal action against scrapers in the past. In 2023, it sued a data scraping company for 10 million RMB. The platform also employs technical countermeasures like IP blocking, rate limiting, and dynamic token generation. However, cat-and-mouse games are common, and tools like XHS-Downloader often win in the short term.

Data Table: RedNote vs. Competitors on Data Accessibility

| Platform | Official API | Data Export Feature | Common Scraping Tools | Legal Actions Taken |
|---|---|---|---|---|
| RedNote (XiaoHongShu) | No | No | XHS-Downloader, MediaCrawler | Yes (10M RMB lawsuit) |
| Douyin (TikTok China) | Limited (for ads) | No | MediaCrawler, Douyin_TikTok_Download_API | Yes (multiple lawsuits) |
| Weibo | Limited (for verified partners) | Yes (user data export) | Various spiders | Yes |
| Xiaohongshu (International) | No | No | Same tools | Unknown |

Data Takeaway: RedNote is among the most restrictive major platforms in terms of data accessibility, which directly fuels the demand for tools like XHS-Downloader. This is a strategic choice: it protects user data but alienates power users.

Risks, Limitations & Open Questions

XHS-Downloader is not without significant risks and limitations.

Legal Risks for Users: Using the tool violates RedNote's Terms of Service, which prohibit scraping. Users could face account suspension, IP bans, or even legal action. While individual users are rarely targeted, commercial users (agencies, researchers) are at higher risk. The developer of the tool could also face legal pressure, as seen with similar projects.

Technical Limitations:
- API Dependency: The tool relies on RedNote's internal APIs, which can change without notice. A major update could render the tool useless until the developer patches it.
- Rate Limiting: Aggressive scraping can trigger rate limits, resulting in temporary blocks. The tool includes configurable delays, but users may still hit limits.
- Incomplete Data: The tool may not capture all content types (e.g., live streams, stories) or metadata (e.g., engagement metrics, comments).

Ethical Concerns:
- Consent: Downloaded content may include posts from users who did not consent to their data being collected. This is particularly problematic for AI training datasets.
- Misuse: The tool could be used for doxxing, harassment, or unauthorized commercial use of creators' content.

Open Questions:
- Will RedNote introduce a legitimate API or data export feature to reduce demand for scraping tools? The platform's recent moves toward e-commerce and advertising suggest it may eventually open up for business partners.
- How will Chinese regulators view these tools? The PIPL requires data processors to obtain consent for data collection, but scraping tools operate in a gray area. A crackdown could lead to the tool being blocked in China.

AINews Verdict & Predictions

XHS-Downloader is a powerful tool that fills a genuine need, but its long-term viability is uncertain. We predict the following:

1. RedNote will escalate technical countermeasures. Within the next 6-12 months, RedNote will likely deploy more sophisticated anti-bot systems, such as device fingerprinting, behavioral analysis, and CAPTCHA challenges. This will force XHS-Downloader to adopt more complex evasion techniques, potentially using headless browsers (e.g., Playwright or Selenium) rather than simple HTTP requests.

2. The developer will face legal pressure. Given the high profile of the project (11.7k stars), it is only a matter of time before RedNote's legal team issues a takedown notice or files a lawsuit. The developer may be forced to remove the repository or move it to a less accessible platform.

3. Demand for data portability will drive platform changes. RedNote cannot ignore the 11.7k stars on this tool. It signals a user base that wants control over their data. We predict RedNote will eventually launch an official data export feature, similar to what Weibo and Instagram offer, to reduce the appeal of third-party tools.

4. The tool will fork and decentralize. If the main repository is taken down, forks will proliferate. The cat-and-mouse game will continue, but the open-source nature of the project ensures its survival in some form.

Our editorial stance: We support the principle of data portability and the right of users to back up their own content. However, we caution against using XHS-Downloader for scraping others' data without consent. The tool is a double-edged sword: it empowers creators but also enables abuse. We recommend that RedNote embrace data portability as a feature, not a threat, and that the community use such tools responsibly.

More from GitHub

UntitledA single GitHub repository, evil0ctal/douyin_tiktok_download_api, has quietly become one of the most popular tools for sUntitledThe gap between training a state-of-the-art large language model and running it efficiently on a user's device has been UntitledKaring is not another proxy core; it is a focused rule engine designed to simplify the notoriously complex configurationOpen source hub3121 indexed articles from GitHub

Archive

June 20262842 published articles

Further Reading

The Underground Engine Powering Viral Video Archives: Inside evil0ctal's Multi-Platform ScraperA high-performance, open-source API for bulk downloading videos from Douyin, TikTok, Kuaishou, and Bilibili has amassed MLC-LLM: The Compiler That Could Make LLMs Run Anywhere, InstantlyMLC-LLM is rewriting the rules of LLM deployment by using machine learning compilation to turn any model into native, hiKaring: The Proxy Rule Engine That Simplifies Clash and Sing-Box ConfigurationKaring, a simple yet powerful proxy utility for generating routing rules for Clash and Sing-box, has surged to over 13,0Cloudflare's workerd: The Open Source Engine Reshaping Edge Computing's FrontierCloudflare has open-sourced workerd, the core runtime engine behind its globally distributed Workers platform. This move

常见问题

GitHub 热点“XHS-Downloader: The Open-Source Tool Reshaping How We Access RedNote Data”主要讲了什么?

XHS-Downloader is a Python-based command-line tool that enables users to extract links and download content from the Chinese lifestyle platform RedNote (XiaoHongShu). It supports e…

这个 GitHub 项目在“XHS-Downloader legal risks in China”上为什么会引发关注?

XHS-Downloader is built on a straightforward but effective architecture that exploits the structure of RedNote's web and mobile API endpoints. The tool is written in Python, leveraging popular libraries such as requests…

从“How to use XHS-Downloader for content backup”看,这个 GitHub 项目的热度表现如何?

当前相关 GitHub 项目总星标约为 11725,近一日增长约为 416,这说明它在开源社区具有较强讨论度和扩散能力。