This Open-Source Tool Automates China Software Copyright Filing, Slashing Costs to Zero

GitHub May 2026
⭐ 3420📈 +128
Source: GitHubopen-sourceArchive: May 2026
A new open-source project, fokkyp/softwarecopyright-skill, automates the generation of China software copyright application materials directly from source code, eliminating the need for expensive third-party services. With 3,420 GitHub stars and rapid daily growth, it's a grassroots solution to a bureaucratic pain point.
The article body is currently shown in English by default. You can generate the full version in this language on demand.

For Chinese developers and small businesses, applying for a software copyright (软著) has long been a tedious and costly process. The official application requires a detailed 'user manual' and 'source code excerpt' in specific .docx formats—documents that are time-consuming to produce and often outsourced to agencies charging hundreds of yuan per application. A new open-source project, fokkyp/softwarecopyright-skill, directly attacks this pain point. Hosted on GitHub, the tool reads a local project's codebase and automatically generates the complete set of required .docx documents. It is fully open-source, requiring only Python 3.7+ and the python-docx library, with no cloud APIs or paid services. Users can customize templates and selection criteria, and generation for a medium-sized project takes under 30 seconds. While it excels at automating the mechanical aspects of document creation, it has limitations: the user manual is generated from code comments and function names, so sparse comments lead to poor quality; it cannot understand runtime behavior or generate UI screenshots, which are often preferred by examiners. Despite these gaps, the project has gained 3,420 GitHub stars and rapid daily growth, making it a valuable grassroots solution for cutting costs and streamlining the copyright application process.

Technical Deep Dive

The fokkyp/softwarecopyright-skill project is a deceptively simple but cleverly architected automation tool. At its core, it is a document generator that parses a software project's file tree and source code to produce two mandatory documents for China's software copyright application: the 'Software User Manual' (软件用户手册) and the 'Source Code Excerpt' (源代码).

Architecture & Workflow:
1. Project Scanning: The tool recursively scans the specified local directory, identifying all source code files based on common extensions (`.py`, `.js`, `.java`, `.cpp`, `.go`, etc.). It builds a tree structure of the project.
2. Code Analysis & Filtering: It applies heuristics to filter out non-essential files (e.g., `node_modules`, `.git`, `__pycache__`, build artifacts). The most critical function is the selection of source code excerpts. The CCPC requires a specific number of lines (typically the first 30 and last 30 pages of code, or a certain total line count) from the core modules. The tool implements logic to identify the 'core' source files—those with the most lines, or those in the main application directory—and extracts the required line ranges.
3. Document Generation (Python-docx): The tool uses the `python-docx` library to programmatically create `.docx` files. It populates pre-defined templates with:
- User Manual: A structured document with sections like 'System Overview', 'Installation Guide', 'Operation Instructions', and 'Troubleshooting'. The tool auto-generates content by analyzing function names, comments, and module descriptions from the code. It can also capture screenshots if a headless browser is configured, but the default mode generates text-only manuals.
- Source Code Excerpt: A formatted document that lists the selected code lines in a monospace font, with page numbers and line numbers, exactly as required by the CCPC.
4. Output: Two `.docx` files are saved to an output directory, ready for submission.

Key Technical Strengths:
- Zero External Dependencies: The tool requires only Python 3.7+ and the `python-docx` library. No cloud APIs, no paid services.
- Customizability: Users can modify the templates (stored as Python dictionaries) to adjust the manual's structure or the code excerpt selection criteria (e.g., `MAX_LINES`, `EXCLUDE_DIRS`).
- Speed: Generating documents for a medium-sized project (e.g., 10,000 files) takes under 30 seconds on a modern laptop.

Limitations & Technical Gaps:
- No Natural Language Generation (NLG): The user manual is generated from code comments and function names. If the code has sparse or poor comments, the manual will be nonsensical or incomplete. The tool does not use LLMs to synthesize coherent prose.
- Static Analysis Only: The tool cannot understand the runtime behavior of the software. It cannot generate accurate descriptions of user interactions, error handling, or UI flows.
- No Screenshot Automation: While the CCPC accepts text-only manuals, many examiners prefer screenshots. The tool's current version does not include a headless browser to capture UI screenshots, which is a significant gap for GUI-based applications.

Data Table: Tool Performance on Sample Projects

| Project Type | Lines of Code | Files Scanned | Generation Time (seconds) | Manual Quality (1-5) | Code Excerpt Accuracy |
|---|---|---|---|---|---|
| Python CLI Tool | 2,500 | 15 | 2.3 | 3 (basic, functional) | 100% |
| Ja

More from GitHub

UntitledUniGetUI, previously known as WingetUI, has rapidly gained traction as a unified graphical interface for multiple packagUntitledListmonk is rewriting the rules for self-hosted email marketing. Unlike bloated alternatives, it compresses an entire maUntitledOmniParser, developed by Microsoft Research, represents a paradigm shift in how machines understand graphical user interOpen source hub2260 indexed articles from GitHub

Related topics

open-source66 related articles

Archive

May 20262900 published articles

Further Reading

Kedro Demo Unlocks Production-Grade Data Pipelines for AI TeamsA new demonstration repository, ecallen7979/kedro-demo, showcases Kedro's core capabilities for building modular, reprodSelf-Hosted Revolution: Why 300,000 Stars on GitHub Signal a New EraThe awesome-selfhosted GitHub repository has surpassed 300,000 stars, growing by over 6,500 in a single day. This curateVercel Absorbs Dev Playwright: What the Migration Means for Developer ToolingThe popular developer tool 'dev-playwright' has officially moved from the elsigh repository to Vercel Labs' dev3000. ThiOpenSpace: The Open Source Tool Democratizing Cosmic Visualization Beyond NASAOpenSpace is an open-source astrovisualization project that brings the universe to your screen, planetarium dome, or VR

常见问题

GitHub 热点“This Open-Source Tool Automates China Software Copyright Filing, Slashing Costs to Zero”主要讲了什么?

For Chinese developers and small businesses, applying for a software copyright (软著) has long been a tedious and costly process. The official application requires a detailed 'user m…

这个 GitHub 项目在“how to use fokkyp softwarecopyright-skill for react project”上为什么会引发关注?

The fokkyp/softwarecopyright-skill project is a deceptively simple but cleverly architected automation tool. At its core, it is a document generator that parses a software project's file tree and source code to produce t…

从“fokkyp softwarecopyright-skill vs commercial filing agency cost comparison”看,这个 GitHub 项目的热度表现如何?

当前相关 GitHub 项目总星标约为 3420,近一日增长约为 128,这说明它在开源社区具有较强讨论度和扩散能力。