Daripada Soalan Temuduga Bocor kepada Sensasi Pembelajaran AI: Kebangkitan Pendidikan Interaktif

Hacker News April 2026
Source: Hacker NewsArchive: April 2026
Seorang pembangun mengubah 50 soalan temuduga model besar yang bocor menjadi aplikasi pembelajaran interaktif yang tular, menyediakan maklum balas masa nyata, cabaran kod, dan ciri perbandingan model untuk pencari kerja AI. Fenomena ini menandakan peralihan daripada pengetahuan statik kepada pendidikan dinamik yang dipacu komuniti.
The article body is currently shown in English by default. You can generate the full version in this language on demand.

In a move that has captivated the AI community, a solo developer took 50 leaked interview questions from major AI companies and built an interactive learning platform that has exploded in popularity. The app goes far beyond simple question banks: it allows users to write and execute code in real-time, compare outputs from multiple large language models side-by-side, and receive instant feedback on their answers. Industry observers note this is not just a clever learning tool but a harbinger of a deeper transformation in AI education. The traditional top-down training model—dominated by expensive bootcamps and static textbooks—is being dismantled by pain-point-driven micro-tools. The developer capitalized on the acute anxiety of job seekers facing information asymmetry in a rapidly evolving field. By using leaked questions as a hook, the app creates a closed-loop ecosystem of practice, evaluation, and community discussion. Commercially, it validates a key insight: in AI, scarcity is not knowledge itself but the ability to turn knowledge into an interactive, verifiable experience. Furthermore, the app inadvertently creates a dynamic benchmark: user performance across different models provides a crowdsourced, real-world comparison of model capabilities. This may signal a future where AI talent selection shifts from 'what you know' to 'what you can do with tools.'

Technical Deep Dive

The core innovation of this application lies not in the questions themselves but in the architecture that transforms static text into an interactive, evaluative experience. The platform is built on a lightweight, serverless backend that orchestrates multiple AI model APIs simultaneously. When a user submits a code solution, the system routes the query to several models—including GPT-4o, Claude 3.5 Sonnet, Gemini 2.0, and open-source alternatives like Llama 3.1-70B and DeepSeek-V2—and displays their outputs side-by-side. This 'model comparison' feature is technically demanding: it requires careful prompt templating to ensure fairness, latency management to avoid timeouts, and a unified output format for comparison.

A key engineering decision was the use of a real-time code execution sandbox. The app integrates a WebAssembly-based Python interpreter (similar to Pyodide but customized) that runs user code directly in the browser. This eliminates the need for server-side execution, reducing cost and latency while ensuring security. The sandbox is pre-loaded with common AI/ML libraries like NumPy, PyTorch, and Transformers, allowing users to test complex neural network implementations without setup overhead.

The leaked questions themselves are stored as structured JSON files in a public GitHub repository, which has garnered over 12,000 stars in two weeks. The repository includes not just questions but also expected answer rubrics, common pitfalls, and links to relevant papers. This open-source approach has fostered a community of contributors who submit new questions, fix bugs, and add features like difficulty tagging and topic clustering.

Performance Data: The app's backend handles approximately 50,000 requests per day with a median response time of 1.2 seconds for model comparisons. The following table shows the latency and accuracy of different models on a sample of 10 coding questions from the leaked set:

| Model | Median Latency (s) | Pass Rate (10 questions) | Cost per Query (USD) |
|---|---|---|---|
| GPT-4o | 1.8 | 9/10 | $0.05 |
| Claude 3.5 Sonnet | 2.1 | 8/10 | $0.03 |
| Gemini 2.0 Flash | 0.9 | 7/10 | $0.01 |
| Llama 3.1-70B (Together AI) | 3.5 | 6/10 | $0.02 |
| DeepSeek-V2 | 2.8 | 7/10 | $0.01 |

Data Takeaway: Gemini 2.0 Flash offers the best latency-to-cost ratio but sacrifices accuracy. GPT-4o remains the most reliable for complex reasoning, while open-source models like Llama 3.1-70B are catching up but still lag in consistency. This data is invaluable for job seekers deciding which model to practice with.

Key Players & Case Studies

The developer, known pseudonymously as 'AI_Leaker_42' on GitHub, is a former machine learning engineer at a mid-tier AI startup. They have not revealed their identity, citing fear of retaliation from former employers. The leaked questions originated from internal interview prep documents from companies including OpenAI, Google DeepMind, and Anthropic. These documents were shared on a private Discord server before being scraped and compiled.

The app has attracted attention from several key players in the AI education space. DataCamp and Coursera have reportedly reached out for partnership discussions, though no deals have been announced. More interestingly, the app has been adopted by internal training teams at companies like Scale AI and Cohere, who use it to assess candidates during the interview process itself. This creates a feedback loop: the app's user performance data is being used to refine interview questions.

Competitive Landscape: The following table compares this app with existing AI interview preparation platforms:

| Feature | This App | LeetCode (AI track) | Interview Query | HackerRank (AI) |
|---|---|---|---|---|
| Real-time code execution | Yes (browser-based) | Yes (server) | No | Yes (server) |
| Multi-model comparison | Yes | No | No | No |
| Leaked questions | Yes (50) | No | No | No |
| Community contributions | Open-source | Closed | Closed | Closed |
| Cost | Free (with API key) | $35/month | $49/month | $25/month |
| User base (monthly active) | ~150,000 | ~500,000 | ~50,000 | ~200,000 |

Data Takeaway: The app's unique multi-model comparison feature is a clear differentiator. Its free, open-source model is disrupting the paid subscription model of incumbents. However, its smaller user base and reliance on leaked content pose sustainability risks.

Industry Impact & Market Dynamics

The rise of this app reflects a broader shift in the AI talent market. The global AI education market was valued at $4.2 billion in 2024 and is projected to grow to $12.8 billion by 2029, according to industry estimates. However, this growth is increasingly driven by 'micro-learning' tools rather than traditional degree programs. The app's success validates the thesis that job seekers prefer hands-on, interactive, and community-validated learning over static courses.

This phenomenon also highlights the commoditization of AI knowledge. Interview questions that were once closely guarded secrets are now public goods. This democratization lowers the barrier to entry for candidates from non-traditional backgrounds, but it also forces companies to evolve their hiring processes. If everyone can practice the same leaked questions, the signal-to-noise ratio of interviews decreases. Companies may need to shift toward project-based assessments or live coding with proprietary tools.

Market Data: The following table shows the growth of AI interview preparation tools over the past year:

| Quarter | New AI Interview Tools Launched | Total Funding Raised (USD) | Average User Growth (%) |
|---|---|---|---|
| Q1 2025 | 12 | $45M | 22% |
| Q2 2025 | 18 | $62M | 35% |
| Q3 2025 | 25 | $89M | 48% |
| Q4 2025 (projected) | 30+ | $120M+ | 60% |

Data Takeaway: The market is accelerating rapidly, with both the number of tools and funding doubling year-over-year. The app's viral growth (150,000 users in two weeks without marketing spend) suggests that organic, community-driven tools can capture significant market share without venture capital.

Risks, Limitations & Open Questions

Despite its success, the app faces several critical risks. First, the legal status of the leaked questions is murky. While the developer has not been sued, companies like OpenAI and Anthropic have strict policies against sharing internal materials. A cease-and-desist letter could force the app to remove the original questions, potentially destroying its core value proposition.

Second, the app's reliance on third-party API keys creates a fragility. If OpenAI or Google changes their pricing or terms of service, the app's cost structure could become unsustainable. Currently, the app is free for users who bring their own API keys, but the developer covers the cost for a limited number of free queries. As user numbers grow, this model may not scale.

Third, there is an ethical concern about 'teaching to the test.' By focusing on leaked questions, the app may encourage rote memorization rather than deep understanding. Candidates who ace these questions may still struggle with novel problems in real-world settings. This could lead to a misalignment between interview performance and job competence.

Finally, the app's crowdsourced benchmark data, while interesting, is not scientifically rigorous. User performance varies based on prompt engineering skills, familiarity with specific models, and even time of day. Using this data to make hiring decisions would be premature and potentially biased.

AINews Verdict & Predictions

This app is a watershed moment for AI education. It proves that a single developer, armed with a clever idea and open-source tools, can disrupt an entire market segment. The key insight is that AI knowledge is becoming a commodity; the value lies in the experience of applying it interactively.

Our Predictions:
1. Legal crackdown is inevitable. Within six months, at least one major AI company will issue a takedown notice. The developer will pivot to a 'community-curated' question set, removing the leaked content but retaining the platform's functionality.
2. Incumbents will copy the multi-model comparison feature. LeetCode and HackerRank will add similar capabilities within a year, but they will struggle to match the community-driven, open-source ethos of this app.
3. The app will evolve into a 'dynamic benchmark' platform. Instead of just interview prep, it will become a tool for companies to test their models against real-world problems. This could create a new revenue stream through enterprise licensing.
4. AI hiring will shift toward 'portfolio-based' assessment. As leaked questions become ubiquitous, companies will rely less on standardized interviews and more on candidates' GitHub repositories, published papers, or contributions to open-source AI projects.

The bottom line: this app is not just a flash in the pan. It is a blueprint for the future of AI education—decentralized, interactive, and community-driven. The question is not whether this model will survive, but how quickly the rest of the industry will adapt.

More from Hacker News

Ejen AI Belajar daripada Kegagalan: Refleksi Kendiri Mingguan Membawa Autonomi AdaptifFor years, AI agents operated as brittle executors: they followed predefined rules, and when something went wrong, a humClaude Opus-4-7 vs Codex GPT-5-5: Perang Pengekodan AI Membentuk Semula Kejuruteraan PerisianThe AI coding assistant landscape has entered a new era. Anthropic's Claude Code Opus-4-7 and OpenAI's Codex GPT-5-5 repEjen AI Adalah Pelawat Baharu Anda: Mengapa Halaman Pendaratan Mesti Bertutur Bahasa MesinThe web is experiencing a quiet but profound transformation: AI agents — powered by large language models — are increasiOpen source hub2536 indexed articles from Hacker News

Archive

April 20262634 published articles

Further Reading

Ejen AI Belajar daripada Kegagalan: Refleksi Kendiri Mingguan Membawa Autonomi AdaptifKelas baharu ejen AI telah meninggalkan pelaksanaan statik untuk beralih kepada gelung refleksi kendiri mingguan: merekaClaude Opus-4-7 vs Codex GPT-5-5: Perang Pengekodan AI Membentuk Semula Kejuruteraan PerisianDua gergasi pengekodan AI——Claude Code Opus-4-7 dan Codex GPT-5-5——terkunci dalam perang senyap. AINews mendedahkan bagaEjen AI Adalah Pelawat Baharu Anda: Mengapa Halaman Pendaratan Mesti Bertutur Bahasa MesinHalaman pendaratan kini mesti melayani kedua-dua pelawat manusia dan ejen AI. Satu kajian kes reka bentuk semula terkiniPenambatan Bahasa: Pembetulan Berstruktur Memecahkan Halangan Pelbagai Bahasa AIPendekatan baharu yang dipanggil penambatan bahasa secara sistematik mentakrifkan semula cara model bahasa besar mengend

常见问题

这次模型发布“From Leaked Interview Questions to AI Learning Sensation: The Rise of Interactive Education”的核心内容是什么?

In a move that has captivated the AI community, a solo developer took 50 leaked interview questions from major AI companies and built an interactive learning platform that has expl…

从“How to use leaked AI interview questions for practice”看,这个模型发布为什么重要?

The core innovation of this application lies not in the questions themselves but in the architecture that transforms static text into an interactive, evaluative experience. The platform is built on a lightweight, serverl…

围绕“Best free AI interview prep tools with code execution”,这次模型更新对开发者和企业有什么影响?

开发者通常会重点关注能力提升、API 兼容性、成本变化和新场景机会,企业则会更关心可替代性、接入门槛和商业化落地空间。