Fetch any URL as clean Markdown via proxy services (r.jina.ai / defuddle.md) or built-in scripts. Works with login-required pages like X/Twitter, WeCh
npx skills add https://github.com/joeseesun/qiaomu-markdown-proxy --skill qiaomu-markdown-proxyقم بتثبيت هذه المهارة باستخدام واجهة سطر الأوامر (CLI) وابدأ في استخدام سير عمل SKILL.md في مساحة عملك.
Convert any URL to clean Markdown, with built-in support for login-required pages (X/Twitter, WeChat, Feishu/Lark docs, etc.)
将任意 URL 转为干净的 Markdown,支持需要登录的页面(X/Twitter、微信公众号、飞书文档等)。
Send any URL to Claude, and it automatically fetches the full content as Markdown. Five content types have dedicated extraction:
| URL Type | Method | Why |
|---|---|---|
WeChat Articles (mp.weixin.qq.com) |
Built-in Playwright script | Anti-scraping protection requires headless browser |
Feishu/Lark Docs (feishu.cn, larksuite.com) |
Built-in Feishu API script | Requires API authentication, auto-converts to Markdown |
| YouTube | Dedicated YouTube skill | Video content has its own toolchain |
| PDF (remote or local) | Built-in PDF extraction (extract_pdf.sh) |
Three-method cascade: marker-pdf → pdftotext → pypdf |
| All other URLs | Proxy cascade via fetch.sh: r.jina.ai → defuddle.md → agent-fetch |
Free, no API key, content validation built-in |
pip install playwright beautifulsoup4 lxml
playwright install chromium
pip install marker-pdfbrew install popplerpip install pypdfnpx agent-fetch --help # No pre-install needed, npx auto-downloads
FEISHU_APP_ID and FEISHU_APP_SECRETecho $FEISHU_APP_ID # Verify configured
npx skills add joeseesun/qiaomu-markdown-proxy
Verify:
ls ~/.claude/skills/qiaomu-markdown-proxy/SKILL.md
Just send Claude a URL:
Built-in fetch_feishu.py script fetches documents via Feishu Open API and auto-converts to Markdown:
FEISHU_APP_ID and FEISHU_APP_SECRET environment variablesdocx:document:readonly permission| Issue | Solution |
|---|---|
| WeChat scraping fails | Run playwright install chromium to install browser |
| Feishu returns permission error | Check FEISHU_APP_ID and FEISHU_APP_SECRET env vars, confirm app has document read permission |
| Feishu wiki page fails | Confirm app has wiki:wiki:readonly permission |
| r.jina.ai returns empty | Auto-falls back to defuddle.md (no action needed) |
| All proxies fail | URL may have strict auth restrictions, try npx agent-fetch |
给 Claude 发一个 URL,自动抓取完整内容并转为 Markdown。支持五种内容类型的专用抓取:
| URL 类型 | 抓取方式 | 原因 |
|---|---|---|
微信公众号 (mp.weixin.qq.com) |
内置 Playwright 脚本 | 公众号有反爬,需无头浏览器 |
飞书文档 (feishu.cn/docx/, /wiki/, /docs/) |
内置飞书 API 脚本 | 需要 API 认证,自动转 Markdown |
| YouTube | 专用 YouTube skill | 视频内容有专用工具链 |
| PDF(远程 URL 或本地文件) | 内置 PDF 提取(extract_pdf.sh) |
三级 fallback:marker-pdf → pdftotext → pypdf |
| 其他所有 URL | 代理级联 fetch.sh:r.jina.ai → defuddle.md → agent-fetch |
免费、无需 API key、内置内容验证 |
pip install playwright beautifulsoup4 lxml
playwright install chromium
pip install marker-pdfbrew install popplerpip install pypdfnpx agent-fetch --help # 无需预装,npx 自动下载
FEISHU_APP_ID 和 FEISHU_APP_SECRETecho $FEISHU_APP_ID # 验证已配置
npx skills add joeseesun/qiaomu-markdown-proxy
验证:
ls ~/.claude/skills/qiaomu-markdown-proxy/SKILL.md
直接给 Claude 发 URL:
内置 fetch_feishu.py 脚本,通过飞书开放 API 抓取文档内容并自动转为 Markdown:
FEISHU_APP_ID 和 FEISHU_APP_SECRET 环境变量docx:document:readonly 权限| 问题 | 解决方法 |
|---|---|
| 公众号抓取失败 | 运行 playwright install chromium 安装浏览器 |
| 飞书文档返回权限错误 | 检查 FEISHU_APP_ID 和 FEISHU_APP_SECRET 环境变量,确认应用有文档读取权限 |
| 飞书知识库页面抓取失败 | 确认应用有 wiki:wiki:readonly 权限 |
| PDF 提取失败 | 安装任一工具:pip install marker-pdf、brew install poppler、pip install pypdf |
| r.jina.ai 返回空内容 | 自动降级到 defuddle.md(无需手动操作) |
| 所有代理都失败 | URL 可能有严格认证限制,尝试 npx agent-fetch |