social-post-extractor-mcp

MCP Tool

JNHFlow21/social-post-extractor-mcp

MCP server for extracting scripts and metadata from Douyin and Xiaohongshu links with Bailian ASR, OCR, and script.md/info.json outputs

Install

$ npx loaditout add JNHFlow21/social-post-extractor-mcp

Platform-specific configuration:

.claude/settings.json

{
  "mcpServers": {
    "social-post-extractor-mcp": {
      "command": "npx",
      "args": [
        "-y",
        "social-post-extractor-mcp"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.

About

Social Post Extractor MCP

中文说明

这是一个面向内容提取场景的 MCP 服务，目标很直接：

你给它一个抖音链接
或者给它一个小红书链接
它返回结构化信息，并落盘生成脚本文件

当前支持：

抖音视频
小红书视频笔记
小红书图文笔记

默认产物：

script.md
info.json

功能总览

这个项目主要解决四件事：

统一解析抖音和小红书分享链接
自动识别视频笔记还是图文笔记
用云端模型完成视频转写、图片读字、轻量整理
统一输出适合后续内容加工的文件

项目优势

不再只支持抖音，已经扩展到小红书
小红书不仅支持视频，也支持图文笔记
输出不是一段临时文本，而是固定的 script.md + info.json
默认走百炼的轻量模型组合，成本低、速度快、配置简单
兼容旧接口，适合接到 Agent Reach 之类的 AI 工作流里

适合谁

想把短视频或图文笔记转成文字稿的内容创作者
想把 Douyin / Xiaohongshu 接入自己 AI 工作流的开发者
想让 AI 自动读取链接、提取结构化内容、继续写脚本或做分析的人

默认推荐模型

默认建议直接使用百炼：

paraformer-v2：视频语音转文字
qwen3-vl-flash：小红书图文图片读字
qwen-flash：轻量整理，只做分段、标点、明显错字修正

这套默认组合的目标不是“最强”，而是：

简单
好用
快
成本低

输出文件说明

script.md 用来给人看，也方便继续交给 AI 做后续处理。通常包含：

整理后的脚本
原始转写
小红书图文的图片文字提取结果

info.json 用来给程序或 Agent 看。通常包含：

平台
内容类型
标题
作者
发布时间
媒体地址
采用的 provider 和 model
任务状态
错误信息

如果你只是想先看效果

先看这两个 MCP 工具：

parse_social_post_info
extract_social_post_script

前者只解析信息，后者直接生成 script.md 和 info.json。

For AI Agents

AI-first MCP for extracting structured content from:

Douyin videos
Xiaohongshu video notes
Xiaohongshu image notes

Default outputs:

script.md
info.json

This repository is intended to be handed to an AI agent. Write prompts assuming the AI will read this file and perform the setup.

What This MCP Does

Input:

one Douyin share link
one Xiaohongshu share link
share text that contains one of those links

Output:

parsed metadata
raw transcript for videos
OCR text for Xiaohongshu image notes
lightly cleaned readable script
artifacts written to disk

Supported MCP tools:

parse_social_post_info
extract_social_post_script
parse_douyin_video_info
get_douyin_download_link
extract_douyin_text

Default Decision

If you are an AI agent configuring this repo, use this default stack unless the user explicitly asks for something else:

ASR_PROVIDER=bailian
`ASR_MODEL=par

Reviews

Loading reviews...

Quality Signals

Installs

Last updated23 days ago

Security: AREADME

Safety

Risk Levelmedium

Data Access

read

Network Accessnone

Details

Sourcegithub-crawl

Last commit3/27/2026

View on GitHub→

Embed Badge

[![Loaditout](https://loaditout.ai/api/badge/JNHFlow21/social-post-extractor-mcp)](https://loaditout.ai/skills/JNHFlow21/social-post-extractor-mcp)

social-post-extractor-mcp

MCP Tool

JNHFlow21/social-post-extractor-mcp

MCP server for extracting scripts and metadata from Douyin and Xiaohongshu links with Bailian ASR, OCR, and script.md/info.json outputs

Install

$ npx loaditout add JNHFlow21/social-post-extractor-mcp

Platform-specific configuration:

.claude/settings.json

{
  "mcpServers": {
    "social-post-extractor-mcp": {
      "command": "npx",
      "args": [
        "-y",
        "social-post-extractor-mcp"
      ]
    }
  }
}

Add the config above to .claude/settings.json under the mcpServers key.

About

Social Post Extractor MCP

中文说明

这是一个面向内容提取场景的 MCP 服务，目标很直接：

你给它一个抖音链接
或者给它一个小红书链接
它返回结构化信息，并落盘生成脚本文件

当前支持：

抖音视频
小红书视频笔记
小红书图文笔记

默认产物：

script.md
info.json

功能总览

这个项目主要解决四件事：

统一解析抖音和小红书分享链接
自动识别视频笔记还是图文笔记
用云端模型完成视频转写、图片读字、轻量整理
统一输出适合后续内容加工的文件

项目优势

不再只支持抖音，已经扩展到小红书
小红书不仅支持视频，也支持图文笔记
输出不是一段临时文本，而是固定的 script.md + info.json
默认走百炼的轻量模型组合，成本低、速度快、配置简单
兼容旧接口，适合接到 Agent Reach 之类的 AI 工作流里

适合谁

想把短视频或图文笔记转成文字稿的内容创作者
想把 Douyin / Xiaohongshu 接入自己 AI 工作流的开发者
想让 AI 自动读取链接、提取结构化内容、继续写脚本或做分析的人

默认推荐模型

默认建议直接使用百炼：

paraformer-v2：视频语音转文字
qwen3-vl-flash：小红书图文图片读字
qwen-flash：轻量整理，只做分段、标点、明显错字修正

这套默认组合的目标不是“最强”，而是：

简单
好用
快
成本低

输出文件说明

script.md 用来给人看，也方便继续交给 AI 做后续处理。通常包含：

整理后的脚本
原始转写
小红书图文的图片文字提取结果

info.json 用来给程序或 Agent 看。通常包含：

平台
内容类型
标题
作者
发布时间
媒体地址
采用的 provider 和 model
任务状态
错误信息

如果你只是想先看效果

先看这两个 MCP 工具：

parse_social_post_info
extract_social_post_script

前者只解析信息，后者直接生成 script.md 和 info.json。

For AI Agents

AI-first MCP for extracting structured content from:

Douyin videos
Xiaohongshu video notes
Xiaohongshu image notes

Default outputs:

script.md
info.json

This repository is intended to be handed to an AI agent. Write prompts assuming the AI will read this file and perform the setup.

What This MCP Does

Input:

one Douyin share link
one Xiaohongshu share link
share text that contains one of those links

Output:

parsed metadata
raw transcript for videos
OCR text for Xiaohongshu image notes
lightly cleaned readable script
artifacts written to disk

Supported MCP tools:

parse_social_post_info
extract_social_post_script
parse_douyin_video_info
get_douyin_download_link
extract_douyin_text

Default Decision

If you are an AI agent configuring this repo, use this default stack unless the user explicitly asks for something else:

ASR_PROVIDER=bailian
`ASR_MODEL=par

Reviews

Loading reviews...

Quality Signals

Installs

Last updated23 days ago

Security: AREADME

Safety

Risk Levelmedium

Data Access

read

Network Accessnone

Details

Sourcegithub-crawl

Last commit3/27/2026

View on GitHub→

Embed Badge

[![Loaditout](https://loaditout.ai/api/badge/JNHFlow21/social-post-extractor-mcp)](https://loaditout.ai/skills/JNHFlow21/social-post-extractor-mcp)

social-post-extractor-mcp

Install

About

Tags

Reviews

Quality Signals

Safety

Details

Embed Badge

social-post-extractor-mcp

Install

About

Tags

Reviews

Quality Signals

Safety

Details

Embed Badge