Extend Claude with Hugging Face Spaces

Extend Claude with Hugging Face Spaces

What can I do with this?

Landscape Image from Shou Xin model

Example from Shou Xin Image Generator

This MCP Server plugin gives Claude Desktop new abilities including:

  • Generating Images
  • Transcribe Audio (turn speech in to text)
  • Produce Sound Effects and Speech
  • Have Claude Chat with other AIs
  • Use advanced reasoning Vision techniques
  • Recommend pet dogs using specialised AI
  • and more…

It does this by calling on other AI Models hosted on Hugging Face Spaces - the hub for Open Source AI Model Research.

Take a look at this X Thread, or this Reddit Post for some examples of usage.

What is this?

Claude combining Search and Image Generation

Search and 3D Model Generation

It’s an MCP Server - software that lets Claude (or other AI’s) dynamically run tools to perform actions on your computer, or interact with other services.

Claude can combine results from different MCP servers. This example shows Claude searching the Web and generating a 3D Model from the search result.

How do I work this?

To begin, install Claude Desktop and NodeJS.

Then follow the install instructions from the GitHub page.

Recommened Spaces and Models

Below are some Hugging Face Spaces that have been tested to work well with Claude.

Simply add the Space Name to your mcp-hfspace configuration and use!

Space Name Type Space Link
shuttleai/shuttle-3.1-aesthetic Image Generation Link
black-forest-labs/FLUX.1-schnell Image Generation Link
yanze/PuLID-FLUX Image Manipulation Link
Inspyrenet-Rembg Background Removal Link
diyism/Datou1111-shou_xin Image Generation Link
stabilityai/stable-diffusion-3.5-large-turbo Image Generation Link
dbaranchuk/Switti Image Generation Link
fantaxy/Sound-AI-SFX Sound Effects Generation Link
parler-tts/parler_tts Speech Generation Link
styletts2/styletts2 Speech Generation Link
hf-audio/whisper-large-v3-turbo Audio Transcription Link
haoheliu/audioldm2-text2audio-text2music Music Generation Link
microsoft/OmniParser Computer Vision (GUI) Link
merve/paligemma2-vqav2 Computer Vision (General) Link
Qwen/QVQ-72B-preview Computer Vision (Reasoning) Link
DawnC/PawMatchAI Computer Vision (Dog Identification) Link
DawnC/PawMatchAI/on_find_match_click Text (Dog Recommendation) Link
Qwen/Qwen2.5-72B-Instruct Qwen 2.5 Chat LLM Link
prithivMLmods/Mistral-7B-Instruct-v0.3 Mistral Chat LLM Link
huggingchat/document-parser Document Parsing Link