Extend Claude with Hugging Face Spaces

Resources

What can I do with this?

Landscape Image from Shou Xin model — Example from Shou Xin Image Generator

This MCP Server plugin gives Claude Desktop new abilities including:

Generating Images
Transcribe Audio (turn speech in to text)
Produce Sound Effects and Speech
Have Claude Chat with other AIs
Use advanced reasoning Vision techniques
Recommend pet dogs using specialised AI
and more…

It does this by calling on other AI Models hosted on Hugging Face Spaces - the hub for Open Source AI Model Research.

Take a look at this X Thread, or this Reddit Post for some examples of usage.

What is this?

Claude combining Search and Image Generation — Search and 3D Model Generation

It’s an MCP Server - software that lets Claude (or other AI’s) dynamically run tools to perform actions on your computer, or interact with other services.

Claude can combine results from different MCP servers. This example shows Claude searching the Web and generating a 3D Model from the search result.

How do I work this?

To begin, install Claude Desktop and NodeJS.

Then follow the install instructions from the GitHub page.

Recommened Spaces and Models

Below are some Hugging Face Spaces that have been tested to work well with Claude.

Simply add the Space Name to your mcp-hfspace configuration and use!

Space Name	Type	Space Link
`shuttleai/shuttle-3.1-aesthetic`	Image Generation	Link
`black-forest-labs/FLUX.1-schnell`	Image Generation	Link
`yanze/PuLID-FLUX`	Image Manipulation	Link
`Inspyrenet-Rembg`	Background Removal	Link
`diyism/Datou1111-shou_xin`	Image Generation	Link
`stabilityai/stable-diffusion-3.5-large-turbo`	Image Generation	Link
`dbaranchuk/Switti`	Image Generation	Link
`fantaxy/Sound-AI-SFX`	Sound Effects Generation	Link
`parler-tts/parler_tts`	Speech Generation	Link
`styletts2/styletts2`	Speech Generation	Link
`hf-audio/whisper-large-v3-turbo`	Audio Transcription	Link
`haoheliu/audioldm2-text2audio-text2music`	Music Generation	Link
`microsoft/OmniParser`	Computer Vision (GUI)	Link
`merve/paligemma2-vqav2`	Computer Vision (General)	Link
`Qwen/QVQ-72B-preview`	Computer Vision (Reasoning)	Link
`DawnC/PawMatchAI`	Computer Vision (Dog Identification)	Link
`DawnC/PawMatchAI/on_find_match_click`	Text (Dog Recommendation)	Link
`Qwen/Qwen2.5-72B-Instruct`	Qwen 2.5 Chat LLM	Link
`prithivMLmods/Mistral-7B-Instruct-v0.3`	Mistral Chat LLM	Link
`huggingchat/document-parser`	Document Parsing	Link