Extend Claude with HF Spaces

What can I do with this?

[Image: landscape generated by the Shou Xin model. Example from Shou Xin Image Generator]

This plugin gives Claude Desktop new abilities including:

  • Create Images
  • Transcribe Audio (turn speech into text)
  • Produce Sound Effects and Speech
  • Have Claude Chat with other AIs
  • Use specialised Vision techniques
  • Recommend pet dogs using specialised AI
  • and more…

It does this by calling other AI models hosted on Hugging Face Spaces, the hub for open-source AI model research.

Take a look at this X Thread, or this Reddit Post for some examples of usage.

What is this?

[Image: Claude combining Search and Image Generation. Caption: Search and 3D Model Generation]

It’s an MCP Server: software that lets Claude (or other AIs) dynamically run tools to perform actions on your computer or interact with other services.

Claude can combine many MCP servers at once. This example shows Claude searching the Web to answer a question, and then generating a 3D Model from the search result.
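
To make "running a tool" more concrete, here is a minimal sketch of the kind of message an MCP client sends when it asks a server to run a tool. MCP is built on JSON-RPC 2.0 and uses a `tools/call` method; the tool name `generate_image` and its `prompt` argument are hypothetical placeholders chosen for illustration, not names taken from mcp-hfspace.

```python
import json


def make_tool_call(request_id, tool_name, arguments):
    """Build a JSON-RPC 2.0 request asking an MCP server to run one tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical tool name and argument, for illustration only.
request = make_tool_call(1, "generate_image", {"prompt": "a misty mountain landscape"})
print(json.dumps(request, indent=2))
```

Claude Desktop builds and sends these messages itself; you normally only need to configure which servers it should start.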

How do I work this?

To begin, install Claude Desktop and Node.js.

After that, the easiest way to install it is to use mcp-get or Smithery. If you are using NVM on Windows, use the install script here.

The source code for mcp-hfspace is available on GitHub.

Recommended Spaces and Models

Below are some interesting Spaces that work well with Claude. Add the Space Name to the mcp-hfspace configuration file to start using it.

| Space Name | Type | Space Link |
| --- | --- | --- |
| shuttleai/shuttle-3.1-aesthetic | Image Generation | Link |
| black-forest-labs/FLUX.1-schnell | Image Generation | Link |
| yanze/PuLID-FLUX | Image Manipulation | Link |
| Inspyrenet-Rembg | Background Removal | Link |
| diyism/Datou1111-shou_xin | Image Generation | Link |
| stabilityai/stable-diffusion-3.5-large-turbo | Image Generation | Link |
| dbaranchuk/Switti | Image Generation | Link |
| fantaxy/Sound-AI-SFX | Sound Effects Generation | Link |
| parler-tts/parler_tts | Speech Generation | Link |
| styletts2/styletts2 | Speech Generation | Link |
| hf-audio/whisper-large-v3-turbo | Audio Transcription | Link |
| haoheliu/audioldm2-text2audio-text2music | Music Generation | Link |
| microsoft/OmniParser | Computer Vision (GUI) | Link |
| merve/paligemma2-vqav2 | Computer Vision (General) | Link |
| Qwen/QVQ-72B-preview | Computer Vision (Reasoning) | Link |
| DawnC/PawMatchAI | Computer Vision (Dog Identification) | Link |
| DawnC/PawMatchAI/on_find_match_click | Text (Dog Recommendation) | Link |
| Qwen/Qwen2.5-72B-Instruct | Qwen 2.5 Chat LLM | Link |
| prithivMLmods/Mistral-7B-Instruct-v0.3 | Mistral Chat LLM | Link |
| huggingchat/document-parser | Document Parsing | Link |
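
As a rough illustration of adding Space names to the configuration, the sketch below generates a Claude Desktop style JSON entry that launches the server with a list of Spaces. Several details are assumptions rather than facts from this page: that the server is started with `npx`, that the npm package is named `@llmindset/mcp-hfspace`, and that Space names are passed as trailing command-line arguments. Check the mcp-hfspace source on GitHub for the actual format before relying on it.

```python
import json


def build_config(spaces, package="@llmindset/mcp-hfspace"):
    """Return a config dict with one MCP server entry listing the given Spaces.

    The package name and the use of positional arguments for Space names
    are assumptions made for this sketch.
    """
    return {
        "mcpServers": {
            "mcp-hfspace": {
                "command": "npx",
                "args": ["-y", package, *spaces],
            }
        }
    }


# Two Space names taken from the table above.
config = build_config([
    "black-forest-labs/FLUX.1-schnell",
    "hf-audio/whisper-large-v3-turbo",
])
print(json.dumps(config, indent=2))
```

The printed JSON would then be merged into the Claude Desktop configuration file, and Claude Desktop restarted so that the new tools appear.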