by privetin
Dataset Viewer is an MCP server that interacts with the Hugging Face Dataset Viewer API, enabling users to browse, analyze, and manage datasets hosted on the Hugging Face Hub.
Dataset Viewer is an MCP (Multi-Cloud Platform) server designed to interact with the Hugging Face Dataset Viewer API. It enables users to browse, analyze, and manage datasets hosted on the Hugging Face Hub, providing a robust interface for data exploration and manipulation.
To use Dataset Viewer, you first need to install it. This involves cloning the repository, setting up a Python virtual environment with uv
, and installing the project in development mode. Once installed, you can configure it by setting the HUGGINGFACE_TOKEN
environment variable for private dataset access. For integration with Claude Desktop, you can add a specific configuration to your claude_desktop_config.json
file, specifying the command and arguments to run the server.
dataset://
URI scheme for accessing Hugging Face datasets.Dataset Viewer offers a suite of tools for dataset interaction:
Dataset Viewer is ideal for:
Q: What are the prerequisites for installing Dataset Viewer?
A: You need Python 3.12 or higher and uv
(a fast Python package installer and resolver).
Q: How do I access private Hugging Face datasets?
A: You need to set the HUGGINGFACE_TOKEN
environment variable with your Hugging Face API token.
Q: Can I filter dataset rows using SQL-like queries?
A: Yes, the filter
tool allows you to use SQL WHERE and ORDER BY clauses to filter dataset rows.
Q: How can I integrate Dataset Viewer with Claude Desktop?
A: You can add a specific mcpServers
configuration to your Claude Desktop config file, pointing to the Dataset Viewer server command.
An MCP server for interacting with the Hugging Face Dataset Viewer API, providing capabilities to browse and analyze datasets hosted on the Hugging Face Hub.
dataset://
URI scheme for accessing Hugging Face datasetsThe server provides the following tools:
validate
dataset
: Dataset identifier (e.g. 'stanfordnlp/imdb')auth_token
(optional): For private datasetsget_info
dataset
: Dataset identifierauth_token
(optional): For private datasetsget_rows
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split namepage
(optional): Page number (0-based)auth_token
(optional): For private datasetsget_first_rows
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split nameauth_token
(optional): For private datasetsget_statistics
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split nameauth_token
(optional): For private datasetssearch_dataset
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split namequery
: Text to search forauth_token
(optional): For private datasetsfilter
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split namewhere
: SQL WHERE clause (e.g. "score > 0.5")orderby
(optional): SQL ORDER BY clausepage
(optional): Page number (0-based)auth_token
(optional): For private datasetsget_parquet
dataset
: Dataset identifierauth_token
(optional): For private datasetsgit clone https://github.com/privetin/dataset-viewer.git
cd dataset-viewer
# Create virtual environment
uv venv
# Activate virtual environment
# On Unix:
source .venv/bin/activate
# On Windows:
.venv\Scripts\activate
# Install in development mode
uv add -e .
HUGGINGFACE_TOKEN
: Your Hugging Face API token for accessing private datasetsAdd the following to your Claude Desktop config file:
On Windows: %APPDATA%\Claude\claude_desktop_config.json
On MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
{
"mcpServers": {
"dataset-viewer": {
"command": "uv",
"args": [
"--directory",
"parent_to_repo/dataset-viewer",
"run",
"dataset-viewer"
]
}
}
}
MIT License - see LICENSE for details
Please log in to share your review and rating for this MCP.
Discover more MCP servers with similar functionality and use cases
by mckinsey
Build high-quality data visualization apps quickly with low‑code configuration, leveraging Plotly, Dash, and Pydantic while allowing deep customisation through Python, JavaScript, HTML, and CSS.
by antvis
mcp-server-chart is a Model Context Protocol (MCP) server developed by AntV that generates over 25 types of visual charts. It provides robust chart generation and data analysis capabilities, integrating with various AI clients and platforms.
by reading-plus-ai
mcp-server-data-exploration is an MCP server designed for autonomous data exploration on CSV-based datasets. It acts as a personal Data Scientist assistant, providing intelligent insights with minimal effort.
by Canner
Wren Engine is a semantic engine designed for Model Context Protocol (MCP) clients and AI agents, enabling accurate and context-aware access to enterprise data.
by GongRzhe
A Model Context Protocol (MCP) server for generating various types of charts using QuickChart.io, enabling chart creation through MCP tools.
by ergut
mcp-bigquery-server is a Model Context Protocol (MCP) server that enables Large Language Models (LLMs) to securely and efficiently interact with Google BigQuery datasets. It acts as a translator, allowing LLMs to query and analyze data in BigQuery using natural language instead of SQL.
by isaacwasserman
Provides tools for saving data tables and generating Vega‑Lite visualizations via an MCP interface, supporting both textual specifications and PNG image output.
by surendranb
Google Analytics MCP Server is a Python-based tool that enables Large Language Models (LLMs) to access and analyze Google Analytics 4 (GA4) data using natural language, providing conversational querying of over 200 GA4 dimensions and metrics.
by tinybirdco
Provides a Model Context Protocol server implementation for Tinybird, allowing analytics agents to forward data to Tinybird's platform.