by privetin
Dataset Viewer is an MCP server that interacts with the Hugging Face Dataset Viewer API, enabling users to browse, analyze, and manage datasets hosted on the Hugging Face Hub.
Dataset Viewer is an MCP (Multi-Cloud Platform) server designed to interact with the Hugging Face Dataset Viewer API. It enables users to browse, analyze, and manage datasets hosted on the Hugging Face Hub, providing a robust interface for data exploration and manipulation.
To use Dataset Viewer, you first need to install it. This involves cloning the repository, setting up a Python virtual environment with uv
, and installing the project in development mode. Once installed, you can configure it by setting the HUGGINGFACE_TOKEN
environment variable for private dataset access. For integration with Claude Desktop, you can add a specific configuration to your claude_desktop_config.json
file, specifying the command and arguments to run the server.
dataset://
URI scheme for accessing Hugging Face datasets.Dataset Viewer offers a suite of tools for dataset interaction:
Dataset Viewer is ideal for:
Q: What are the prerequisites for installing Dataset Viewer?
A: You need Python 3.12 or higher and uv
(a fast Python package installer and resolver).
Q: How do I access private Hugging Face datasets?
A: You need to set the HUGGINGFACE_TOKEN
environment variable with your Hugging Face API token.
Q: Can I filter dataset rows using SQL-like queries?
A: Yes, the filter
tool allows you to use SQL WHERE and ORDER BY clauses to filter dataset rows.
Q: How can I integrate Dataset Viewer with Claude Desktop?
A: You can add a specific mcpServers
configuration to your Claude Desktop config file, pointing to the Dataset Viewer server command.
An MCP server for interacting with the Hugging Face Dataset Viewer API, providing capabilities to browse and analyze datasets hosted on the Hugging Face Hub.
dataset://
URI scheme for accessing Hugging Face datasetsThe server provides the following tools:
validate
dataset
: Dataset identifier (e.g. 'stanfordnlp/imdb')auth_token
(optional): For private datasetsget_info
dataset
: Dataset identifierauth_token
(optional): For private datasetsget_rows
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split namepage
(optional): Page number (0-based)auth_token
(optional): For private datasetsget_first_rows
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split nameauth_token
(optional): For private datasetsget_statistics
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split nameauth_token
(optional): For private datasetssearch_dataset
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split namequery
: Text to search forauth_token
(optional): For private datasetsfilter
dataset
: Dataset identifierconfig
: Configuration namesplit
: Split namewhere
: SQL WHERE clause (e.g. "score > 0.5")orderby
(optional): SQL ORDER BY clausepage
(optional): Page number (0-based)auth_token
(optional): For private datasetsget_parquet
dataset
: Dataset identifierauth_token
(optional): For private datasetsgit clone https://github.com/privetin/dataset-viewer.git
cd dataset-viewer
# Create virtual environment
uv venv
# Activate virtual environment
# On Unix:
source .venv/bin/activate
# On Windows:
.venv\Scripts\activate
# Install in development mode
uv add -e .
HUGGINGFACE_TOKEN
: Your Hugging Face API token for accessing private datasetsAdd the following to your Claude Desktop config file:
On Windows: %APPDATA%\Claude\claude_desktop_config.json
On MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
{
"mcpServers": {
"dataset-viewer": {
"command": "uv",
"args": [
"--directory",
"parent_to_repo/dataset-viewer",
"run",
"dataset-viewer"
]
}
}
}
MIT License - see LICENSE for details
Reviews feature coming soon
Stay tuned for community discussions and feedback