github not yet analyzed

Jina AI

AI-powered search and retrieval platform. Search the web, read page content, extract structured data, and ground AI responses.

maintainer: jina
license: —
first seen: 2026-05-22
last seen: 2026-06-17
releases · 30d: 0

Drift inferred · capture-to-capture

MEDIUM 2026-06-13 gained untrusted · 2 tools changed permissions
LOW 2026-06-06 gained verified

tool count over time

2026-05-30 21t

2026-06-06 21t

2026-06-13 21t

capabilities 21 tools

transport: http
verified: reported Smithery-verified
counts: 21 tools · 0 res · 0 prompts
permission surface: via registry introspection

tools

capture_screenshot_url

Capture high-quality screenshots of web pages in base64 encoded JPEG format. Use this tool when you need to visually inspect a website, take a snapshot for analysis, or show users what a webpage looks like.

in ▸ url return_url firstScreenOnly
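A minimal sketch of how a client might invoke one of these tools, assuming the standard MCP `tools/call` JSON-RPC request shape. The listing gives only parameter names, so the argument types used here (a string `url`, a boolean `firstScreenOnly`) are assumptions, and `build_tool_call` is a hypothetical helper, not part of any SDK.

```python
import json


def build_tool_call(request_id, tool_name, arguments):
    """Build an MCP `tools/call` JSON-RPC 2.0 request as a plain dict."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Argument types are assumed; the listing above names the parameters only.
request = build_tool_call(
    1,
    "capture_screenshot_url",
    {"url": "https://example.com", "firstScreenOnly": True},
)
print(json.dumps(request, indent=2))
```

The same helper would apply to any tool in this list by swapping the tool name and arguments.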
classify_text

Classify texts into user-defined labels using Jina embeddings. Use this when you need to categorize, tag, or sort text content into predefined categories. Perfect for sentiment analysis, topic classification, content moderation, or any text categorization task.

in ▸ model texts labels
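For illustration, one common way to classify text with embeddings is to embed both the texts and the labels and pick the most similar label by cosine similarity. The sketch below shows that idea with made-up 2-D vectors standing in for real embeddings; whether the server works this way internally is not stated in the listing.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def classify(text_vecs, label_vecs):
    """Assign each text vector the label whose vector is most similar."""
    return [
        max(label_vecs, key=lambda label: cosine(vec, label_vecs[label]))
        for vec in text_vecs
    ]


# Toy vectors, invented for this example (real embeddings have many more dimensions).
label_vecs = {"positive": [1.0, 0.1], "negative": [-1.0, 0.2]}
text_vecs = [[0.9, 0.3], [-0.7, 0.1]]
predicted = classify(text_vecs, label_vecs)
print(predicted)
```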
deduplicate_images

Get top-k semantically unique images (URLs or base64-encoded) using Jina CLIP v2 embeddings and submodular optimization. Use this when you have many visually similar images and want the most diverse subset.

in ▸ k images
deduplicate_strings

Get top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Use this when you have many similar strings and want to select the most diverse subset that covers the semantic space. Perfect for removing duplicates, selecting representative samples, or finding diverse content.

in ▸ k strings
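The description mentions submodular optimization without naming an objective. As a rough illustration only, the sketch below greedily maximizes a facility-location objective (a standard monotone submodular function) over toy vectors. The objective, the `select_diverse` helper, and the vectors are all assumptions for this example, not the server's implementation.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def select_diverse(vectors, k):
    """Greedily pick up to k indices that best cover the whole set."""
    n = len(vectors)
    sim = [[cosine(vectors[i], vectors[j]) for j in range(n)] for i in range(n)]
    selected = []
    coverage = [0.0] * n  # best similarity of each item to the selected set so far
    for _ in range(min(k, n)):
        best_idx, best_gain = None, -1.0
        for cand in range(n):
            if cand in selected:
                continue
            # Marginal gain: how much this candidate improves coverage overall.
            gain = sum(max(sim[cand][j] - coverage[j], 0.0) for j in range(n))
            if gain > best_gain:
                best_idx, best_gain = cand, gain
        selected.append(best_idx)
        coverage = [max(coverage[j], sim[best_idx][j]) for j in range(n)]
    return selected


# Two tight clusters of toy vectors: indices 0-1 and indices 2-3.
vectors = [[1.0, 0.0], [0.99, 0.1], [0.0, 1.0], [0.1, 0.99]]
picked = select_diverse(vectors, 2)
print(picked)
```

With `k=2`, the greedy pass should return one index from each cluster, since a second pick from an already covered cluster adds almost no gain.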
expand_query

Expand and rewrite search queries based on an up-to-date query expansion model. This tool takes an initial query and returns multiple expanded queries that can be used for more diverse and deeper searches. Useful for improving deep research results by searching broader and deeper.

in ▸ query
extract_pdf

Extract figures, tables, and equations from PDF documents using layout detection. Perfect for extracting visual elements from academic papers on arXiv or any PDF URL. Returns base64-encoded images of detected elements with metadata.

in ▸ id url type max_edge
guess_datetime_url

Guess the last updated or published datetime of a web page. This tool examines HTTP headers, HTML metadata, Schema.org data, visible dates, JavaScript timestamps, HTML comments, Git information, RSS/Atom feeds, sitemaps, and international date formats to provide the most accurate update time with confidence scores. Returns the best guess timestamp and confidence level.

in ▸ url
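Of the signals listed above, HTTP headers are the simplest to show. The sketch below parses a `Last-Modified` header with Python's standard library. It covers that single signal only, and the `last_modified_from_headers` helper is hypothetical; it says nothing about how the tool weighs or scores its sources.

```python
from email.utils import parsedate_to_datetime


def last_modified_from_headers(headers):
    """Return the Last-Modified header as a datetime, or None if absent or invalid."""
    value = headers.get("Last-Modified")
    if not value:
        return None
    try:
        return parsedate_to_datetime(value)
    except (TypeError, ValueError):
        # Older Python versions raise TypeError on unparseable input, newer ones ValueError.
        return None


headers = {"Last-Modified": "Wed, 21 Oct 2015 07:28:00 GMT"}
stamp = last_modified_from_headers(headers)
print(stamp.isoformat())
```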
parallel_read_url

Read multiple web pages in parallel to extract clean content efficiently. For best results, provide multiple URLs that you need to extract simultaneously. This is useful for comparing content across multiple sources or gathering information from multiple pages at once.

in ▸ urls timeout
parallel_search_arxiv

Run multiple arXiv searches in parallel for comprehensive research coverage and diverse academic angles. For best results, provide multiple search queries that explore different research angles and methodologies. You can use expand_query to help generate diverse queries, or create them yourself.

in ▸ timeout searches
parallel_search_ssrn

Run multiple SSRN searches in parallel for comprehensive social science research coverage and diverse academic angles. For best results, provide multiple search queries that explore different research angles and methodologies. You can use expand_query to help generate diverse queries, or create them yourself.

in ▸ timeout searches
parallel_search_web

Run multiple web searches in parallel for comprehensive topic coverage and diverse perspectives. For best results, provide multiple search queries that explore different aspects of your topic. You can use expand_query to help generate diverse queries, or create them yourself.

in ▸ timeout searches
primer

Get up-to-date contextual information of the current session to provide localized, time-aware responses. Use this when you need to know the current time, user's location, or network environment to give more relevant and personalized information.
read_url

Extract and convert web page content to clean, readable markdown format. Perfect for reading articles, documentation, blog posts, or any web content. Use this when you need to analyze text content from websites, bypass paywalls, or get structured data.

in ▸ url withAllLinks withAllImages
search_arxiv

Search academic papers and preprints on arXiv repository. Perfect for finding research papers, scientific studies, technical papers, and academic literature. Use this when researching scientific topics, looking for papers by specific authors, or finding the latest research in fields like AI, physics, mathematics, computer science, etc.

in ▸ num tbs query
search_bibtex

Search for academic papers and return BibTeX citations. Searches DBLP (computer science) and Semantic Scholar (broad academic coverage) for comprehensive results. Returns formatted BibTeX entries ready to use in LaTeX documents.

in ▸ num year query author
search_images

Search for images across the web, similar to Google Images. Use this when you need to find photos, illustrations, diagrams, charts, logos, or any visual content. Perfect for finding images to illustrate concepts, locating specific pictures, or discovering visual resources. Images are returned by default as small base64-encoded JPEG images.

in ▸ gl hl tbs query location return_url
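Since images come back base64-encoded by default, a client may want a quick sanity check before using the bytes. A small sketch, assuming the payload is a bare base64 string with no data-URI prefix (the exact response envelope is not shown in this listing); `looks_like_jpeg` is a hypothetical helper.

```python
import base64


def looks_like_jpeg(b64_data):
    """Decode a base64 string and check for the JPEG start-of-image marker."""
    try:
        raw = base64.b64decode(b64_data, validate=True)
    except ValueError:
        # binascii.Error is a subclass of ValueError.
        return False
    return raw[:2] == b"\xff\xd8"


# A fabricated payload: a JPEG-style header followed by filler bytes, for demonstration only.
sample = base64.b64encode(b"\xff\xd8\xff\xe0" + b"\x00" * 8).decode("ascii")
print(looks_like_jpeg(sample))
```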
search_jina_blog

Search Jina AI news and blog posts at jina.ai/news for articles about AI, machine learning, neural search, embeddings, and Jina products. Use this to find official Jina documentation, tutorials, product announcements, and technical deep-dives.

in ▸ num tbs query
search_ssrn

Search academic papers and preprints on SSRN (Social Science Research Network). Perfect for finding research papers in social sciences, economics, law, finance, accounting, management, and humanities. Use this when researching social science topics, looking for working papers, or finding the latest research in business and economics fields.

in ▸ num tbs query
search_web

Search the entire web for current information, news, articles, and websites. Use this when you need up-to-date information, want to find specific websites, research topics, or get the latest news. Ideal for answering questions about recent events, finding resources, or discovering relevant content.

in ▸ gl hl num tbs query location
show_api_key

Return the bearer token from the Authorization header of the MCP settings. Intended for debugging.
sort_by_relevance

Rerank a list of documents by relevance to a query using Jina Reranker API. Use this when you have multiple documents and want to sort them by how well they match a specific query or topic. Perfect for document retrieval, content filtering, or finding the most relevant information from a collection.

in ▸ query top_n documents

code evidence

evidence-backed findings quoted directly from the published source artifact — not inferred

This server hasn't been statically analyzed yet (npm/PyPI artifacts only).
