github not yet analyzed

Jina AI

AI-powered search and retrieval platform. Search the web, read page content, extract structured data, and ground AI responses.

maintainer: jina
license:
first seen: 2026-05-22
last seen: 2026-06-17
releases · 30d: 0
short id:

Drift inferred · capture-to-capture

  1. MEDIUM gained untrusted · 2 tools changed permissions
  2. LOW gained verified

tool count over time (chart): 21t, 21t, 21t
capabilities: 21 tools
transport: http
verified: reported Smithery-verified
counts: 21 tools · 0 resources · 0 prompts
permission surface: via registry introspection

tools

  • capture_screenshot_url

    Capture high-quality screenshots of web pages in base64 encoded JPEG format. Use this tool when you need to visually inspect a website, take a snapshot for analysis, or show users what a webpage looks like.

    in ▸ url return_url firstScreenOnly

  • classify_text

    Classify texts into user-defined labels using Jina embeddings. Use this when you need to categorize, tag, or sort text content into predefined categories. Perfect for sentiment analysis, topic classification, content moderation, or any text categorization task.

    in ▸ model texts labels

  • deduplicate_images

    Get top-k semantically unique images (URLs or base64-encoded) using Jina CLIP v2 embeddings and submodular optimization. Use this when you have many visually similar images and want the most diverse subset.

    in ▸ k images

  • deduplicate_strings

    Get top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Use this when you have many similar strings and want to select the most diverse subset that covers the semantic space. Perfect for removing duplicates, selecting representative samples, or finding diverse content.

    in ▸ k strings

  • expand_query

    Expand and rewrite search queries using an up-to-date query expansion model. This tool takes an initial query and returns multiple expanded queries that can be used for more diverse and deeper searches. Useful for improving deep research results by searching more broadly and deeply.

    in ▸ query

  • extract_pdf

    Extract figures, tables, and equations from PDF documents using layout detection. Perfect for extracting visual elements from academic papers on arXiv or any PDF URL. Returns base64-encoded images of detected elements with metadata.

    in ▸ id url type max_edge

  • guess_datetime_url

    Guess the last updated or published datetime of a web page. This tool examines HTTP headers, HTML metadata, Schema.org data, visible dates, JavaScript timestamps, HTML comments, Git information, RSS/Atom feeds, sitemaps, and international date formats to provide the most accurate update time with confidence scores. Returns the best guess timestamp and confidence level.

    in ▸ url

  • parallel_read_url

    Read multiple web pages in parallel to extract clean content efficiently. For best results, provide multiple URLs that you need to extract simultaneously. This is useful for comparing content across multiple sources or gathering information from multiple pages at once.

    in ▸ urls timeout

  • parallel_search_arxiv

    Run multiple arXiv searches in parallel for comprehensive research coverage and diverse academic angles. For best results, provide multiple search queries that explore different research angles and methodologies. You can use expand_query to help generate diverse queries, or create them yourself.

    in ▸ timeout searches

  • parallel_search_ssrn

    Run multiple SSRN searches in parallel for comprehensive social science research coverage and diverse academic angles. For best results, provide multiple search queries that explore different research angles and methodologies. You can use expand_query to help generate diverse queries, or create them yourself.

    in ▸ timeout searches

  • parallel_search_web

    Run multiple web searches in parallel for comprehensive topic coverage and diverse perspectives. For best results, provide multiple search queries that explore different aspects of your topic. You can use expand_query to help generate diverse queries, or create them yourself.

    in ▸ timeout searches

  • primer

    Get up-to-date contextual information about the current session to provide localized, time-aware responses. Use this when you need to know the current time, the user's location, or the network environment to give more relevant and personalized information.

  • read_url

    Extract and convert web page content to clean, readable markdown format. Perfect for reading articles, documentation, blog posts, or any web content. Use this when you need to analyze text content from websites, bypass paywalls, or get structured data.

    in ▸ url withAllLinks withAllImages

  • search_arxiv

    Search academic papers and preprints on the arXiv repository. Perfect for finding research papers, scientific studies, technical papers, and academic literature. Use this when researching scientific topics, looking for papers by specific authors, or finding the latest research in fields like AI, physics, mathematics, and computer science.

    in ▸ num tbs query

  • search_bibtex

    Search for academic papers and return BibTeX citations. Searches DBLP (computer science) and Semantic Scholar (broad academic coverage) for comprehensive results. Returns formatted BibTeX entries ready to use in LaTeX documents.

    in ▸ num year query author

  • search_images

    Search for images across the web, similar to Google Images. Use this when you need to find photos, illustrations, diagrams, charts, logos, or any visual content. Perfect for finding images to illustrate concepts, locating specific pictures, or discovering visual resources. Images are returned by default as small base64-encoded JPEG images.

    in ▸ gl hl tbs query location return_url

  • search_jina_blog

    Search Jina AI news and blog posts at jina.ai/news for articles about AI, machine learning, neural search, embeddings, and Jina products. Use this to find official Jina documentation, tutorials, product announcements, and technical deep-dives.

    in ▸ num tbs query

  • search_ssrn

    Search academic papers and preprints on SSRN (Social Science Research Network). Perfect for finding research papers in social sciences, economics, law, finance, accounting, management, and humanities. Use this when researching social science topics, looking for working papers, or finding the latest research in business and economics fields.

    in ▸ num tbs query

  • search_web

    Search the entire web for current information, news, articles, and websites. Use this when you need up-to-date information, want to find specific websites, research topics, or get the latest news. Ideal for answering questions about recent events, finding resources, or discovering relevant content.

    in ▸ gl hl num tbs query location

  • show_api_key

    Return the bearer token from the Authorization header of the MCP settings. Intended for debugging.

  • sort_by_relevance

    Rerank a list of documents by relevance to a query using Jina Reranker API. Use this when you have multiple documents and want to sort them by how well they match a specific query or topic. Perfect for document retrieval, content filtering, or finding the most relevant information from a collection.

    in ▸ query top_n documents
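
The tool list above gives only input names, so the following is a minimal sketch of how a client might assemble a call to one of these tools. It assumes the standard MCP JSON-RPC 2.0 `tools/call` request shape. The `TOOL_INPUTS` table copies a few "in ▸" lines from this listing. The helper name `build_tool_call`, the example argument values, and their types are hypothetical, since the listing does not state value types or which inputs are required.

```python
import json
from itertools import count

# Input names copied from the "in ▸" lines of the listing (a subset).
# Value types and required/optional status are NOT given there.
TOOL_INPUTS = {
    "read_url": {"url", "withAllLinks", "withAllImages"},
    "search_web": {"gl", "hl", "num", "tbs", "query", "location"},
    "sort_by_relevance": {"query", "top_n", "documents"},
    "deduplicate_strings": {"k", "strings"},
}

_request_ids = count(1)  # simple monotonically increasing JSON-RPC ids


def build_tool_call(name, arguments):
    """Build a JSON-RPC 2.0 `tools/call` request dict (hypothetical helper).

    Rejects tool names and input names that do not appear in TOOL_INPUTS.
    """
    if name not in TOOL_INPUTS:
        raise KeyError(f"unknown tool: {name}")
    unexpected = set(arguments) - TOOL_INPUTS[name]
    if unexpected:
        raise ValueError(f"unexpected inputs for {name}: {sorted(unexpected)}")
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


if __name__ == "__main__":
    # Argument values below are illustrative assumptions only.
    request = build_tool_call(
        "sort_by_relevance",
        {
            "query": "neural search",
            "top_n": 2,
            "documents": ["doc about embeddings", "doc about cooking"],
        },
    )
    print(json.dumps(request, indent=2))
```

Checking input names on the client side only catches misspelled parameters early. The server's own tool schema remains the authority on types and required fields.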

code evidence
evidence-backed findings quoted directly from the published source artifact — not inferred

This server hasn't been statically analyzed yet (npm/PyPI artifacts only).