Skip to content

morphik-org/morphik-core

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Morphik Logo

PRs Welcome GitHub commit activity GitHub closed issues PyPI - Downloads Discord

Docs - Community - Why Morphik? - Bug reports

Morphik is an alternative to traditional RAG for highly technical and visual documents.

Morphik provides developers the tools to ingest, search (deep and shallow), transform, and manage unstructured and multimodal documents. Some of our features include:

  • Multimodal Search: We employ techniques such as ColPali to build search that actually understands the visual content of documents you provide. Search over images, PDFs, videos, and more with a single endpoint.
  • Knowledge Graphs: Build knowledge graphs for domain-specific use cases in a single line of code. Use our battle-tested system prompts, or use your own.
  • Fast and Scalable Metadata Extraction: Extract metadata from documents - including bounding boxes, labeling, classification, and more.
  • Integrations: Integrate with existing tools and workflows. Including (but not limited to) Google Suite, Slack, and Confluence.
  • Cache-Augmented-Generation: Create persistent KV-caches of your documents to speed up generation.

The best part? Morphik has a free tier and is open source! Get started by signing up at Morphik.

Table of Contents

Getting Started with Morphik (Recommended)

The fastest and easiest way to get started with Morphik is by signing up for free at Morphik. Your first 200 pages and 100 queries are on us! After this, you can pay based on usage with discounted rates for heavier use.

Self-hosting the open-source version

If you'd like to self-host Morphik, you can find the dedicated instruction here. We offer options for direct installation and installation via docker.

Important: Due to limited resources, we cannot provide full support for open-source deployments. We have an installation guide, and a Discord community to help, but we can't guarantee full support.

Using Morphik

Once you've signed up for Morphik, you can get started with ingesting and search your data right away.

Code (Example: Python SDK)

For programmers, we offer a Python SDK and a REST API. Ingesting a file is as simple as:

from morphik import Morphik

morphik = Morphik("<your-morphik-uri>")
morphik.ingest_file("path/to/your/super/complex/file.pdf")

Similarly, searching and querying your data is easy too:

morphik.query("What's the height of screw 14-A in the chair assembly instructions?")

Morphik Console

You can also interact with Morphik via the Morphik Console. This is a web-based interface that allows you to ingest, search, and query your data. You can upload files, connect to different data sources, and chat with your data all within the same place.

Model Context Protocol

Finally, you can also access Morphik via MCP. Instructions are available here.

Contributing

You're welcome to contribute to the project! We love:

Currently, we're focused on improving speed, integrating with more tools, and finding the research papers that provide the most value to our users. If you have thoughts, let us know in the discord or in GitHub!

Open source vs paid

Certain features - such as Morphik Console - are not available in the open-source version. Any feature in the ee namespace is not available in the open-source version and carries a different license. Any feature outside that is open source under the MIT expat license.

Contributors

Visit our special thanks page dedicated to our contributors here.

PS

We took inspiration from PostHog while writing this README. If you're from PostHog, thank you ❤️