Skip to content

Summarize webpages from specified URLs using the LangChain framework and the ChatOllama model

License

Notifications You must be signed in to change notification settings

cobanov/easy-web-summarizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

52 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Easy Webpage Summarizer

A Python script designed to summarize webpages from specified URLs using the LangChain framework and the ChatOllama model. It leverages advanced language models to generate detailed summaries, making it an invaluable tool for quickly understanding the content of web-based documents.

Requirements

ollama must be installed and served

ollama run llama3:instruct
pip install -r requirements.txt

Features

  • Summarization of webpages and youtube videos directly from URLs.
  • Translates to Turkish language (other languages will be added soon!)
  • Integration with LangChain and ChatOllama for state-of-the-art summarization.
  • Command-line interface for easy use and integration into workflows.

Usage

To use the webpage summarizer, run the script from the command line, providing the URL of the document you wish to summarize:

python summarizer.py -u "http://example.com/document"

Replace http://example.com/document with the actual URL of the document you want to summarize.

Web UI

To use the webpage summarizer in you web browser, you can also try gradio app.

python app/webui.py

gradio

Docker

docker build -t web_summarizer .
docker run -p 7860:7860 web_summarizer

# Run if you run ollama on host
docker run -d --network='host' -p 7860:7860 web_summarizer

Development

To contribute to the development of this script, clone the repository, make your changes, and submit a pull request. We welcome contributions that improve the script's functionality or extend its capabilities.

  • Summarize youtube videos
  • Dockerize project
  • Translate to different languages
  • Streaming text output on gradio
  • Serve on web

License

This script is released under the MIT License. See the LICENSE file in the repository for full details.