Code Monkey home page Code Monkey logo

aisearch's Introduction

microsearch

microsearch is a minimal Python search engine designed for simplicity and efficiency. The project allows users to perform searches using Python, and it also provides an option to deploy a FastAPI app with an endpoint and a website for a user-friendly experience. It has been designed to provide users with a straightforward way to deploy their own search engine and search documents from their favorite blogs. The project includes a script for asynchronously downloading all the posts from a series of RSS feeds.

Features:

  • Python Implementation: microsearch is entirely implemented in Python, making it accessible and easy to understand for developers with varying levels of experience.

  • FastAPI App Deployment: The project provides an option to deploy a FastAPI app, allowing users to interact with the search engine through a dedicated endpoint and a user-friendly website.

  • RSS Feed Crawling Script: To populate the search engine with data, microsearch offers a script for asynchronously downloading posts from a series of RSS feeds. This feature ensures that users can conveniently aggregate content from their chosen blogs.

Getting started

The first step is to download this repo

git clone https://github.com/alexmolas/microsearch.git

Then, I recommend you install everything in a virtual environment. I usually use virtualenv but any other environment manager should work.

virtualenv -p python3.10 venv

activate the environment

source venv/bin/activate

and install the package and the dependencies

pip install .

Crawl data

Now we need to download the content of the blogs. I'm sharing here a list of feed examples, but please feel free to use your own. To download the content do

python download_content.py --feed-path feeds.txt

Launch app

Finally, once the content is crawled and stored you can run the app as

python -m app.app --data-path output.parquet

and if you navigate to http://127.0.0.1:8000/ you'll be able to query the engine.

aisearch's People

Contributors

alexmolas avatar mehrh8 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.