Code Monkey home page Code Monkey logo

izoogood / torrscrapper Goto Github PK

View Code? Open in Web Editor NEW
11.0 1.0 1.0 17.08 MB

web scrapper designed to extract magnets for various types of media, such as games, movies, and more. It enables users to search for torrents and view the results without the distraction of ads.

Python 32.71% HTML 39.89% CSS 25.06% JavaScript 2.30% Procfile 0.04%
1337x django pirate-bay pirate-bay-api torrents torrents-collection web-scraping movies-downloader games-collection tor

torrscrapper's Introduction

Torrscrapper

Torrscrapper is a web scraping application designed to extract torrents for various types of media, such as games, movies, and more. It enables users to search for torrents and view the results without the distraction of ads.

Getting Started

These instructions will help you set up and run the Torrscrapper application on your local machine.

Prerequisites

Before you begin, ensure you have the following software installed:

Start Searching

Run the below commands in the project directory:

  1. pip install -r requirements.txt
  2. python manage.py runserver

Now visit http://127.0.0.1:8000/ and start searching for your torrents without any ads.

Features

Advanced Scraping Techniques

  1. Concurrent Scraping: Utilizes Python modules like asyncio to perform multiple web scraping requests simultaneously, significantly reducing the total time spent in scraping.
  2. Cloudflare Bypass: Employs the cloudscrape module to effectively bypass Cloudflare security measures, ensuring reliable access to torrent sites.

Comprehensive Torrent Databases

  1. Games Database: Features a pre-built SQLite database containing links to magnet files for over 3,000 popular games.
  2. Movies Database: Comes with a pre-built SQLite database housing magnet links for more than 10,000 movies.

Ongoing Enhancements

  • Database Updates: Ongoing work includes updating the SQLite databases using Python scripts to ensure the latest and most popular content is readily available.
  • Site Support Expansion: Planning to extend support to additional torrent sites

torrscrapper's People

Contributors

izoogood avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Forkers

vineetpcodes123

torrscrapper's Issues

getting this error when trying to start the server

C:\Users\fqf1f\torrscrapper>python manage.py runserver
Watching for file changes with StatReloader
Performing system checks...

Exception in thread django-main-thread:
Traceback (most recent call last):
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\threading.py", line 1038, in _bootstrap_inner
self.run()
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\threading.py", line 975, in run
self._target(*self._args, **self.kwargs)
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\utils\autoreload.py", line 64, in wrapper
fn(*args, **kwargs)
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\core\management\commands\runserver.py", line 133, in inner_run
self.check(display_num_errors=True)
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\core\management\base.py", line 485, in check
all_issues = checks.run_checks(
^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\core\checks\registry.py", line 88, in run_checks
new_errors = check(app_configs=app_configs, databases=databases)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\core\checks\urls.py", line 14, in check_url_config
return check_resolver(resolver)
^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\core\checks\urls.py", line 24, in check_resolver
return check_method()
^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\urls\resolvers.py", line 494, in check
for pattern in self.url_patterns:
^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\utils\functional.py", line 57, in get
res = instance.dict[self.name] = self.func(instance)
^^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\urls\resolvers.py", line 715, in url_patterns
patterns = getattr(self.urlconf_module, "urlpatterns", self.urlconf_module)
^^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\utils\functional.py", line 57, in get
res = instance.dict[self.name] = self.func(instance)
^^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\urls\resolvers.py", line 708, in urlconf_module
return import_module(self.urlconf_name)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\importlib_init
.py", line 126, in import_module
return _bootstrap._gcd_import(name[level:], package, level)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "", line 1206, in _gcd_import
File "", line 1178, in _find_and_load
File "", line 1149, in _find_and_load_unlocked
File "", line 690, in _load_unlocked
File "", line 940, in exec_module
File "", line 241, in call_with_frames_removed
File "C:\Users\fqf1f\torrscrapper\webapps\urls.py", line 20, in
path('',include('torrscrapper.urls')),
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\site-packages\django\urls\conf.py", line 38, in include
urlconf_module = import_module(urlconf_module)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\fqf1f\AppData\Local\Programs\Python\Python311\Lib\importlib_init
.py", line 126, in import_module
return _bootstrap._gcd_import(name[level:], package, level)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "", line 1206, in _gcd_import
File "", line 1178, in _find_and_load
File "", line 1149, in _find_and_load_unlocked
File "", line 690, in _load_unlocked
File "", line 940, in exec_module
File "", line 241, in _call_with_frames_removed
File "C:\Users\fqf1f\torrscrapper\torrscrapper\urls.py", line 2, in
from . import views
File "C:\Users\fqf1f\torrscrapper\torrscrapper\views.py", line 6, in
from .scraping_utils import scrape_data
File "C:\Users\fqf1f\torrscrapper\torrscrapper\scraping_utils.py", line 9, in
import humanize
ModuleNotFoundError: No module named 'humanize'

403 Forbidden Error When Scraping 1337x Site

Description

receives a 403 Forbidden error when attempting to scrape the 1337x site.

Logs

2024-01-27 16:45:25 - INFO - ---------------------------------------------------
2024-01-27 16:45:25 - INFO - 🚀Starting the scraping session
2024-01-27 16:45:25 - INFO - 🌐 Site #1 - Starting scraping for site: 1337x
2024-01-27 16:45:27 - ERROR - Failed to scrape 1337x. Status code: 403
2024-01-27 16:45:27 - INFO - Site #1 - Completed scraping. Time taken: 1.81 seconds
2024-01-27 16:45:27 - INFO - Site #1 - --------------------------------

Steps to Reproduce

  1. Initiate the scraping session for the 1337x site.
  2. Observe the error logged by the system.

Possible Solutions

  • Find 1337x fallbacks, perhaps use tor to route traffic ?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.