Comments (7)

light-bit commented on August 17, 2024

I noticed this as well. Unfortunately, Funda changed the layout of their website, so it seems the scraper has to be updated to work with the new layout.

The good news is that bs4 still returns the page content. Let's hope someone has the time to update the scripts accordingly.

from funda-scraper.

whchien commented on August 17, 2024

Hi @BTuyn @light-bit @PieterK123 @dadadima94

I just released a new version of funda_scraper. If you install the latest version (v1.0.0), the scraping should work without problems. Please let me know if you encounter anything unusual. All feedback is appreciated!

PieterK123 commented on August 17, 2024

Same here. Would indeed be great if we can still get the script working.

PieterK123 commented on August 17, 2024

Did you manage to get it working, @BTuyn?

dadadima94 commented on August 17, 2024

Same here!

BTuyn commented on August 17, 2024

Sorry for the late response. The original problem does seem to be fixed, but after checking the update I noticed some other bugs.
The raw dataframe looks fine. The clean dataframe, however, is missing quite a lot of rows and isn't processing a few columns properly, such as 'has_balcony' and 'has_garden'. This is a separate issue from the original one, though, and I see somebody else has already raised an issue for it.
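
For anyone who wants to quantify this kind of gap, here is a rough sketch of a raw-versus-clean comparison in pandas. It is not part of funda_scraper: `compare_frames` is a hypothetical helper, the two small frames are made-up stand-ins for the scraper output, and it assumes the clean frame keeps the raw column names (which may not hold if preprocessing renames columns).

```python
import pandas as pd


def compare_frames(raw: pd.DataFrame, clean: pd.DataFrame) -> dict:
    """Summarize what was lost between a raw and a cleaned dataframe.

    Hypothetical helper for illustration; assumes column names are
    unchanged between the two frames.
    """
    return {
        "raw_rows": len(raw),
        "clean_rows": len(clean),
        "dropped_rows": len(raw) - len(clean),
        # Columns present in the raw frame but absent from the clean one.
        "missing_columns": sorted(set(raw.columns) - set(clean.columns)),
    }


# Made-up stand-in data; real frames would come from the scraper.
raw = pd.DataFrame(
    {
        "url": ["a", "b", "c"],
        "price": ["350000", None, "425000"],
        "has_balcony": [1, 0, 1],
    }
)
# Simulate a cleaning step that drops incomplete rows and one column.
clean = raw.dropna(subset=["price"]).drop(columns=["has_balcony"])

summary = compare_frames(raw, clean)
print(summary)
```

Running the same comparison on the real raw and clean frames would show how many rows the cleaning step drops and which columns disappear.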

whchien commented on August 17, 2024

Hi @BTuyn

Yes, you are correct. I removed the columns 'has_balcony' and 'has_garden' from the clean dataframe. Funda changed how they describe these exterior features, so the original preprocessing script needs to be revised. I will include the new preprocessing logic in the next release.
