Comments (6)
I was looking in some other sites that can give some information on this:
http://www.agenteimovel.com.br/mercado-imobiliario/a-venda/sao-paulo,sp/
http://www.zapimoveis.com.br/quanto-vale-este-imovel-mapa-dos-precos/
from serenata-de-amor.
Looking in ZAP Imoveis, I found that searching for 'estates for rent' instead of 'estates for sale', solve the problem cited by @luizpericolo, the response has a rental price (PrecoLocacao) that we can use:
{
"AreaUtil": null,
"Coordenadas": {
"Latitude": -30.0332536,
"Longitude": -51.2152337
},
"DataUltimoAnuncio": null,
"Dormitorios": null,
"Endereco": null,
"ID": 10779682,
"NotaLocacao": 19.2896946427,
"NotaVenda": 0,
"PrecoLocacao": 6500,
"PrecoM2": null,
"PrecoVenda": null,
"TipoOfertaID": 2,
"indOferta": true
}
With coordinates it's possible to get the address.
And with ID we can get details about estate like "AreaTotal", "AreaUtil", etc:
{
"AreaTotal": 100,
"AreaUtil": 100,
"Bairro": "BOM FIM",
"Dormitorios": 0,
"DormitoriosMaxima": null,
"EstagioObra": 0,
"FotoPrincipal": "http://imagens.zapcorp.com.br/2594410/3056797f-224c-4cf7-8307-9b4834b403fb{tipo}.jpg",
"ID": 10779682,
"IndDistrato": false,
"PrecoLocacao": 6500,
"PrecoVenda": null,
"SubTipoImovel": "Loja/Salão",
"SubTipoOferta": 2,
"Suites": 0,
"SuitesMaxima": null,
"TipoOfertaID": 2,
"Vagas": 0,
"VagasMaxima": null
}
I created a simple script, just for tests, that get and print some estates. If it can be used for the project I can create a script to fetch the data.
from serenata-de-amor.
As addition, one could also get the real state data from VivaReal. I've done it on a personal project.
from serenata-de-amor.
The first site looks really scrapable. It would be easy to build "all possible urls" to collect and save the average real estate price and no further computation would be needed since the site already returns this information. And the site returns information by neighborhood. It might not be as granular as @cuducos commented on PR #27, but it is better than having an average price by state.
The second site would be a bit harder in my opinion. Searching for some neighborhoods here in Rio and checking the response that is rendered to the map you get entries like this:
{
"AreaUtil": null
"Coordenadas": {
"Latitude": -22.902176,
"Longitude": -43.280269
}
"DataUltimoAnuncio": null
"Dormitorios": null
"Endereco": null
"ID": 73390
"PrecoM2": null
"PrecoVenda": 271362.8263
}
It neither returns a value for "PrecoM2" (price by square meter) nor a value for "AreaUtil" (that means something like built area). That means we don't really have information about price for square meter for any real estate and we can't calculate it either. Aside from that the body of the POST that is sent for this API relies on long and lat coordinates.
from serenata-de-amor.
Wow… IMHO that's such an useful dataset to match with housing/accommodation expenses. Thanks for that.
I'm just not sure about which data to use for the scrapping. Real state prices in congressman origin, Brasília or what? Any ideas @Irio?
from serenata-de-amor.
I think PR #100 covers that, so I'm closing this issue.
from serenata-de-amor.
Related Issues (20)
- The dataset 2017-02-15-receipts-texts-raw.tar.xz: Raw Cloud Vision API responses is no longer downloadable.
- Missing electronic receipt filter in Jarbas UI HOT 1
- Ministry of Economy's open consultation regarding Open Data (ends next july 15th)
- Installation using requirements.txt without Anaconda fails HOT 3
- [Suggestion] Please, port Rosie to Mastodon! HOT 1
- Review
- Base 2020
- Internal server error on reimbursement API for some IDs
- Reimbursement API does not return the ID used on the web application
- Prepare the Serenata apps to be deployable in a Kubernetes cluster HOT 4
- Migrate Serenata database for a managed database solution HOT 2
- rosie is not posting on twitter
- Is the project still alive? HOT 3
- Rosie stops mid-classification due to MemoryError in a 32gb ram machine HOT 1
- The trello link is 404
- Creating a Telegram Bot for the project - Criar um bot de telegram para o projeto HOT 1
- Revisão do readme.md
- [bug] Pipeline is failing
- Material para estudar.
- Projeto foi abandonado?? HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from serenata-de-amor.