Comments (3)
indeed our cities with population table is broken
select country_iso_code,wikidataurl,cityPop,subdivision_1_iso_code from citiesWithPopulation where city_name='Berlin'
country_iso_code | wikidataurl | cityPop | subdivision_1_iso_code |
---|---|---|---|
US | http://www.wikidata.org/entity/Q821199 | 19866.0 | CT |
US | http://www.wikidata.org/entity/Q821244 | 10051.0 | NH |
US | http://www.wikidata.org/entity/Q1569850 | 5524.0 | WI |
US | http://www.wikidata.org/entity/Q614184 | 4485.0 | MD |
US | http://www.wikidata.org/entity/Q524646 | 2866.0 | MA |
SV | http://www.wikidata.org/entity/Q582242 | (null) | US |
US | http://www.wikidata.org/entity/Q1086827 | (null) | NJ |
US | http://www.wikidata.org/entity/Q1130950 | (null) | PA |
from geograpy3.
The same issue takes place with many other cities as well. The following code reproduces it. Note that Belgium is absent in the list of countries whereas Brussels (probably Brussels, Wisconsin, USA) is present in the list of cities within USA.
import geograpy
url = 'https://en.wikipedia.org/wiki/UEFA_Euro_2020'
places = geograpy.get_geoPlace_context(url=url)
print(places.country_cities)
Output:
{
"Azerbaijan": ["Baku"],
"Belarus": ["Dublin"],
"Romania": ["Bucharest"],
"Australia": ["Denmark", "Seville"],
"Portugal": ["Portugal"],
"Spain": ["Seville", "Bilbao"],
"Denmark": ["Copenhagen"],
"United Kingdom": ["March", "London", "Glasgow"],
"Switzerland": ["Nyon"],
"Netherlands": ["Vijfhuizen", "Rome", "Amsterdam"],
"Belgium": ["Brussels"],
"Germany": ["Munich"],
"Ireland": ["Portmarnock", "Dublin"],
"Hungary": ["Budapest"],
"Italy": ["Rome"],
"Colombia": ["Armenia"],
"United States": [
"England",
"London",
"Scotland",
"Dublin",
"Rome",
"Brussels",
"English",
"Glasgow",
"Amsterdam",
"Turkey",
"Denmark",
"North",
"Italy",
"Ireland",
"Finland",
"Munich",
"Copenhagen",
"Russia",
"Seville",
"Belgium",
],
"Canada": ["Brussels", "Dublin", "London", "Scotland"],
}
from geograpy3.
the legacy get_geo_place_context approach will be problematic. You might want to go with the new Locator or LocationContext interfaces which use the wikidata information and the populationas a disambiguator. See
Singapore, Berlin and Athens as examples below which are correctly prioritized to be located in SG, DE and GR
select distinct cl.wikidataid,label,name,pop,regionIso,regionName,countryIso,countryName,gndId,geoNameId,lat,lon from
city_labels l
join CityLookup cl on l.wikidataid=cl.wikidataid
where l.label in ('Berlin',',St. Petersburg','Singapore','Athens')
order by pop desc
wikidataid | label | name | pop | regionIso | regionName | countryIso | countryName | gndId | geoNameId | lat | lon |
---|---|---|---|---|---|---|---|---|---|---|---|
Q334 | Singapore | Singapore | 5888926 | SG | Singapore | SG | Singapore | 4055089-8 | 1880251 | 104 | 1 |
Q64 | Berlin | Berlin | 3644826 | DE-BE | Berlin | DE | Germany | 4005728-8 | 6547383 | 53 | 13 |
Q64 | Berlin | Berlin | 3644826 | DE-BE | Berlin | DE | Germany | 4005728-8 | 6547539 | 53 | 13 |
Q64 | Berlin | Berlin | 3644826 | DE-BE | Berlin | DE | Germany | 4005728-8 | 2950159 | 53 | 13 |
Q64 | Berlin | Berlin | 3644826 | DE-BE | Berlin | DE | Germany | 4005728-8 | 2950157 | 53 | 13 |
Q1524 | Athens | Athens | 664046 | GR-I | Attica Region | GR | Greece | 4003366-1 | 264371 | 38 | 24 |
Q203263 | Athens | Athens | 115452 | US-GA | Georgia | US | United States of America | 4195479-8 | 4180386 | 34 | -83 |
Q79439 | Athens | Athens | 24000 | US-AL | Alabama | US | United States of America | 4830668 | 35 | -87 | |
Q755420 | Athens | Athens | 23832 | US-OH | Ohio | US | United States of America | 4207790-4 | 4505542 | 39 | -82 |
Q755420 | Athens | Athens | 21342 | US-OH | Ohio | US | United States of America | 4207790-4 | 4505542 | 39 | -82 |
Q821199 | Berlin | Berlin | 19866 | US-CT | Connecticut | US | United States of America | 5282244 | 42 | -73 | |
Q3292481 | Athens | Athens | 13548 | US-TN | Tennessee | US | United States of America | 4603284 | 35 | -85 | |
Q3292481 | Athens | Athens | 13458 | US-TN | Tennessee | US | United States of America | 4603284 | 35 | -85 | |
Q755425 | Athens | Athens | 12710 | US-TX | Texas | US | United States of America | 4671545 | 32 | -96 | |
Q821244 | Berlin | Berlin | 10051 | US-NH | New Hampshire | US | United States of America | 5083330 | 44 | -71 | |
Q1086827 | Berlin | Berlin | 7588 | US-NJ | New Jersey | US | United States of America | 4500771 | 40 | -75 | |
Q1569850 | Berlin | Berlin | 5524 | US-WI | Wisconsin | US | United States of America | 5245497 | 44 | -89 | |
Q3720748 | Berlin | Berlin Township | 5357 | US-NJ | New Jersey | US | United States of America | 4500777 | 40 | -75 | |
Q614184 | Berlin | Berlin | 4485 | US-MD | Maryland | US | United States of America | 4348460 | 38 | -75 | |
Q3709146 | Athens | Athens | 4089 | US-NY | New York | US | United States of America | 5107467 | 42 | -74 | |
Q570807 | Athens | Athens | 3013 | CA-ON | Ontario | CA | Canada | 5887943 | 45 | -76 | |
Q524646 | Berlin | Berlin | 2866 | US-MA | Massachusetts | US | United States of America | 4930431 | 42 | -72 | |
Q578681 | Athens | Athens | 1988 | US-IL | Illinois | US | United States of America | 4232997 | 40 | -90 | |
Q821215 | Berlin | Berlin | 1880 | US-NY | New York | US | United States of America | 5108863 | 43 | -73 | |
Q4892348 | Berlin | Berlin | 1145 | US-WI | Wisconsin | US | United States of America | 44 | -89 | ||
Q977715 | Athens | Athens | 1105 | US-WI | Wisconsin | US | United States of America | 5244312 | 45 | -90 | |
Q2218754 | Athens | Athens | 1048 | US-WV | West Virginia | US | United States of America | 4797549 | 37 | -81 | |
Q2218754 | Athens | Athens | 1048 | US-WV | West Virginia | US | United States of America | 4797549 | 37 | -81 | |
Q2791349 | Athens | Athens | 1024 | US-MI | Michigan | US | United States of America | 4984489 | 42 | -85 | |
Q3477164 | Athens | Athens | 1019 | US-ME | Maine | US | United States of America | 4956946 | 45 | -70 | |
Q4892350 | Berlin | Berlin | 945 | US-WI | Wisconsin | US | United States of America | 5245512 | 45 | -90 | |
Q142659 | Berlin | Berlin | 898 | US-OH | Ohio | US | United States of America | 5147132 | 41 | -82 | |
Q2345711 | Berlin | Berlin | 551 | US-GA | Georgia | US | United States of America | 4182096 | 31 | -84 | |
Q3239684 | Athens | Athens | 249 | US-LA | Louisiana | US | United States of America | 4315137 | 33 | -93 | |
Q1930098 | Berlin | Lincoln | 162 | US-IA | Iowa | US | United States of America | 4864742 | 42 | -93 | |
Q821229 | Berlin | Berlin | 34 | US-ND | North Dakota | US | United States of America | 5058265 | 46 | -98 | |
Q12303254 | Berlin | Berlin | DK-83 | Southern Denmark | DK | Denmark | 55 | 10 | |||
Q844930 | Athens | Classical Athens | GR-I | Attica Region | GR | Greece | 9962195 | 38 | 24 | ||
Q20144232 | Berlin | Berlín | MX-CHP | Chiapas | MX | Mexico | 8922112 | 16 | -93 | ||
Q582242 | Berlin | Berlín | SV-US | Usulután Department | SV | El Salvador | 3587266 | 14 | -89 | ||
Q4892339 | Berlin | Berlin | US-AL | Alabama | US | United States of America | 4047787 | 34 | -87 | ||
Q4813438 | Athens | Athens | US-AR | Arkansas | US | United States of America | 4099912 | 34 | -94 | ||
Q4813442 | Athens | Athens | US-CA | California | US | United States of America | 5325144 | 34 | -118 | ||
Q2504681 | Berlin | Berlin | US-IL | Illinois | US | United States of America | 4233627 | 40 | -90 | ||
Q4813446 | Athens | Athens | US-IN | Indiana | US | United States of America | 4917703 | 41 | -86 | ||
Q17509488 | Berlin | Berlin | US-KS | Kansas | US | United States of America | 38 | -95 | |||
Q42685464 | Athens | Athens | US-KS | Kansas | US | United States of America | 38 | -96 | |||
Q821190 | Berlin | Berlin | US-KY | Kentucky | US | United States of America | 4283975 | 39 | -84 | ||
Q4813443 | Athens | Athens | US-KY | Kentucky | US | United States of America | 4282781 | 38 | -84 | ||
Q33422655 | Athens | Athens | US-ME | Maine | US | United States of America | 4956946 | 45 | -70 | ||
Q7522845 | Singapore | Singapore | US-MI | Michigan | US | United States of America | 43 | -86 | |||
Q4813447 | Athens | Athens | US-MN | Minnesota | US | United States of America | 5016799 | 45 | -93 | ||
Q4813449 | Athens | Athens | US-MS | Mississippi | US | United States of America | 4416664 | 34 | -88 | ||
Q4892354 | Berlin | Berlin | US-OK | Oklahoma | US | United States of America | 4530350 | 35 | -100 | ||
Q1130437 | Athens | Athens | US-PA | Pennsylvania | US | United States of America | 5178651 | 42 | -77 | ||
Q1130437 | Athens | Athens | US-PA | Pennsylvania | US | United States of America | 5178651 | 42 | -77 | ||
Q1130950 | Berlin | Berlin | US-PA | Pennsylvania | US | United States of America | 4556518 | 40 | -79 | ||
Q1130950 | Berlin | Berlin | US-PA | Pennsylvania | US | United States of America | 4556518 | 40 | -79 | ||
Q21196360 | Berlin | Berlin | US-TX | Texas | US | United States of America | 30 | -96 | |||
Q14711981 | Athens | Athens | US-VA | Virginia | US | United States of America | 4829249 | 38 | -77 | ||
Q755427 | Athens | Athens | US-VT | Vermont | US | United States of America | 5233317 | 43 | -73 | ||
Q821245 | Berlin | Berlin | US-VT | Vermont | US | United States of America | 44 | -73 | |||
Q664745 | Berlin | Berlin | US-WV | West Virginia | US | United States of America | 4798708 | 39 | -80 | ||
Q4892343 | Berlin | Berlin | ZA-EC | Eastern Cape | ZA | South Africa | -33 | 28 |
from geograpy3.
Related Issues (20)
- add python 3.9 checks to CI HOT 1
- Download from Github releases
- Geograpy3 for other languages HOT 10
- United Kingdom not recognised as a country. HOT 1
- [BUG] downloads are done on every call instead of just once
- [BUG] warning on loading the JSON Files HOT 2
- API for lookup
- Improve performance by avoiding ORM loading of all data
- [BUG]wikidataid is not unique and labels are not handled as lists HOT 1
- Signapore Michigan in CityLookup but Singapore, Singapore is not ... HOT 4
- Similarity matching is too error prone HOT 2
- [BUG] location.db access fails within read-only docker container HOT 3
- [BUG]OperationalError: no such table: countries HOT 28
- [BUG] Cities are linked to Netherlands, not to Kingdom of the Netherlands HOT 2
- [BUG] Multiple false positives on quite simple text HOT 4
- :curacao: Please support "Curacao" :pray: HOT 1
- :bug: "United Kingdom" is identified as "United States of America" HOT 1
- add nominatim wrapper
- [BUG] "United Kingdom" Matches to United States HOT 9
- [BUG] UnicodeDecodeError: 'charmap' HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from geograpy3.