Using PySpark to find tweets containing drug terms and perform geospatial analysis.
- 500 Cities Census Tract Boundaries
- Illegal drug terms
- Schedule 2 drug names
- 100 million geo-tagged tweets in the US
- In CSV format, with | as the delimiter
- Source: collected through the Twitter Open API
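
The term-matching step over the pipe-delimited tweet file might look roughly like the sketch below. It is not the project's actual code: the column layout (tweet text in column index 3), the sample rows, and the helper names `compile_terms` and `matching_tweets` are assumptions made for illustration. The functions are plain Python so they can be tried locally and then handed to Spark.

```python
import csv
import re


def compile_terms(terms):
    """Build one case-insensitive regex that matches any term as a whole word or phrase."""
    cleaned = sorted({t.strip().lower() for t in terms if t.strip()},
                     key=len, reverse=True)  # longest first so phrases win over substrings
    if not cleaned:
        raise ValueError("term list is empty")
    pattern = r"\b(?:" + "|".join(re.escape(t) for t in cleaned) + r")\b"
    return re.compile(pattern, re.IGNORECASE)


def matching_tweets(lines, term_regex, text_col):
    """Yield parsed rows whose text column mentions at least one term.

    `lines` is any iterable of raw '|'-delimited records; `text_col` is the
    (assumed) index of the tweet text column.
    """
    reader = csv.reader(lines, delimiter="|", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) > text_col and term_regex.search(row[text_col]):
            yield row


# Hypothetical sample rows: id|lat|lon|text
sample = [
    "1|40.7|-74.0|Picked up some oxycodone today",
    "2|34.0|-118.2|Nice weather in LA",
    "3|41.8|-87.6|he is on FENTANYL again",
]
regex = compile_terms(["oxycodone", "fentanyl"])
hits = list(matching_tweets(sample, regex, text_col=3))
```

With PySpark, the same generator could be applied per partition, for example `sc.textFile("tweets.csv").mapPartitions(lambda part: matching_tweets(part, regex, text_col=3))`, where the file path is a placeholder. Tweets whose text itself contains `|` would need extra handling that this sketch does not attempt.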
[('0107000-01073000800', 0.0005182689816014512), ('0177256-01125012000', 0.000244140625), ('0412000-04013812500', 0.00048661800486618007), ('0427400-04013422509', 0.00019557989438685703)]
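
The keys in the output above appear to identify census tracts, so each geo-tagged tweet has to be assigned to a tract polygon at some point. The sketch below shows one way to do that with a ray-casting point-in-polygon test. It is an assumption, not the project's confirmed method: the function names, the toy square "tracts", and their ids are hypothetical, and real tract boundaries (multi-part shapes, holes) would likely call for a geometry library and a spatial index at this data size.

```python
def point_in_polygon(lon, lat, ring):
    """Ray-casting test: True if (lon, lat) lies inside `ring`.

    `ring` is a list of (lon, lat) vertices describing one simple polygon.
    """
    inside = False
    j = len(ring) - 1
    for i in range(len(ring)):
        xi, yi = ring[i]
        xj, yj = ring[j]
        # Does the edge j -> i straddle the horizontal line through the point?
        if (yi > lat) != (yj > lat):
            # Longitude at which the edge crosses that horizontal line
            x_cross = xi + (lat - yi) * (xj - xi) / (yj - yi)
            if lon < x_cross:
                inside = not inside
        j = i
    return inside


def assign_tract(lon, lat, tracts):
    """Return the id of the first tract containing the point, or None."""
    for tract_id, ring in tracts.items():
        if point_in_polygon(lon, lat, ring):
            return tract_id
    return None


# Toy boundaries with hypothetical ids (not real census tracts)
tracts = {
    "tract-A": [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)],
    "tract-B": [(1.0, 0.0), (2.0, 0.0), (2.0, 1.0), (1.0, 1.0)],
}
tract_for_tweet = assign_tract(1.5, 0.5, tracts)  # falls in "tract-B"
```

A linear scan over all tracts per tweet is only workable for small inputs; it is kept here to make the geometry test itself easy to follow.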