Tropical deforestation is one of the main challenges thwarting climate change and biodiversity conservation efforts worldwide. Thanks to the improved quality and resoultion of satellite imagery available, research organizations are better equipped to monitor land use changes and track their association with economic, political and social changes. Deforestation dynamics in the Brazilian Amazon are complex but offer a great oportunity to connect different types of data and predict local deforestation rates that could help enforcement agencies anticipate and prioritize areas of action.
This first portfolio piece is part of a project I worked on for a Data science program, modelling deforestation rates associated with soymeal production in Brazil. I first scraped deforestation and commodity price data from publicly available sources such as deforestation databases (Trase) and a stock exchage (Nasdaq) and then completed analysis in three parts:
- Used Python Pandas libraries for data wrangling and exploratory analysis of datasets with over 295K rows of 10-14 years of data.
- Engineered features to quantify and aggregate contributions from different regions and soy products.
- Created visualizations with Matplotlib and Tableau.
Brazilian soymeal deforestation risk trends, 2004-2018
Brazil soymeal export destinations. Darker shade indicates higher transaction value in USD
What the data showed: With very few exceptions, most countries buy soy from Brazil. After a soy moratorium agreement, some soy production moved to areas previously deforested for cattle ranching, but the largest proportion of soymeal transactions were carried out by producers or buyers that did not subscribe to no-deforestation commitments. Despite the shift from primary to secondary clearings, soy crops continue to drive deforestion.
{% include index.html %}
- Built robust models by optimizing linear and random forest regressors using GridSearch CV.
What the data showed: Fluctuations in soymeal prices explained almost 20% of the variance in deforestation risk, illustrating the dynamic nature of incentives for deforestation despite international agreements and policy advances.
- Used Facebook Prophet time series analysis to predict soy meal prices in the short term range.
What the analysis revealed: Based on the period I looked at, deforestation is forecasted to continue in the near future, partially in response to increasing soy prices. Time series predictions would benefit from a more accurate selection of lag-time.