Skip to main content

EarthStat Library

Project description

earthstat

image image

Inspired through participating in the AgML community's "Forecast Subnational Yield" activity, this Python library emerges as a vital tool for professionals and researchers engaged with remote sensing raster data. Designed with a focus on processing huge amount of TIFF files, our package excels at extracting statistical information for specific spatial units. By converting raster datasets into easily accessible CSV files. This library Ideal to prepare csv datasets for training Machine Learning (ML) models for different purposes. Also, significantly enhances the ability to leverage remote sensing data for impactful analyses (monitoring climate change, etc.). AgML community and the challenge of forecasting subnational agricultural yields has directly influenced the development of this library, ensuring it meets the high standards required for advanced environmental and agricultural data processing.

Features

EarthStat simplifies geospatial analysis by streamlining the extraction of statistical information from spatial units, providing a comprehensive toolset for efficient data processing and analysis:

  • Easy Data Preparation: Define directories for raster files, shapefile paths, and masks effortlessly. Our library automates compatibility checks for pixel size, projection, and alignment between raster data (TIFF), mask, and shapefiles, ensuring smooth data integration.

  • Efficient ROI Selection & Raster Clipping: Select your Region of Interest (ROI) with an intuitive filtering function. Clip raster data to your ROI quickly with just one line of code.

  • Advanced Data Aggregation: aggregates raster data across selected spatial units but also intelligently masks the raster data during the aggregation process.

  • Comprehensive Data Export: After aggregation, the library compiles the aggregated data along with related information from the shapefile into CSV files for each spatial unit.

EarthStat Python Library - Improvements Roadmap

Data Processing and Scenario Management Enhancements

  • offering more statistical options for aggregation.
  • Introduce thresholding option for masks to refine data selection.
  • Refactor Dataloader and Data Compatibility for no mask scenario.
  • Merge individual data initialization functions into a single function, streamlining user interaction and input handling.

Automation for User Convenience

  • Implement automatic detection of the lag between date ranges of predictor data.
  • Automatically identify the column names for countries in the dataset.
  • Enable users to specify date ranges for predictor data, improving data filtering capabilities.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

earthstat-0.2.0.tar.gz (23.3 kB view hashes)

Uploaded Source

Built Distribution

earthstat-0.2.0-py2.py3-none-any.whl (14.5 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page