Grow organised and FAIR data

Sprout is a Python package for building up, describing, and managing research data. It structures your data as standardised Data Packages that follow the FAIR principles, giving you flexibility and control over each step of the data management workflow.

Key features ✨

  • Creates and manages standardised Data Packages with metadata stored in datapackage.json.
  • Checks both your metadata and your data against the Data Package standard, with clear messages pointing directly to where issues occur.
  • Stores data in Apache Parquet, an open, efficient, and widely supported format.
  • Tracks data changes through a batch system with automatic versioning, supporting data lineage and auditability.
  • Auto-extracts field properties from your data, so you spend less time writing metadata manually.
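
To make the features above more concrete, here is a minimal sketch of what a Data Package descriptor (the content of datapackage.json) can look like, with field properties guessed from sample rows. It uses only the Python standard library and does not call Sprout itself. The helper names (`infer_fields`, `build_descriptor`), the sample rows, the package name, and the file path are all hypothetical and chosen for illustration; Sprout's own functions and required properties may differ.

```python
import json

# Hypothetical sample rows; not taken from Sprout or its documentation.
rows = [
    {"id": 1, "height_cm": 172.5, "enrolled": True, "site": "Aarhus"},
    {"id": 2, "height_cm": 168.0, "enrolled": False, "site": "Odense"},
]

# Map Python types to Table Schema field type names.
TYPE_NAMES = {bool: "boolean", int: "integer", float: "number", str: "string"}


def infer_fields(rows):
    """Guess field names and types from the first row (illustrative only)."""
    return [
        {"name": name, "type": TYPE_NAMES.get(type(value), "any")}
        for name, value in rows[0].items()
    ]


def build_descriptor(package_name, resource_name, path, rows):
    """Assemble a minimal Data Package descriptor as a plain dict."""
    return {
        "name": package_name,
        "resources": [
            {
                "name": resource_name,
                "path": path,  # hypothetical location of the Parquet file
                "format": "parquet",
                "schema": {"fields": infer_fields(rows)},
            }
        ],
    }


descriptor = build_descriptor(
    "example-package", "participants", "data/participants.parquet", rows
)

# This JSON text is what would be saved as datapackage.json.
print(json.dumps(descriptor, indent=2))
```

A real workflow would infer types from the whole dataset rather than a single row, and would check the descriptor against the Data Package standard before saving it.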

About the Seedcase Project

Sprout is part of the Seedcase Project, which helps research projects manage and process data in line with the FAIR principles.

Check out the Seedcase Project

Want to contribute?

We would love your feedback or contributions! Head over to the GitHub repository to share your ideas or contribute code. Your input makes us better!

 
  • License: CC-BY 4.0

  • Code of Conduct

  • Contributing

  • Funded by the Novo Nordisk Foundation
