GitHub

Meadowrun is a library for data scientists and data engineers who run python code on AWS, Azure, or Kubernetes. Meadowrun:

scales from a single function to thousands of distributed tasks.
syncs your local code and libraries for a faster, easier iteration loop. Edit your code and rerun your analysis without worrying about building packages or Docker images.
optimizes for cost, choosing the cheapest instance types and turning them off when they're no longer needed.

For more context, see our case studies of how Meadowrun is used in real life, or see the project homepage

To get started, go to our documentation, or join the chat on Gitter

Quickstart

First, install Meadowrun using pip:

pip install meadowrun

Next, assuming you've configured the AWS CLI and are a root/administrator user, you can run:

import meadowrun
import asyncio

print(
    asyncio.run(
        meadowrun.run_function(
            lambda: sum(range(1000)) / 1000,
            meadowrun.AllocEC2Instance(),
            meadowrun.Resources(logical_cpu=1, memory_gb=8, max_eviction_rate=80),
            meadowrun.Deployment.mirror_local()
        )
    )
)

The documentation has examples of how to use other package managers (conda, poetry), and other platforms (Azure, GKE, Kubernetes).

Name		Name	Last commit message	Last commit date
Latest commit History 693 Commits
.github		.github
build_scripts		build_scripts
docker_images		docker_images
docs		docs
src		src
tests		tests
.flake8		.flake8
.gitignore		.gitignore
.readthedocs.yaml		.readthedocs.yaml
LICENSE		LICENSE
README.md		README.md
clean_test_data.bat		clean_test_data.bat
generate_protobufs.bat		generate_protobufs.bat
generate_protobufs.sh		generate_protobufs.sh
meadowrun-logo-full.svg		meadowrun-logo-full.svg
mkdocs.yml		mkdocs.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

License

kurtschelfthout/meadowrun

Folders and files

Latest commit

History

Repository files navigation

Quickstart

About

Resources

License

Stars

Watchers

Forks

Languages