kurtschelfthout/meadowrun

Meadowrun

Join the chat at https://gitter.im/meadowdata/meadowrun

Meadowrun is a library for data scientists and data engineers who run Python code on AWS, Azure, or Kubernetes. Meadowrun:

  • scales from a single function to thousands of distributed tasks.
  • syncs your local code and libraries for a faster, easier iteration loop. Edit your code and rerun your analysis without worrying about building packages or Docker images.
  • optimizes for cost, choosing the cheapest instance types and turning them off when they're no longer needed.

For more context, see our case studies of how Meadowrun is used in real life, or visit the project homepage.

To get started, go to our documentation, or join the chat on Gitter.

Quickstart

First, install Meadowrun using pip:

pip install meadowrun

Next, assuming you've configured the AWS CLI and are a root/administrator user, you can run:

import meadowrun
import asyncio

print(
    asyncio.run(
        meadowrun.run_function(
            lambda: sum(range(1000)) / 1000,
            meadowrun.AllocEC2Instance(),
            meadowrun.Resources(logical_cpu=1, memory_gb=8, max_eviction_rate=80),
            meadowrun.Deployment.mirror_local()
        )
    )
)
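
As a quick sanity check, the function passed to run_function can be evaluated locally first. Assuming the remote run succeeds, it should print the same value. The helper name mean_of_range below is hypothetical, introduced only for this illustration; it is not part of Meadowrun.

```python
def mean_of_range(n: int = 1000) -> float:
    """Same computation as the lambda in the quickstart (hypothetical helper)."""
    return sum(range(n)) / n


# Evaluate locally, without allocating any cloud resources.
local_result = mean_of_range()
print(local_result)  # 499.5
```

Comparing this local value with the output of the remote call is a simple way to confirm that the code and environment were mirrored as expected.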

The documentation has examples of how to use other package managers (conda, poetry), and other platforms (Azure, GKE, Kubernetes).
