项目作者: dask

项目描述 :
Start a cluster in EC2 for dask.distributed
高级语言: Python
项目地址: git://github.com/dask/dask-ec2.git
创建时间: 2016-02-11T04:31:04Z
项目社区:https://github.com/dask/dask-ec2

开源协议:

下载


ARCHIVED

As of November 3rd, 2020 this respository is now archived. Please consult
the Dask Cloud docs page
for more information on deploying Dask with cloud resources.

Dask EC2 Build Status Coverage Status

Easily launch a cluster on Amazon EC2 configured with dask.distributed,
Jupyter Notebooks, and Anaconda.

Installation

You also install dask-ec2 using pip:

  1. $ pip install dask-ec2

You can also install dask-ec2 and its dependencies from the
conda-forge repository using
conda:

  1. $ conda install dask-ec2 -c conda-forge

Usage

Note: dask-ec2 uses
boto3 to interact with
Amazon EC2. You can configure your AWS credentials using
Environment Variables
or Configuration Files.

The dask-ec2 up command can be used to create and provision a cluster on Amazon EC2:

  1. $ dask-ec2 up --help
  2. Usage: dask-ec2 up [OPTIONS]
  3. Options:
  4. --keyname TEXT Keyname on EC2 console [required]
  5. --keypair PATH Path to the keypair that matches the keyname
  6. [required]
  7. --name TEXT Tag name on EC2
  8. --tags TEXT Additional EC2 tags. Comma separated K:V
  9. pairs: K1:V1,K2:V2
  10. --region-name TEXT AWS region [default: us-east-1]
  11. --vpc-id TEXT EC2 VPC ID
  12. --subnet-id TEXT EC2 Subnet ID on the VPC
  13. --iaminstance-name TEXT IAM Instance Name
  14. --ami TEXT EC2 AMI [default: ami-d05e75b8]
  15. --username TEXT User to SSH to the AMI [default: ubuntu]
  16. --type TEXT EC2 Instance Type [default: m3.2xlarge]
  17. --count INTEGER Number of nodes [default: 4]
  18. --security-group TEXT Security Group Name [default: dask-ec2-default]
  19. --security-group-id TEXT Security Group ID (overwrites Security Group
  20. Name)
  21. --volume-type TEXT Root volume type [default: gp2]
  22. --volume-size INTEGER Root volume size (GB) [default: 500]
  23. --file PATH File to save the metadata [default:
  24. cluster.yaml]
  25. --provision / --no-provision Provision salt on the nodes [default: True]
  26. --anaconda / --no-anaconda Bootstrap anaconda [default: True]
  27. --dask / --no-dask Install Dask.Distributed in the cluster
  28. [default: True]
  29. --notebook / --no-notebook Start a Jupyter Notebook in the head node
  30. [default: True]
  31. --nprocs INTEGER Number of processes per worker [default: 1]
  32. --source / --no-source Install Dask/Distributed from git master
  33. [default: False]
  34. -h, --help Show this message and exit.

The minimal required arguments for the dask-ec2 up command are:

  1. $ dask-ec2 up --keyname my_aws_key --keypair ~/.ssh/my_aws_key.pem

This will create a cluster.yaml in the directory that it was executed, and
this file is required to use the other commands in the CLI.

Once a cluster is running, the dask-ec2 command can be used to create or destroy
a cluster, ssh into nodes, or other functions:

  1. $ dask-ec2
  2. Usage: dask-ec2 [OPTIONS] COMMAND [ARGS]...
  3. Options:
  4. --version Show the version and exit.
  5. -h, --help Show this message and exit.
  6. Commands:
  7. anaconda Provision anaconda
  8. dask-distributed dask.distributed option
  9. destroy Destroy cluster
  10. notebook Provision the Jupyter notebook
  11. provision Provision salt instances
  12. ssh SSH to one of the node. 0-index
  13. up Launch instances