112 Commits

Author SHA1 Message Date
Michael Vines 7ee4dec3f1 Upgrade GCE GPU image to 18.04 2018-11-12 12:18:50 -08:00
Michael Vines c07d09c011 Add net/scp.sh for easier file transfer to/from network nodes 2018-11-12 11:48:53 -08:00
Michael Vines 3466f139a4 set -e shuffling 2018-11-11 16:24:36 -08:00
Michael Vines def7d156f6 codemod --extensions sh '#!/usr/bin/env bash -e' '#!/usr/bin/env bash\nset -e' 2018-11-11 16:24:36 -08:00
Michael Vines 33aab094ef codemod --extensions sh '#!/bin/bash' '#!/usr/bin/env bash' 2018-11-11 16:24:36 -08:00
Michael Vines cf6f344ccc Add CUDA_HOME env var to permit overriding the CUDA install location 2018-11-11 16:24:18 -08:00
Michael Vines 49014393e1 Be less fancy for bash 4.4 compat 2018-11-10 18:05:55 -08:00
Michael Vines 818d03c835 Bump earlyoom version 2018-11-10 15:56:17 -08:00
Michael Vines b8261d7d83 Determine network version for tar and local deploys 2018-11-08 22:02:42 -08:00
Michael Vines 51ed48941b Continue if docker0 is not present 2018-11-07 19:33:20 -08:00
Michael Vines 87ac549689 Work around AWS key management limitation 2018-11-07 18:48:27 -08:00
Michael Vines f8f11b7f50 Remove docker0 interface if present 2018-11-07 18:23:24 -08:00
Michael Vines 82f914e0dc Work around AWS boot check weirdness 2018-11-07 15:46:04 -08:00
Michael Vines 9359cc69d5 Invert gpu check 2018-11-07 14:44:40 -08:00
Michael Vines b02b636b36 Support local tarball deploys 2018-11-07 14:44:40 -08:00
Michael Vines a537154c28 Remove all cuda dependencies from release tarball beyond solana-fullnode-cuda 2018-11-07 14:44:40 -08:00
Michael Vines 16d23292dc Improve error messages 2018-11-07 10:35:10 -08:00
Michael Vines 2ef8ebe111 AWS AMIs are region specific 2018-11-07 10:05:58 -08:00
Michael Vines f8673931b8 Increase boot timeout 2018-11-07 08:32:15 -08:00
Michael Vines dd4fb7aa90 Add AWS-based nets 2018-11-07 07:47:39 -08:00
Michael Vines c4bc331663 Add support for using a release tar 2018-11-07 07:47:39 -08:00
Michael Vines cd18a1b7db t 2018-11-06 14:08:47 -08:00
Michael Vines 6aac096c77 Add timeout to prevent a stuck ssh 2018-11-06 14:08:28 -08:00
Michael Vines 7b58bd621a
Remove node check from client start-up
If the network loses a validator or two, it's the job of the sanity
check to detect this, not the bench clients
2018-11-06 13:57:06 -08:00
Michael Vines 1a7830f460 Set imageName if G 2018-11-05 13:33:42 -08:00
Michael Vines 8041461a07 Bump EC2 validator machine type 2018-11-05 08:47:51 -08:00
Michael Vines eae9372a5d Upgrade GCP CPU-based testnet to 18.04 2018-11-04 19:18:47 -08:00
Michael Vines f3b04894b9 Try harder to snap download 2018-11-03 00:29:13 +00:00
Pankaj Garg 85869552e0
Update testnet scripts to use release tar ball (#1660)
* Update testnet scripts to use release tar ball

* use curl instead of s3cmd
2018-10-30 18:05:38 -07:00
Pankaj Garg 3cc78d3a41
Added a new remote node configuration script to set rmem/wmem (#1647)
* Added a new remote node configuration script to set rmem/wmem

* Update common.sh for rmem/wmem configuration
2018-10-30 09:17:35 -07:00
Pankaj Garg fbde9bb731
Run bench-tps for longer duration in testnet (#1638)
- Increased to 2+ hours
2018-10-29 15:03:08 -07:00
Pankaj Garg 7abd456d45 Increase rmem and wmem for remote nodes in testnet (#1635) 2018-10-29 13:04:54 -07:00
Michael Vines 489894cb32 Mention logs more 2018-10-27 08:49:52 -07:00
Pankaj Garg dfde83bdce Wildcard early OOM deb package revision (#1554) 2018-10-19 14:17:19 -07:00
Pankaj Garg 30c79fd40d
Change validator node machine type (#1537)
- The current nodes are using lower RAM compared to leader/clients
2018-10-17 17:16:50 -07:00
Pankaj Garg 32fc0cd7e9
Fix bug introduced during RUST_LOG escaping (#1507)
* Fix bug introduced during RUST_LOG escaping
- remote node configuration should not be quoted

* shellcheck disable SC2090
2018-10-15 16:49:22 -07:00
Pankaj Garg 9fc30f6db4
Escape RUST_LOG configuration in remote-node.sh (#1489)
* Escape RUST_LOG configuration in remote-node.sh

- If it was set to #, it was causing other parameters to be commented out

* escape other variables as well

* disabled shell check

* Fix shellcheck error
2018-10-13 13:35:54 -07:00
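
The escaping problem described in the commit above can be illustrated with a minimal bash sketch. This is not the code from remote-node.sh: the variable names and the use of `eval` to stand in for a command line re-parsed on a remote host are assumptions made for illustration. It shows that an unquoted leading `#` turns the rest of the line into a comment, and that bash's `printf '%q'` keeps the value intact as a single argument.

```shell
#!/usr/bin/env bash
set -e

# Hypothetical argument value; an unquoted word starting with "#" begins a comment.
value='#'

# Unescaped: when the string is re-parsed, "#" and everything after it is dropped.
unsafe=$(eval "printf '%s,' first $value third")

# Escaped with bash's printf %q: the value survives re-parsing as one argument.
safe=$(eval "printf '%s,' first $(printf '%q' "$value") third")

echo "unsafe: $unsafe"
echo "safe:   $safe"
```

Note that the later commit 32fc0cd7e9 reports that not every variable should be quoted this way, so escaping has to be applied per argument rather than wholesale.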
Michael Vines 5c523716aa Ship native programs 2018-10-10 16:49:48 -07:00
Pankaj Garg 0a39722719
Add support to trigger testnet from a PR (#1434)
* Add support for different node counts

* Update variable names

* Delete network even after failures

* Add array for node counts

* Changed number of nodes to a space separated string of numbers

* Adjust number of nodes

* Snap will not be published if the env variable DO_NOT_PUBLISH_SNAP is set

* Address review comments

* Replaced influx db URL
2018-10-05 16:32:05 -07:00
Michael Vines b1e941cab9 Return all instances 2018-10-01 07:51:48 -07:00
Pankaj Garg 7fb7839c8f
Configure GPU type/count from command line in GCE scripts (#1376)
* Configure GPU type/count from command line in GCE scripts

* Change CLI to input full leader machine type information with GPU
2018-09-27 11:55:56 -07:00
sakridge 3199f174a3 Add option to pass boot disk type to gce create (#1308) 2018-09-22 16:43:47 -07:00
Tyera Eulberg f273351789 Add missing port number 2018-09-18 09:36:54 -06:00
Tyera Eulberg 0125163190 Remove wallet.sh, update entrypoint syntax for wallet network argument 2018-09-17 11:53:33 -06:00
Michael Vines 155ee8792f Add GPU support to ec2-provider 2018-09-17 09:26:25 -07:00
Michael Vines f89f121d2b Add AWS EC2 support 2018-09-17 09:26:25 -07:00
Pankaj Garg be7cce1fd2
Tweak GCE scripts for higher node count (#1229)
* Tweak GCE scripts for higher node count

- Some validators were unable to rsync config from leader when
  the node count was high (e.g. 25). Looks like the leader node was
  getting more rsync requests in parallel than it could handle.
- This change staggers validator boot-up and rsync times

* Address review comments
2018-09-14 17:17:08 -07:00
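
The staggering idea from the commit above can be sketched as follows. This is a rough illustration, not the change that was actually made to the GCE scripts: the `stagger_delay` helper, the zero-based validator index, and the 2-second default step are all assumptions. The point is that each validator waits a little longer than the previous one, so the leader does not receive every rsync request at the same moment.

```shell
#!/usr/bin/env bash
set -e

# Hypothetical helper: start-up delay in seconds for a given validator.
stagger_delay() {
  local index=$1        # zero-based validator index (assumed numbering)
  local step=${2:-2}    # seconds between consecutive validators (assumed value)
  echo $(( index * step ))
}

# Show the delay each of four validators would wait before its rsync.
for i in 0 1 2 3; do
  echo "validator $i would sleep $(stagger_delay "$i")s before rsync"
done
```

In a real deployment the computed value would be passed to `sleep` on each validator before it contacts the leader.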
Michael Vines ee74b367ce Add docker install script 2018-09-12 17:09:37 -07:00
Michael Vines f06113500d bench-tps/net sanity: add ability to check for unexpected extra nodes 2018-09-12 15:38:57 -07:00
Michael Vines af3eb5a16c .sh 2018-09-11 11:29:49 -07:00