122 Commits

Author SHA1 Message Date
Michael Vines 33a5d5fe93 Enable debug builds by default for better backtraces 2018-11-17 10:52:08 -08:00
Michael Vines d96a6b42a5 Move drone into its own crate 2018-11-16 20:42:21 -08:00
carllin cf95708c18 Set drone address to always be the initial network entry point (#1847)
* Set drone address to always be the initial network entry point, so that even when leaders rotate the client can still find the drone

* Extract drone address as a separate argument to bench-tps

* Add drone port to client.sh instead of setting it in bench-tps

* Add drone entrypoint to scripts

* Fix build error
2018-11-16 19:56:26 -08:00
Sathish c973de1d76 Decouple log and metrics rate (#1839)
Use separate env for log and metrics rate.

Set default log level to WARN if unset.
2018-11-15 22:27:16 -08:00
Michael Vines 83fc3c10cf Setup CUDA env for local builds 2018-11-15 08:00:52 -08:00
Michael Vines 017c281eaf Remove CUDA support from Snap 2018-11-12 20:31:16 -08:00
Michael Vines c5b1bc1128 Remove obsolete update-default-cuda.sh 2018-11-12 20:31:16 -08:00
Michael Vines 9e7b9487b0 perf-libs now drives setting CUDA_HOME 2018-11-12 18:49:15 -08:00
Michael Vines 851e012c6c Upgrade EC2 image to 18.04 with CUDA 9.2 and 10 2018-11-12 15:17:34 -08:00
Michael Vines 7f76403d0a Clean ~/solana during network start to avoid tripping over leftover files 2018-11-12 15:09:14 -08:00
Michael Vines 7ee4dec3f1 Upgrade GCE GPU image to 18.04 2018-11-12 12:18:50 -08:00
Michael Vines c07d09c011 Add net/scp.sh for easier file transfer to/from network nodes 2018-11-12 11:48:53 -08:00
Michael Vines 3466f139a4 set -e shuffling 2018-11-11 16:24:36 -08:00
Michael Vines def7d156f6 codemod --extensions sh '#!/usr/bin/env bash -e' '#!/usr/bin/env bash\nset -e' 2018-11-11 16:24:36 -08:00
Michael Vines 33aab094ef codemod --extensions sh '#!/bin/bash' '#!/usr/bin/env bash' 2018-11-11 16:24:36 -08:00
Michael Vines cf6f344ccc Add CUDA_HOME env var to permit overriding the CUDA install location 2018-11-11 16:24:18 -08:00
Michael Vines 49014393e1 Be less fancy for bash 4.4 compat 2018-11-10 18:05:55 -08:00
Michael Vines 818d03c835 Bump earlyoom version 2018-11-10 15:56:17 -08:00
Michael Vines b8261d7d83 Determine network version for tar and local deploys 2018-11-08 22:02:42 -08:00
Michael Vines 51ed48941b Continue if docker0 is not present 2018-11-07 19:33:20 -08:00
Michael Vines 87ac549689 Work around AWS key management limitation 2018-11-07 18:48:27 -08:00
Michael Vines f8f11b7f50 Remove docker0 interface if present 2018-11-07 18:23:24 -08:00
Michael Vines 82f914e0dc Work around AWS boot check weirdness 2018-11-07 15:46:04 -08:00
Michael Vines 9359cc69d5 Invert gpu check 2018-11-07 14:44:40 -08:00
Michael Vines b02b636b36 Support local tarball deploys 2018-11-07 14:44:40 -08:00
Michael Vines a537154c28 Remove all cuda dependencies from release tarball beyond solana-fullnode-cuda 2018-11-07 14:44:40 -08:00
Michael Vines 16d23292dc Improve error messages 2018-11-07 10:35:10 -08:00
Michael Vines 2ef8ebe111 AWS AMIs are region specific 2018-11-07 10:05:58 -08:00
Michael Vines f8673931b8 Increase boot timeout 2018-11-07 08:32:15 -08:00
Michael Vines dd4fb7aa90 Add AWS-based nets 2018-11-07 07:47:39 -08:00
Michael Vines c4bc331663 Add support for using a release tar 2018-11-07 07:47:39 -08:00
Michael Vines cd18a1b7db t 2018-11-06 14:08:47 -08:00
Michael Vines 6aac096c77 Add timeout to prevent a stuck ssh 2018-11-06 14:08:28 -08:00
Michael Vines 7b58bd621a Remove node check from client start-up
If the network loses a validator or two, it's the job of the sanity
check to detect this not the bench clients
2018-11-06 13:57:06 -08:00
Michael Vines 1a7830f460 Set imageName if G 2018-11-05 13:33:42 -08:00
Michael Vines 8041461a07 Bump EC2 validator machine type 2018-11-05 08:47:51 -08:00
Michael Vines eae9372a5d Upgrade GCP CPU-based testnet to 18.04 2018-11-04 19:18:47 -08:00
Michael Vines f3b04894b9 Try harder to snap download 2018-11-03 00:29:13 +00:00
Pankaj Garg 85869552e0 Update testnet scripts to use release tar ball (#1660)
* Update testnet scripts to use release tar ball

* use curl instead of s3cmd
2018-10-30 18:05:38 -07:00
Pankaj Garg 3cc78d3a41 Added a new remote node configuration script to set rmem/wmem (#1647)
* Added a new remote node configuration script to set rmem/wmem

* Update common.sh for rmem/wmem configuration
2018-10-30 09:17:35 -07:00
Pankaj Garg fbde9bb731 Run bench-tps for longer duration in testnet (#1638)
- Increased to 2+ hours
2018-10-29 15:03:08 -07:00
Pankaj Garg 7abd456d45 Increase rmem and wmem for remote nodes in testnet (#1635) 2018-10-29 13:04:54 -07:00
Michael Vines 489894cb32 Mention logs more 2018-10-27 08:49:52 -07:00
Pankaj Garg dfde83bdce Wildcard early OOM deb package revision (#1554) 2018-10-19 14:17:19 -07:00
Pankaj Garg 30c79fd40d Change validator node machine type (#1537)
- The current nodes are using lower RAM compared to leader/clients
2018-10-17 17:16:50 -07:00
Pankaj Garg 32fc0cd7e9 Fix bug introduced during RUST_LOG escaping (#1507)
* Fix bug introduced during RUST_LOG escaping
- remote node configuration should not be quoted

* shellcheck disable SC2090
2018-10-15 16:49:22 -07:00
Pankaj Garg 9fc30f6db4 Escape RUST_LOG configuration in remote-node.sh (#1489)
* Escape RUST_LOG configuration in remote-node.sh

- If it was set to #, it was causing other parameters to be commented out

* escape other variables as well

* disabled shell check

* Fix shellcheck error
2018-10-13 13:35:54 -07:00
Michael Vines 5c523716aa Ship native programs 2018-10-10 16:49:48 -07:00
Pankaj Garg 0a39722719 Add support to trigger testnet from a PR (#1434)
* Add support for different node counts

* Update variable names

* Delete network even after failures

* Add array for node counts

* Changed number of nodes to a space separated string of numbers

* Adjust number of nodes

* Snap will not be published if the env variable DO_NOT_PUBLISH_SNAP is set

* Address review comments

* Replaced influx db URL
2018-10-05 16:32:05 -07:00
Michael Vines b1e941cab9 Return all instances 2018-10-01 07:51:48 -07:00