Questions tagged [nomad]

Nomad is a product created by HashiCorp that is able to manage a cluster of nodes and software.

13 questions
4
votes
1 answer

Hashicorp's Nomad as a HTCondor alternative?

I couldn't find any info on this short of spinning up a Nomad cluster and experimenting so maybe someone here may be able to help. Saying you want to run 100 iterations of a batch java job, each with a different set of parameters and get the…
Frankie
  • 419
  • 1
  • 6
  • 19
3
votes
0 answers

How to get Docker to retry image pull?

Our Nomad agents sometimes fails jobs because they cannot pull images from ECR. /var/log/docker will contain messages like: Not continuing with pull after error: error pulling image configuration: Get …
Bittrance
  • 2,970
  • 2
  • 21
  • 27
2
votes
0 answers

Promoting Nomad task meta tags via consul to Prometheus

We are using Nomad to schedule tasks which are then registered in Consul from where they are picked up in Prometheus scraping. service.hcl: job "myjob" { group "mygroup" { count = "1" task "mytask" { driver = "docker" config { …
DaDaDom
  • 532
  • 6
  • 16
2
votes
1 answer

nomad shows job as dead

I am new to nomad. We are using nomad in production along with docker, consul, vault, ansible. When I run a nomad job from my local machine, Job registers successfully but show status as dead. Roadrunner-2:nomad Shailesh$ nomad run -detach…
Shailesh Sutar
  • 1,427
  • 4
  • 22
  • 40
2
votes
0 answers

GlusterFS volume stalls / blocks after quick access for PHP session files from PHP Docker container

2021-10-05 UPDATED QUESTION AND TEXT AFTER MORE ANALYSIS, STRIPPED DOWN TO MINIMAL CASE Short description A Nomad / Consul cluster is running, with Traefik (with minimal configuration) as a system task on each Nomad client. There are 3 nomad…
Paul
  • 51
  • 5
1
vote
1 answer

Load balancing prefer own subnet / availability zone first

I have 3 front end boxes that are each in their own availability zone. They all 3 run a reverse proxy server (in this case traefik). How can I keep the traffic within the AZ to avoid the inter AZ transfer costs. I'm using consul and nomad to handle…
Botto
  • 231
  • 1
  • 9
1
vote
2 answers

Interpolate values as integers in HLC files

I am writing the template for a parametrized HashiCorp Nomad job. One of its parameters is priority, which is supposed to be an integer between 0 and 100. Like other tools, Nomad supports variable interpolation, so that a variable can be defined at…
1
vote
0 answers

How to configure dynamic routing of gRPC requests with envoy, nomad and consul

We use nomad to deploy our applications - which provide gRPC endpoints - as tasks. The tasks are then registered to Consul, using nomad's service stanza. The routing for our applications is achieved with envoy proxy. We are running central envoy…
DaDaDom
  • 532
  • 6
  • 16
1
vote
1 answer

Why does the Nomad agent shutdown due to a connection refused when it is started?

Running sudo nomad agent -dev results in getsockopt: connection refused: * Get http://127.0.0.1:8500/v1/agent/checks: dial tcp 127.0.0.1:8500: getsockopt: connection refused 2017/02/19 20:31:36.971932 [INFO] client: node registration complete …
030
  • 5,731
  • 12
  • 61
  • 107
1
vote
1 answer

Nomad nodes remove each other services from Consul

I got two nodes Nomad cluster along with a Consul instance so that the jobs can register services to connect to. However, the services keep getting synced and deregistered. Here is what I have from the Consul logs: 2021-01-26T14:49:59.174Z [INFO] …
Spack
  • 1,594
  • 13
  • 22
0
votes
0 answers

How should I load balance large port range for high availablity?

I have a service which publishes individual services on different ports via consul and fabio. I have a pair of these fabio internal load balancers. They both have the complete set of valid open rules. This will be some subset of 1024-49151. …
0
votes
0 answers

High interrupt and context switch count on Nomad / Docker

We are seeing rather high interrupt and context switch counts on our Nomad clients: # vmstat 1 procs -----------memory---------- ---swap-- -----io---- -system-- ------cpu----- r b swpd free buff cache si so bi bo in cs us sy id…
Tarwin
  • 121
  • 4
0
votes
1 answer

Nomad secrets folder empty in Docker container

I'm facing an issue with Nomad's NOMAD_SECRETS_DIR folder allocation for a Docker container. I've got the following template stanza in my job description: template { destination = "/local/pki/certs/my_certificate.crt" perms = "440" data…
Spack
  • 1,594
  • 13
  • 22