Questions tagged [nvidia]

An American global technology company based in Santa Clara, California, best known for its graphics processors (GPUs).

62 questions
2
votes
1 answer

Pod is stuck in PodInitializing status when an initContainer is OOMKilled

I have the following on-prem Kubernetes environment: OS: Red Hat Enterprise Linux release 8.6 (Ootpa) Kubernetes: 1.23.7 (single-node, build with kubeadm) NVIDIA driver: 515.65.01 nvidia-container-toolkit: 1.10.0-1.x86_64 (rpm) containerd:…
Daigo
  • 278
  • 1
  • 17
2
votes
2 answers

Alternative to nvidia-settings GpuPowerMizerMode in Ubuntu?

We have a Ubuntu 20.04 server with Nvidia GPUs and want to change the Power Mode / GpuPowerMizerMode to Prefer Maximum Performance. One way to do this is nvidia-settings -a "[gpu:0]/GpuPowerMizerMode=1" However, that would require an X session with…
1
vote
1 answer

Virtualisation primary GPU

The Server is running on Proxmox VE. My goal is to use any GPU in a VM. So I blacklisted nvidia noveau radeon amdgpu to ensure all GPUs are correctly accessible to assign the VFIO driver. I've added all the ids from lspci -vnn to…
1
vote
0 answers

nvidia-smi must be run by root before it can be used by regular users

On a newly built Ubuntu 16.04 machine, running nvidia-smi fails as a regular user $ nvidia-smi NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running. Running…
hanxue
  • 1,367
  • 2
  • 11
  • 12
1
vote
0 answers

hypervisor.cpuid.v0 or hidden state='on' equivlent in hyper-v

I'd like to hide to a vm, that it's being virtualized on hyper-v. I've done: ExposeVirtualizationExtensions : True But it doesn't seem to have the same effect. The goal is to pass a nvidia geforce 1080 to a linux vm for the purpose of tensorflow.…
1
vote
1 answer

Unable to use gpu in azure windows server 2016

I am trying to run a GPU intensive application(Lumion) on the Azure cloud. Image used - Windows Server 2016. Hardware - NV6_Promo with 1xK80 GPU. Any application, when launched, run without using the GPU. The system has the latest Nvidia Tesla…
Nithin Jose
  • 149
  • 7
1
vote
1 answer

GLX is compiled with wrong version (Display resolution and hardware acceleration stopped working)

I have two Ubuntu 14 Desktops (identical). Both were working fine until Friday. Some updates appear to have been performed on the non-working machine... 2019-03-18 02:29:32 install linux-base:all 4.5ubuntu1~14.04.1 2019-03-18 02:29:32…
BurningKrome
  • 525
  • 2
  • 8
  • 21
1
vote
1 answer

ESXi Tesla passthrough enabled but not assignable

I am facing an issue with an ESXi ( 6.7.0 Update 1 ) and the passthrough of a GPU card (NVIDIA Tesla P4). The GPU card is listed in the "Passthrough capable" PCI Devices section as "Enabled / Needs reboot" but no matter the number of time the ESXi…
JohnLoopM
  • 151
  • 6
1
vote
0 answers

Simultaneous usage of Nvidia and AMD GPUs

I have a server which hosts three different GPU platforms: Onboard GPU, Nvidia and AMD GPUs. I have not installed X server, as I do not intend to bring the desktop up. I always use ssh and use the GPUs for computations only. Now, I have been able to…
Arya Mz
  • 111
  • 1
1
vote
1 answer

Google Cloud - Monitor running on Microsoft Display Driver instead of NVIDIA K80 GPU

My Google Cloud Instance is running on Microsoft Display Driver,instead of the GPU.I tried to install Hyper-V,but Google Cloud Processors don't support it.Please help,i need to run Unity,but can't without the GPU.I'm running 6-core Intel Xeon CPU(i…
1
vote
1 answer

apt-get bricked by nvidia drivers

I was updating my machine while some drivers crashed. After the reboot my X server was broken and I have reinstalled it. Now apt-get is stuck with this error: ╭─phra at kali in /home/phra ╰─λ sudo apt-get install -f …
phra
  • 41
  • 8
1
vote
1 answer

Nvidia Pascal architecture: DMA Size / maximum amount of host system RAM?

We are planning to build a pair of multi-GPU Linux servers for machine learning and data science tasks. Per our requirements, we need to put a lot of RAM in these machines; we're planning on 24x 64GiB LRDIMMs for a total of 1.5TiB. For GPUs, we were…
mvoelske
  • 111
  • 3
1
vote
2 answers

Ganglia's GPU Nvidia module: do we need to patch the ganglia-webfrontend?

I am trying to add the GPU Nvidia module in ganglia (/ganglia/gmond_python_modules/gpu/nvidia/). Do we need to apply the ganglia_web.patch patch? If I do not apply the patch, I don't see any GPU metrics when I go to http://localhost/ganglia/ If I…
Franck Dernoncourt
  • 940
  • 1
  • 12
  • 28
1
vote
0 answers

nvidia driver displaying odd bios,uuid under Grid K2

I have a number of servers that have GRID K2 nvidia Tesla cards in. Initially these were working fine. But I recently upgraded the kernel driver and have found a problem where CUDA based apps were no longer detecting GPU's being present. On closer…
hookenz
  • 14,132
  • 22
  • 86
  • 142
1
vote
0 answers

Linux: cuda (pytorch) does not allocate available vram

I am trying out pixray/clipit but cuda fails to allocate the remaining 1GiB of my graphics card. My graphics card is "Nvidia GTX 1660 super" which has the same amount of RAM as the "Nvidia GTX 1660 Ti" which belongs to somebody I know - and it…
france1
  • 23
  • 9