Databricks: installing pyodbc fails on the cluster, and intermittently when installing from a notebook

0

I am using pyodbc in my code and tried installing it from PyPI on the cluster itself, but the installation failed. (DBX cluster runtime = 5.1ML Demo, Python version 3, i.e. 3.5.)

To work around this, I ran the following in a notebook cell to install pyodbc:

    %sh
    curl https://packages.microsoft.com/keys/microsoft.asc | apt-key add -
    curl https://packages.microsoft.com/config/ubuntu/16.04/prod.list > /etc/apt/sources.list.d/mssql-release.list
    apt-get update
    ACCEPT_EULA=Y apt-get install msodbcsql17
    apt-get -y install unixodbc-dev
    sudo apt-get install python3-pip -y
    pip3 install --upgrade pyodbc
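One thing worth noting about the cell above: the commands run one after another regardless of whether the previous one succeeded, so `pip3` can still try to compile pyodbc after an `apt-get` step has failed. A minimal sketch of a fail-fast variant follows. `run_step` and `install_pyodbc` are hypothetical helper names, and the `-y` flag on `msodbcsql17` is an addition that assumes the cell runs non-interactively. The key and repository setup lines from the original cell are assumed to have run already.

```shell
#!/bin/bash
# run_step is a hypothetical helper, not part of Databricks or apt.
# It echoes the command, runs it, and returns non-zero if the command fails.
run_step() {
  echo ">> $*"
  "$@" || { echo "FAILED: $*" >&2; return 1; }
}

# Chain the package steps with && so pip3 never runs after a failed apt-get.
# Assumption: the Microsoft key and prod.list source were added beforehand.
# The -y on msodbcsql17 is an addition for non-interactive use.
install_pyodbc() {
  run_step apt-get update &&
  run_step env ACCEPT_EULA=Y apt-get install -y msodbcsql17 &&
  run_step apt-get install -y unixodbc-dev python3-pip &&
  run_step pip3 install --upgrade pyodbc
}
```

Calling `install_pyodbc` at the end of the cell would then stop at the first failing step and print which command failed, which may make the intermittent case easier to diagnose.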

However, this also fails intermittently with the following error:

"#include ^~~~~~~ compilation terminated. error: command 'gcc' failed with exit status 1"

which is solved by restarting the cluster.
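The header name is missing from the error text above, so the following is only a guess at the cause: building pyodbc needs the unixODBC headers (`sql.h`, `sqlext.h`) that `unixodbc-dev` provides, and a `gcc` failure at an `#include` line would be consistent with those headers being absent when `pip3` runs. A small check along these lines could confirm or rule that out. `have_odbc_headers` and `check_before_pip` are hypothetical names, and `/usr/include` is the assumed install location on Ubuntu.

```shell
#!/bin/bash
# have_odbc_headers is a hypothetical helper. It succeeds only if the unixODBC
# headers sql.h and sqlext.h exist in the given directory (default /usr/include).
have_odbc_headers() {
  [ -f "${1:-/usr/include}/sql.h" ] && [ -f "${1:-/usr/include}/sqlext.h" ]
}

# Report which case applies before attempting to build pyodbc.
# Returns non-zero when the headers are missing.
check_before_pip() {
  if have_odbc_headers "$@"; then
    echo "headers present"
  else
    echo "headers missing: unixodbc-dev is probably not installed"
    return 1
  fi
}
```

Running `check_before_pip && pip3 install --upgrade pyodbc` in the failing case would show whether the `apt-get` step actually left the headers in place.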

Question: how can I reliably install the pyodbc library on the cluster? It seems to work fine on DBX cluster runtime 5.4 (?).

Camilla

Posted 2019-08-08T11:53:44.863

Reputation: 1

No answers