Questions tagged [cassandra]

Apache Cassandra is an open source distributed key-value database cluster, designed to scale linearly when adding nodes. It provides multi-datacenter replication.

It was designed to handle very large amounts of data spread out across many commodity servers while providing a highly available service with no single point of failure. It is a NoSQL solution that was initially developed by Facebook and powered their Inbox Search feature until late 2010.

Official page

Source: wikipedia

209 questions
2
votes
1 answer

IP address resolution in Cassandra MultiDC setup

We have a scenario for which we are considering using apache Cassandra for deployment for our data storage needs.The setup is to be spread across multiple data centers in different regions(physical locations). With each data center having multiple…
1
vote
1 answer

Apache Cassandra repair snapshots are not deleted

On a Cassandra cluster running Apache Cassandra 3.11.4 that is repaired using Cassandra Reaper 1.4.1, I am experiencing the problem that the snapshots that are created by the repair process are sometimes not deleted. This means that over time, more…
1
vote
0 answers

Risks of adding Cassandra 3.11 nodes to a 2.1 cluster

We're running DSE v4.8.5 (which uses Cassandra 2.1.13), and we need to migrate to the community edition of Cassandra v3.11 and then retire the old DSE servers. Someone (it might have been me) suggested the idea of adding the new C* 3.11 nodes to…
Rich
  • 11
  • 1
1
vote
1 answer

Can't authenticate to cassandra: cannot achieve consistency level LOCAL_ONE

Context: We have a Cassandra cluster with 3 nodes deployed as a Stateful Set in Openshift. The three nodes are configured in the same datacenter, same rack. I also made a script to test the Cassandra consistency level errors. It runs as a pod within…
aspyct
  • 340
  • 6
  • 19
1
vote
0 answers

How to use Cassandra with Spark in a Docker image?

(I hope this question is fit for ServerFault, if not, comment and I'll delete it) I'm trying to create a docker image where Cassandra and Spark would be installed and configured to work together. I never used Spark (and never created a Dockerfile),…
HypeWolf
  • 113
  • 5
1
vote
1 answer

I have 2 DCs, RF is 3 in both of them and consistency is set to quorum. What will happen when a read/write request comes and one of the DC is down?

I have 2 DCs, RF is 3 in both of them and consistency is set to quorum. What will happen when a read/write request comes and one of the DC is down ?
1
vote
1 answer

Should I see identical sstables on multiple cassandra nodes?

We're trying to verify the state of replication in our cassandra cluster. My colleague has found that only a small number of sstable files exist on multiple nodes. The others are all unique. To me, this makes sense. As I understand it, each node…
daxlerod
  • 223
  • 1
  • 6
1
vote
1 answer

Cassandra nodes in multiple data centers

http://www.cs.cornell.edu/projects/ladis2009/papers/lakshman-ladis2009.pdf : "Cassandra is configured such that each row is replicated across multiple data centers." So in that case is it good to have almost same number of Cassandra nodes in each…
sunil-khedar
1
vote
1 answer

cassandra port is closed

I am trying to connect to a cassandra installation from a remote machine. Cassandra is running the thrift service on 9160 which is the default port. But I cannot connect the DB and I always get connection timed out exception. Here are the details. …
Thiru
  • 13
  • 1
  • 3
1
vote
1 answer

Cassandra multinode installation,Cannot open file /var/log/cassandra/gc.log due to Permission denied

I want to install Cassandra on 4 remote nodes. Everything works fine until I come to point when I check status nodetool status Datacenter: datacenter1 ======================= Status=Up/Down |/ State=Normal/Leaving/Joining/Moving -- Address Load …
MikiBelavista
  • 301
  • 2
  • 4
  • 12
1
vote
0 answers

How to edit my host file to enable three Cassandra nodes?

This is my /etc/host cat /etc/hosts 127.0.0.1 localhost 127.0.1.1 jholmes-System-Product-Name My nodetool status nodetool --host 127.0.0.1 status Datacenter: datacenter1 ======================= Status=Up/Down |/…
MikiBelavista
  • 301
  • 2
  • 4
  • 12
1
vote
1 answer

What is the side effect of having Cassandra tables with partition sizes of more than 100MB?

I am running Apache Cassandra 3.11.1 and have 6 tables sizes in the failing state. Max partition is larger than 100MB. For these 6 tables the partition sizes are on average between 200MB and upwards of 5GB. These 6 tables are split across 3 key…
Charles Green
  • 55
  • 1
  • 8
1
vote
1 answer

How to do an automated Restore of Cassandra Backup(Snapshot)?

I do have a single node Cassandra EC2 instance (apache-cassandra-3.10), in which I take snapshots (backup) on daily basis and upload it on AWS S3. I also take the backup of complete "schema" and "initial tokens" and keep it over S3.I have managed to…
1
vote
1 answer

Can't access Cassandra remote node

I’m using cassandra on a remote node I can access just with console. Since the node has a private ip, the cluster manager set a proxy to access private_ip:9042 in order to allow me to execute query against the db, so I have a public…
Akinn
  • 135
  • 5
1
vote
0 answers

Apache Cassandra / Presto select query

We have 2 node cluster with RF2. When we select data using Presto and passing all partition key in where condition. Sometime data is returned and sometimes it is not though the cqlsh does return data everytime. a high level summary is like this…