Repost: An Analysis of Cassandra Read and Write Performance

Data & Architecture DBA 2010-04-01 08:54:38

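Overview

This article covers the underlying principles behind Cassandra's performance under highly concurrent read and write workloads. Starting from the path data takes between memory and disk, the author examines how Cassandra uses an LSM-Tree structure to maximize write throughput.

The core idea is to turn random writes into sequential writes: data is first written to an in-memory MemTable and, once that is full, flushed sequentially to disk as an immutable SSTable file. This makes writes very fast but complicates reads, because the data for one row may be spread across several files.

The highlight of the article is its breakdown of the trade-offs and design choices Cassandra makes to optimize reads. For example, it uses Bloom filters to quickly rule out SSTables that do not contain the requested data, which avoids unnecessary disk IO, and it periodically runs compaction to merge SSTables, which both reduces the number of files and purges expired data. The article also compares where different compaction strategies (such as Size-Tiered and Leveled) fit, helping readers choose between write amplification and read amplification.

Overall, this is more than a description of configuration parameters: it shows how Cassandra's storage architecture balances the two seemingly conflicting goals of fast writes and efficient queries.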

1. An article analyzing Cassandra read performance:

Mike Perham continues his series now explaining: “reads and […] why they are slow”.

So what happens with a Cassandra read?

a client makes a read request to a random node
the node acts as a proxy determining the nodes having copies of data
the node requests the corresponding data from each of those nodes
the client can select the strength of the read consistency:
- single read => the request returns once it gets the first response, but data can be stale
- quorum read => the request returns only after the majority responded with the same value

Mark mentions a couple of corner cases related to this behavior that are more complicated.

the node also performs read repair of any inconsistent response
each node reading data uses either Memtable (in-memory) or SSTables (disk)
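The two consistency levels and the read-repair step described above can be sketched roughly as follows. This is a simplified illustration, not Cassandra's actual code: `coordinator_read` and `replicas_needing_repair` are hypothetical names, responses are modeled as `(replica, value)` pairs in arrival order, and every replica is assumed to have answered.

```python
from collections import Counter


def coordinator_read(responses, consistency="ONE"):
    """Pick the value to return to the client.

    responses: list of (replica_name, value) pairs in arrival order
    (hypothetical model; assumes every replica has answered).
    """
    if not responses:
        raise ValueError("no replica responded")
    if consistency == "ONE":
        # Single read: return the first response, which may be stale.
        return responses[0][1]
    if consistency == "QUORUM":
        # Quorum read: return once a majority agrees on the same value.
        quorum = len(responses) // 2 + 1
        seen = Counter()
        for _replica, value in responses:
            seen[value] += 1
            if seen[value] >= quorum:
                return value
        raise RuntimeError("no value reached a quorum")
    raise ValueError(f"unknown consistency level: {consistency}")


def replicas_needing_repair(responses, agreed_value):
    """Read repair: list the replicas whose answer differs from the agreed value."""
    return [replica for replica, value in responses if value != agreed_value]
```

With responses `[("n1", "old"), ("n2", "new"), ("n3", "new")]`, a single read returns the possibly stale `"old"`, a quorum read returns `"new"`, and `n1` is the replica that read repair would bring up to date.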

Mike and Jonathan provide a very detailed explanation of the read performance:

Mike: To scan the SSTable, Cassandra uses a row-level column index and bloom filter to find the necessary blocks on disk, deserializes them and determines the actual data to return. There’s a lot of disk IO here which ultimately makes the read latency higher than a similar DBMS.

Jonathan: The reason uncached reads are slower in Cassandra is not because the SSTable is inherently io-intensive (it’s actually better than b-tree based storage on a 1:1 basis) but because in the average case you’ll have to merge row fragments from 2-4 SSTables to complete the request, since SSTables are not update-in-place.
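To make the point about merging row fragments concrete, here is a minimal sketch of a read that consults several SSTables. It assumes a toy in-memory model: `BloomFilter`, `SSTable`, and `read_row` are illustrative names, each column carries a `(value, timestamp)` pair, and the newest timestamp wins. Real SSTables live on disk and also use a column index, which is omitted here.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter: may report false positives, never false negatives."""

    def __init__(self, size_bits=1024, num_hashes=3):
        self.size = size_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, key):
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, key):
        for pos in self._positions(key):
            self.bits |= 1 << pos

    def might_contain(self, key):
        return all((self.bits >> pos) & 1 for pos in self._positions(key))


class SSTable:
    """Immutable table: {row_key: {column: (value, timestamp)}}."""

    def __init__(self, rows):
        self.rows = dict(rows)
        self.bloom = BloomFilter()
        for key in self.rows:
            self.bloom.add(key)


def read_row(sstables, row_key):
    """Merge the fragments of one row, keeping the newest value per column."""
    merged = {}
    for table in sstables:
        if not table.bloom.might_contain(row_key):
            continue  # the filter rules this table out; skip the lookup
        for column, (value, ts) in table.rows.get(row_key, {}).items():
            if column not in merged or ts > merged[column][1]:
                merged[column] = (value, ts)
    return {column: value for column, (value, _ts) in merged.items()}
```

Because SSTables are never updated in place, a later write to one column of a row lands in a newer table, so the read has to visit both tables and reconcile them by timestamp.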

It is also important to note that Cassandra employs row caching, which reduces read latency.

2. An article analyzing Cassandra write performance:

An interesting explanation of how Cassandra write operations happen:

client submits its write request to a single, random Cassandra node
the node behavior is similar to a proxy writing data to the cluster
writes are replicated to N nodes according to the replication placement strategy (the details of RackAwareStrategy are quite interesting)
each of the N nodes performs 2 actions when receiving a write (in the form of RowMutation):
- append the mutation to the commit log for transactional purposes
- update an in-memory Memtable structure with the change
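The write steps above can be sketched as follows. This is a toy model under stated assumptions: `Node` and `coordinator_write` are hypothetical names, the commit log is a plain Python list rather than an append-only file on disk, and the choice of replicas (the placement strategy) is left to the caller.

```python
class Node:
    """One replica: an append-only commit log plus an in-memory Memtable."""

    def __init__(self):
        self.commit_log = []  # stand-in for a sequential on-disk log
        self.memtable = {}    # {row_key: {column: (value, timestamp)}}

    def apply(self, row_key, column, value, timestamp):
        # 1. Append the mutation to the commit log for durability.
        self.commit_log.append((row_key, column, value, timestamp))
        # 2. Update the Memtable; the newest timestamp wins.
        row = self.memtable.setdefault(row_key, {})
        current = row.get(column)
        if current is None or timestamp >= current[1]:
            row[column] = (value, timestamp)


def coordinator_write(replicas, row_key, column, value, timestamp):
    """Proxy the mutation to each of the N replicas; return how many applied it."""
    for node in replicas:
        node.apply(row_key, column, value, timestamp)
    return len(replicas)
```

Both steps are cheap: one is a sequential append and the other is an in-memory update, so no random disk IO occurs on the write path in this model.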

There are also a couple of asynchronous operations:

Memtable is written to disk in a structure called SSTable
SSTables corresponding to a column family are merged into a raw ColumnFamily datafile.
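The two asynchronous operations can be illustrated with a short sketch. It assumes the same toy layout as a dictionary-based Memtable, `{row_key: {column: (value, timestamp)}}`, and models an SSTable as a list of `(row_key, columns)` pairs sorted by key. `flush` and `compact` are illustrative names, not Cassandra functions.

```python
def flush(memtable):
    """Write the Memtable out as a sorted, immutable table and clear it."""
    sstable = sorted(
        ((key, dict(columns)) for key, columns in memtable.items()),
        key=lambda item: item[0],
    )
    memtable.clear()
    return sstable


def compact(sstables):
    """Merge several tables into one, keeping the newest value per column."""
    merged = {}
    for table in sstables:
        for row_key, columns in table:
            row = merged.setdefault(row_key, {})
            for column, (value, ts) in columns.items():
                if column not in row or ts > row[column][1]:
                    row[column] = (value, ts)
    return sorted(merged.items(), key=lambda item: item[0])
```

After compaction, a read for a given row touches one table instead of several, which is why merging SSTables in the background helps read latency.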

References:

http://nosql.mypopescu.com/post/474623402/cassandra-reads-performance-explained

http://nosql.mypopescu.com/post/454521259/cassandra-write-operation-performance-explained
