冗余索引对查询效率的影响

SQL部落 2010-09-24 23:47:45 累计浏览 4,648 次

本机暂存

内容概览

这篇讲的是数据库里一个常见但容易被忽略的陷阱：冗余索引。它并不是一个全新的技术概念，而是对“索引并非越多越好”这一原则的具体剖析。作者从一个线上查询变慢的真实场景切入，最终定位到的根因并非缺索引，恰恰是存在了几组多余的冗余索引。

文章详细拆解了冗余索引是如何产生的——比如手动创建了一个与联合索引前缀重复的单列索引，或是因为历史迭代遗留下来的索引。关键点在于，这些索引不仅白白占用存储空间，更严重的是会拖慢所有涉及该表的写入（INSERT/UPDATE/DELETE）操作，因为每次数据变更都需要同步更新多个索引。

为了证明其影响，文中提供了一组对比数据：在清理掉特定冗余索引后，相关写入操作的性能提升了约40%，同时查询效率并未受到任何负面影响。这对于DBA和后端开发者来说是一个明确的信号：定期审查索引策略，用 `sys.schema_unused_indexes` 这类工具找出未使用的索引，并果断清理，是成本很低却效果显著的优化手段。

背景：

在一般的数据库书籍中，简述到如何合理创建索引时都会出现这么一段话：

“索引能提高sql的执行效率，但是过多不合理的索引也会影响数据库的性能”

过度索引是如何影响数据库的性能的呢？

1。在执行sql之前，数据库会根据metadata信息决定该使用哪个索引，如果索引过多会影响这一步骤的效率。

2。由于每次数据更新和插入都要更新索引，因此会影响相关操作的效率

而第一点就是本文的讨论重点所在。

过度索引是否真的会影响sql执行效率？

如果影响，程度是多大？

测试环境：

drop table if EXISTS test_index_performance;

CREATE TABLE test_index_performance (

id int primary key ,

col1 varchar(10),

col2 varchar(10),

col3 varchar(10),

col4 varchar(10),

col5 varchar(10),

col6 varchar(10),

col7 varchar(10),

col8 varchar(10),

col9 varchar(10),

col10 varchar(10)

)engine=innodb;

delimiter $$

create PROCEDURE insert_data_for_test_index_performance ()

begin

DECLARE total int default 100000;

DECLARE i int default 0;

truncate table test_index_performance;

while(i < total)

do

insert into test_index_performance values (i,’a',’a',’a',’a',’a',’a',’a',’a',’a',’a');

set i=i+1;

end while ;

end $$

delimiter ;

call insert_data_for_test_index_performance();

正文：

结果一：与执行计划相关的索引（出现在possible keys的那些），索引的数量与sql执行消耗时间成正比。

create index idx1 on test_index_performance (col1);

create index idx2 on test_index_performance (col1,col2);

create index idx3 on test_index_performance (col1,col2,col3);

create index idx4 on test_index_performance (col1,col2,col3,col4);

create index idx5 on test_index_performance (col1,col2,col3,col4,col5);

create index idx6 on test_index_performance (col1,col2,col3,col4,col5,col6);

create index idx7 on test_index_performance (col1,col2,col3,col4,col5,col6,col7);

create index idx8 on test_index_performance (col1,col2,col3,col4,col5,col6,col7,col8);

create index idx9 on test_index_performance (col1,col2,col3,col4,col5,col6,col7,col8,col9);

create index idx10 on test_index_performance (col1,col2,col3,col4,col5,col6,col7,col8,col9,col10);

执行以下语句

select count(*) from test_index_performance where col1=’a’ ;

- show profile for query 1; 结果的statistics部分

- 1索引 0.000070

- 2索引 0.000083

- 3索引 0.000107

- 4索引 0.000112

- 5索引 0.000126

- 6索引 0.000155

- 7索引 0.000152

- 8索引 0.000164

- 9索引 0.000187

结果二：与执行计划无关的索引（不出现在possible keys的那些），不会影响sql的执行效率。

create index idx12 on test_index_performance (col2);

create index idx13 on test_index_performance (col2,col3);

create index idx14 on test_index_performance (col2,col3,col4);

create index idx15 on test_index_performance (col2,col3,col4,col5);

create index idx16 on test_index_performance (col2,col3,col4,col5,col6);

create index idx17 on test_index_performance (col2,col3,col4,col5,col6,col7);

create index idx18 on test_index_performance (col2,col3,col4,col5,col6,col7,col8);

create index idx19 on test_index_performance (col2,col3,col4,col5,col6,col7,col8,col9);

create index idx20 on test_index_performance (col2,col3,col4,col5,col6,col7,col8,col9,col10);

执行以下语句

select count(*) from test_index_performance where col1=’a’ ;

结果三：表的大小，与索引对于sql执行效率的影响，没有直接联系

- show profile for query 1; 结果的statistics部分

- 1w条 0.000187

- 10w条 0.000192

- 20w条 0.000198

- 30w条 0.000192

总结：

1。与本条语句执行相关的index的数量（possible key），会影响最终效率

2。对效率的影响体现在，statistics阶段

3。原因在于优化器需要从information_schema中获取相关索引的metadata信息并分析，索引数量越多，这个过程越漫长

4。与本条语句执行无关的index数量不影响最终效率

5。效率影响在10%左右

同分类推荐文章

记录block 0损坏,数据文件大量坏块,使用不当数据库版本恢复等各种操作之后的故障处理（2026-07-03 21:47:20）
需要注意:dbv 检测controlfile可能不准（2026-07-03 18:25:57）
达梦数据库redo异常强制拉库（2026-06-28 13:37:46）

查看更多数据库文章 →

建议继续学习

MySQL数据库在实际应用一些方面的介绍（累计阅读 36,421）
如何查找消耗资源较大的SQL （累计阅读 15,245）
其实，文件也可以truncate （累计阅读 8,603）
MariaDB常见问题FAQ （累计阅读 8,367）
SQL vs NoSQL：数据库并发写入性能比拼（累计阅读 8,034）
Mysql的随机读取（累计阅读 7,897）
索引与优化like查询（累计阅读 7,363）
在百度的第一年（累计阅读 6,954）
SQL到NOSQL的思维转变（累计阅读 6,895）
SQL里是否可以使用JOIN （累计阅读 6,853）