首页 > 资讯 > 数据库 >MySQL去重该使用distinct还是group by？

351

分享到

MySQL去重该使用distinct还是group by？

mysql 去重distinct group by 2022-05-29 01:05:49 351人浏览安东尼

摘要

前言关于group by 与distinct 性能对比:网上结论如下，不走索引少量数据distinct性能更好，大数据量group by 性能好，走索引group by性能好。走索引时分组种类少distinct快。

前言

关于group by 与distinct 性能对比:网上结论如下，不走索引少量数据distinct性能更好，大数据量group by 性能好，走索引group by性能好。走索引时分组种类少distinct快。关于网上的结论做一次验证。

准备阶段屏蔽查询缓存

查看Mysql中是否设置了查询缓存。为了不影响测试结果，需要关闭查询缓存。


show variables like '%query_cache%';

在这里插入图片描述

查看是否开启查询缓存决定于query_cache_type和query_cache_size。

方法一：关闭查询缓存需要找到my.ini，修改query_cache_type需要修改C:\ProgramData\mysql\MySQL Server 5.7\my.ini配置文件，修改query_cache_type=0或2。
方法二：设置query_cache_size为0，执行以下语句。


set global query_cache_size = 0;

方法三：如果你不想关闭查询缓存，也可以在使用RESET QUERY CACHE。

现在测试环境中query_cache_type=2代表按需进行查询缓存，默认的查询方式是不会进行缓存，如需缓存则需要在查询语句中加上sql_cache。

数据准备

t0表存放10W少量种类少的数据


drop table if exists t0;
create table t0(
id bigint primary key auto_increment,
a varchar(255) not null
) engine=InnoDB default charset=utf8mb4 collate=utf8mb4_bin;
1
2
3
4
5
drop procedure insert_t0_simple_cateGory_data_sp;
delimiter //
create procedure insert_t0_simple_category_data_sp(IN num int)
begin
set @i = 0;
while @i < num do
	insert into t0(a) value(truncate(@i/1000, 0));
 set @i = @i + 1;
end while;
end
//
call insert_t0_simple_category_data_sp(100000);

t1表存放1W少量种类多的数据


drop table if exists t1;
create table t1 like t0;
1
2
drop procedure insert_t1_complex_category_data_sp;
delimiter //
create procedure insert_t1_complex_category_data_sp(IN num int)
begin
set @i = 0;
while @i < num do
	insert into t1(a) value(truncate(@i/10, 0));
 set @i = @i + 1;
end while;
end
//
call insert_t1_complex_category_data_sp(10000);

t2表存放500W大量种类多的数据


drop table if exists t2;
create table t2 like t1;
1
2
drop procedure insert_t2_complex_category_data_sp;
delimiter //
create procedure insert_t2_complex_category_data_sp(IN num int)
begin
set @i = 0;
while @i < num do
	insert into t1(a) value(truncate(@i/10, 0));
 set @i = @i + 1;
end while;
end
//
call insert_t2_complex_category_data_sp(5000000);

测试阶段

验证少量种类少数据

未加索引


set profiling = 1;
select distinct a from t0;
show profiles;
select a from t0 group by a;
show profiles;
alter table t0 add index `a_t0_index`(a);

在这里插入图片描述

由此可见：少量种类少数据下，未加索引，distinct和group by性能相差无几。

加索引


alter table t0 add index `a_t0_index`(a);

执行上述类似查询后

在这里插入图片描述

由此可见：少量种类少数据下，加索引，distinct和group by性能相差无几。

验证少量种类多数据未加索引

执行上述类似未加索引查询后

在这里插入图片描述

由此可见：少量种类多数据下，未加索引，distinct比group by性能略高，差距并不大。

加索引


alter table t1 add index `a_t1_index`(a);

执行类似未加索引查询后

在这里插入图片描述

由此可见：少量种类多数据下，加索引，distinct和group by性能相差无几。

验证大量种类多数据

未加索引


SELECT count(1) FROM t2;

在这里插入图片描述

执行上述类似未加索引查询后

在这里插入图片描述

由此可见：大量种类多数据下，未加索引，distinct比group by性能高。

加索引


alter table t2 add index `a_t2_index`(a);

执行上述类似加索引查询后

在这里插入图片描述

由此可见：大量种类多数据下，加索引，distinct和group by性能相差无几。

总结性能比少量种类少少量种类多大量种类多未加索引相差无几distinct略优distinct更优加索引相差无几相差无几相差无几

去重场景下，未加索引时，更偏向于使用distinct，而加索引时，distinct和group by两者都可以使用。

总结

到此这篇关于MySQL去重该使用distinct还是group by？的文章就介绍到这了,更多相关mysql 去重distinct group by内容请搜索自学编程网以前的文章或继续浏览下面的相关文章希望大家以后多多支持自学编程网！

您可能感兴趣的文档:

点击免费下载>>软考高级考试备考技巧/历年真题/备考精华资料

--结束END--

本文标题: MySQL去重该使用distinct还是group by？

本文链接: https://www.lsjlt.com/news/9585.html(转载时请注明来源链接)

有问题或投稿请发送至: 邮箱/279061341@qq.com QQ/279061341

本篇文章演示代码以及资料文档资料下载

下载Word文档到电脑，方便收藏和打印～

下载Word文档

去做题

回答

如何调试操作系统的错误？
操作系统

2023-11-15发布

回答

操作系统中的I/O系统是如何实现的？
操作系统

2023-11-15发布

回答

如何实现操作系统的内存管理？
操作系统

2023-11-15发布

回答

什么是虚拟内存，它对操作系统有什么影响？
操作系统

2023-11-15发布

回答

ASP中的MVC架构和WebForms架构有什么区别和使用场景？
ASP.NET

2023-11-15发布

回答

ASP中的数据验证和数据校验有什么不同？
ASP.NET

2023-11-15发布

回答

ASP中的ADO对象和DAO对象有什么区别和使用方法？
ASP.NET

2023-11-15发布

回答

Node.js中的包管理器NPM是什么？如何使用它进行依赖管理？
node.js

2023-11-15发布

回答

Vue.js中的动态组件是什么？如何使用它来动态渲染组件？
VUE

2023-11-15发布

回答

如何使用Vue.js实现懒加载和预加载？
VUE

2023-11-15发布

MySQL去重该使用distinct还是group by？

本篇文章演示代码以及资料文档资料下载

MySQL去重该使用distinct还是group by？

Mysql: distinct去重 group by的区别

MySQL去重中distinct和group by的区别浅析

MySQL去重中distinct和group by的区别浅析

MySQL中distinct和group by去重效率区别是什么

Mysql中distinct与group by的去重方面的区别

MySQL中distinct和group by去重效率区别浅析

MySQL中distinct和group by去重效率区别浅析

MySQL中的distinct与group by如何使用

MySQL中的distinct与group by比较使用方法

mysql中使用distinct如何去除重复记录

MySQL中使用去重distinct方法的示例详解

MySQL中使用group by 是总是出现1055的错误

MySQL中使用group by 是总是出现1055的错误(推荐)

我应该在 MySQL 中使用 datetime 还是 timestamp 数据类型？

sql中group by的作用

order by在sql中的用法

sql中删除表的语句是

sql中修改表的命令

sql中using是什么意思

mysql中如何设置两个主键

mysql中replace函数的使用方法

mysql中复合主键怎么设置

怎么在mysql中创建library数据库

sql中的case语句用法