10 个你不知道的 PostgreSQL 成果

发布时间：2018-08-18 08:59:26 所属栏目：编程来源：开源中国编译

导读：技能沙龙 | 邀您于8月25日与国美/AWS/转转三位专家配合切磋小措施电商拭魅战英文原文：The Postgres 10 feature you didn't know about: CREATE STATISTICS 假如你曾行使 Postgres 做过一些机能优化，你或者已经行使过 EXPLAIN 。EXPLAIN 向你展示了 Postgre

这些欠好的数据是在什么时辰影响我们的呢？下面来看一个例子。

EXPLAIN ANALYZE SELECT col1,col2,count(*) from tbl group by col1, col2;                    
                                                         QUERY PLAN                                                           
----------------------------------------------------------------------------------------------------------------------------- 
 GroupAggregate  (cost=1990523.20..2091523.04 rows=100000 width=16) (actual time=2697.246..4470.789 rows=1001 loops=1) 
   Group Key: col1, col2 
   ->  Sort  (cost=1990523.20..2015523.16 rows=9999984 width=8) (actual time=2695.498..3440.880 rows=10000000 loops=1) 
         Sort Key: col1, col2 
         Sort Method: external sort  Disk: 176128kB 
         ->  Seq Scan on tbl  (cost=0.00..144247.84 rows=9999984 width=8) (actual time=0.008..665.689 rows=10000000 loops=1) 
 Planning time: 0.072 ms 
 Execution time: 4494.583 ms

聚合行时，Postgres 选择做散列聚合或组合。假如它以为散列表吻合，则选择散列聚合，不然它会选择对全部行举办排序，然后凭证 col1、col2 对它们举办分组。

此刻，planner 预计组的数目（便是 col1、col2 的差异值的数目）将为 100000。它估量到它没有足够的 work_mem 将该散列表存储在内存中。因此，它行使基于磁盘的排序来运行该查询。可是，正如在查询打算中所看到的那样，现实施数仅为 1001。大概，我们有足够的内存来执行哈希聚合。

让 planner 去捕捉 n_distinct 统计信息，从头运行查询并找出功效。

CREATE STATISTICS s2 (ndistinct) on col1, col2 from tbl;                                   
ANALYZE tbl; 
 
EXPLAIN ANALYZE SELECT col1,col2,count(*) from tbl group by col1, col2;                    
                                                      QUERY PLAN                                                        
----------------------------------------------------------------------------------------------------------------------- 
 HashAggregate  (cost=219247.63..219257.63 rows=1000 width=16) (actual time=2431.767..2431.928 rows=1001 loops=1) 
   Group Key: col1, col2 
   ->  Seq Scan on tbl  (cost=0.00..144247.79 rows=9999979 width=8) (actual time=0.008..643.488 rows=10000000 loops=1) 
 Planning time: 0.129 ms 
 Execution time: 2432.010 ms 
(5 rows)

（编辑：湖南网）

【声明】本站内容均来自网络，其相关言论仅代表作者个人观点，不代表本站立场。若无意侵犯到您的权利，请及时与联系站长删除相关内容!

7/9

首页

尾页

MySQL源码安置	运用图形界面从SQL导入
MySQL5.7+keepalived+	windows系统下jsp+mys