【Oracle】SQL对某字段模糊查询，哪种方案最快？

问题：有一张表hy_test，查找其字段name中包含ufo的记录数，下面哪种方案最快？

A.select count(*) from hy_test where name like '%ufo%'

B.select count(*) from hy_test where instr(name,'ufo')> 0

C.with temp as (select rowid from hy_test t where t.name like '%ufo%')

select count(*) from hy_test where rowid in (select rowid from temp)

D.with temp as (select rowid from hy_test t where t.name like '%ufo%')

select count(*) from hy_test a where exists (select rowid from temp where a.rowid=rowid)

A是常规方案，B是网文推荐的方案，C D 是不常见但也有人推荐的方案。

你心中的答案是哪个？

我先给name加上了索引

 create index hy_test_name on hy_test(name);

然后看各自的解释计划：

select count(*) from hy_test where name like '%ufo%'

Plan hash value: 2970624229

--------------------------------------------------------------------------------------

| Id  | Operation             | Name         | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT      |              |     1 |    62 |  3067   (1)| 00:00:01 |

|   1 |  SORT AGGREGATE       |              |     1 |    62 |            |          |

|*  2 |   INDEX FAST FULL SCAN| HY_TEST_NAME |  7426 |   449K|  3067   (1)| 00:00:01 |

--------------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

   2 - filter("NAME" LIKE U'%ufo%')

Note

-----

   - dynamic statistics used: dynamic sampling (level=2)

select count(*) from hy_test where instr(name,'ufo')> 0

Plan hash value: 2970624229

--------------------------------------------------------------------------------------

| Id  | Operation             | Name         | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT      |              |     1 |    62 |  3070   (1)| 00:00:01 |

|   1 |  SORT AGGREGATE       |              |     1 |    62 |            |          |

|*  2 |   INDEX FAST FULL SCAN| HY_TEST_NAME |  7426 |   449K|  3070   (1)| 00:00:01 |

--------------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

   2 - filter(INSTR("NAME",U'ufo')>0)

Note

-----

   - dynamic statistics used: dynamic sampling (level=2)

with temp as (select rowid from hy_test t where t.name like '%ufo%')

select count(*) from hy_test where rowid in (select rowid from temp)

Plan hash value: 2970624229

--------------------------------------------------------------------------------------

| Id  | Operation             | Name         | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT      |              |     1 |    62 |  3067   (1)| 00:00:01 |

|   1 |  SORT AGGREGATE       |              |     1 |    62 |            |          |

|*  2 |   INDEX FAST FULL SCAN| HY_TEST_NAME |  7426 |   449K|  3067   (1)| 00:00:01 |

--------------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

   2 - filter("HY_TEST"."NAME" LIKE U'%ufo%')

Note

-----

   - dynamic statistics used: dynamic sampling (level=2)

with temp as (select rowid from hy_test t where t.name like '%ufo%')

select count(*) from hy_test a where exists (select rowid from temp where a.rowid=rowid)

Plan hash value: 2970624229

--------------------------------------------------------------------------------------

| Id  | Operation             | Name         | Rows  | Bytes | Cost (%CPU)| Time     |

--------------------------------------------------------------------------------------

|   0 | SELECT STATEMENT      |              |     1 |    62 |  3067   (1)| 00:00:01 |

|   1 |  SORT AGGREGATE       |              |     1 |    62 |            |          |

|*  2 |   INDEX FAST FULL SCAN| HY_TEST_NAME |  7426 |   449K|  3067   (1)| 00:00:01 |

--------------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

   2 - filter("A"."NAME" LIKE U'%ufo%')

Note

-----

   - dynamic statistics used: dynamic sampling (level=2)

至少从解释计划来看，四种方案都是差不多的，Cost 都在3067左右，细究的话。instr方案还稍慢点,到了3070！

drop index hy_test_name

再把索引去掉比较：

无索引

select count(*) from hy_test where name like '%ufo%'

Plan hash value: 1972112514

------------------------------------------------------------------------------

| Id  | Operation          | Name    | Rows  | Bytes | Cost (%CPU)| Time     |

------------------------------------------------------------------------------

|   0 | SELECT STATEMENT   |         |     1 |    62 |  3857   (1)| 00:00:01 |

|   1 |  SORT AGGREGATE    |         |     1 |    62 |            |          |

|*  2 |   TABLE ACCESS FULL| HY_TEST |  7426 |   449K|  3857   (1)| 00:00:01 |

------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

   2 - filter("NAME" LIKE U'%ufo%')

Note

-----

   - dynamic statistics used: dynamic sampling (level=2)

select count(*) from hy_test where instr(name,'ufo')> 0

 Plan hash value: 1972112514

------------------------------------------------------------------------------

| Id  | Operation          | Name    | Rows  | Bytes | Cost (%CPU)| Time     |

------------------------------------------------------------------------------

|   0 | SELECT STATEMENT   |         |     1 |    62 |  3860   (1)| 00:00:01 |

|   1 |  SORT AGGREGATE    |         |     1 |    62 |            |          |

|*  2 |   TABLE ACCESS FULL| HY_TEST |  7426 |   449K|  3860   (1)| 00:00:01 |

------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

   2 - filter(INSTR("NAME",U'ufo')>0)

Note

-----

   - dynamic statistics used: dynamic sampling (level=2)

  EXPLAIN PLAN FOR

with temp as (select rowid from hy_test t where t.name like '%ufo%')

select count(*) from hy_test where rowid in (select rowid from temp)

Plan hash value: 1972112514

------------------------------------------------------------------------------

| Id  | Operation          | Name    | Rows  | Bytes | Cost (%CPU)| Time     |

------------------------------------------------------------------------------

|   0 | SELECT STATEMENT   |         |     1 |    62 |  3857   (1)| 00:00:01 |

|   1 |  SORT AGGREGATE    |         |     1 |    62 |            |          |

|*  2 |   TABLE ACCESS FULL| HY_TEST |  7426 |   449K|  3857   (1)| 00:00:01 |

------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

   2 - filter("HY_TEST"."NAME" LIKE U'%ufo%')

Note

-----

   - dynamic statistics used: dynamic sampling (level=2)

  EXPLAIN PLAN FOR

with temp as (select rowid from hy_test t where t.name like '%ufo%')

select count(*) from hy_test a where exists (select rowid from temp where a.rowid=rowid)

Plan hash value: 1972112514

------------------------------------------------------------------------------

| Id  | Operation          | Name    | Rows  | Bytes | Cost (%CPU)| Time     |

------------------------------------------------------------------------------

|   0 | SELECT STATEMENT   |         |     1 |    62 |  3857   (1)| 00:00:01 |

|   1 |  SORT AGGREGATE    |         |     1 |    62 |            |          |

|*  2 |   TABLE ACCESS FULL| HY_TEST |  7426 |   449K|  3857   (1)| 00:00:01 |

------------------------------------------------------------------------------

Predicate Information (identified by operation id):

---------------------------------------------------

   2 - filter("A"."NAME" LIKE U'%ufo%')

Note

-----

   - dynamic statistics used: dynamic sampling (level=2)

现在看，instr仍然是最慢的，cost为3860，其它三个都在3857！

结论：大体上四种方案都没什么差别，细究下，网红方案instr反而是最慢的，看上去最low的%ufo%并不慢。

对此，你能同意吗？面试时遇到这题你会怎么写？

如果你想重复我的实验，可以用以下SQL创建表和数据：

CREATE TABLE hy_test

(

    id NUMBER not null primary key,

    name NVARCHAR2(60) not null,

    score NUMBER(4,0) NOT NULL,

    createtime TIMESTAMP (6) not null

)

 Insert into hy_test

 select rownum,dbms_random.string('*',dbms_random.value(6,20)),dbms_random.value(0,20),sysdate from dual

 connect by level<=2000000

 order by dbms_random.random

然后用一下Java程序改写数据：

package recordchanger;

import java.sql.Connection;

import java.sql.DriverManager;

import java.sql.ResultSet;

import java.sql.SQLException;

import java.sql.Statement;

import java.text.DecimalFormat;

import java.util.Random;

import java.util.Set;

import java.util.TreeSet;

public class RecordChanger {

    public boolean changeOnePencent(String table) {

        Connection conn = null;

        Statement stmt = null;

        try{

            Class.forName(DBParam.Driver).newInstance();

            conn = DriverManager.getConnection(DBParam.DbUrl, DBParam.User, DBParam.Pswd);

            stmt = conn.createStatement();

            long startMs = System.currentTimeMillis();

            int totalCount=fetchExistCount(table,stmt);

            System.out.println("There are "+toEastNumFormat(totalCount)+" records in the table:'"+table+"'.");

            int changeCount=totalCount/100;

            System.out.println("There are "+toEastNumFormat(changeCount)+" records should be changed.");

            Set<Integer> idSet=fetchIdSet(totalCount,changeCount,table,stmt);

            System.out.println("There are "+toEastNumFormat(idSet.size())+" records in idSet.");

            int changed=updateRecords(idSet,table,stmt);

            System.out.println("There are "+toEastNumFormat(changed)+" records have been changed.");

            long endMs = System.currentTimeMillis();

            System.out.println("It takes "+ms2DHMS(startMs,endMs)+" to update 1% records of table:'"+table+"'.");

        } catch (Exception e) {

            e.printStackTrace();

        } finally {

            try {

                stmt.close();

                conn.close();

            } catch (SQLException e) {

                System.out.print("Can't close stmt/conn because of " + e.getMessage());

            }

        }

        return false;

    }

    private int updateRecords(Set<Integer> idSet,String tableName,Statement stmt)  throws SQLException{

        int updated=0;

        for(int id:idSet) {

            String sql="update "+tableName+" set name='"+getRNDName()+"' where id='"+id+"' ";

            updated+= stmt.executeUpdate(sql);

        }

        return updated;

    }

    private String getRNDName() {

        String[] arr= {"Andy","Bill","Cindy","ufo","sufo","ufoa","ufot","AufoT","BufoT","1ufoufo","钱八","岳飞","关羽","刘备","曹操","张辽","虚竹","王语嫣"};

        int index=getRandom(0,arr.length);

        return arr[index];

    }

    // fetch a set of id which should be changed

    private Set<Integer> fetchIdSet(int totalCount,int changeCount,String tableName,Statement stmt)  throws SQLException{

        Set<Integer> idSet=new TreeSet<Integer>();

        while(idSet.size()<changeCount) {

            int id=getRandom(0,totalCount);

            if(idSet.contains(id)==false && isIdExist(id,tableName,stmt)) {

                idSet.add(id);

            }

        }

        return idSet;

    }

    private boolean isIdExist(int id,String tableName,Statement stmt)  throws SQLException{

        String sql="select count(*) as cnt from "+tableName+" where id='"+id+"' ";

        ResultSet rs = stmt.executeQuery(sql);

        while (rs.next()) {

            int cnt = rs.getInt("cnt");

            return cnt==1;

        }

        rs.close();

        return false;

    }

    // get a random num between min and max

    private static int getRandom(int min, int max){

        Random random = new Random();

        int s = random.nextInt(max) % (max - min + 1) + min;

        return s;

    }

    // fetch exist record count of a table

    private int fetchExistCount(String tableName,Statement stmt)  throws SQLException{

        String sql="select count(*) as cnt from "+tableName+"";

        ResultSet rs = stmt.executeQuery(sql);

        while (rs.next()) {

            int cnt = rs.getInt("cnt");

            return cnt;

        }

        rs.close();

        return 0;

    }

    // 将整数在万分位以逗号分隔表示

    public static String toEastNumFormat(long number) {

        DecimalFormat df = new DecimalFormat("#,####");

        return df.format(number);

    }

    // change seconds to DayHourMinuteSecond format

    private static String ms2DHMS(long startMs, long endMs) {

        String retval = null;

        long secondCount = (endMs - startMs) / 1000;

        String ms = (endMs - startMs) % 1000 + "ms";

        long days = secondCount / (60 * 60 * 24);

        long hours = (secondCount % (60 * 60 * 24)) / (60 * 60);

        long minutes = (secondCount % (60 * 60)) / 60;

        long seconds = secondCount % 60;

        if (days > 0) {

            retval = days + "d" + hours + "h" + minutes + "m" + seconds + "s";

        } else if (hours > 0) {

            retval = hours + "h" + minutes + "m" + seconds + "s";

        } else if (minutes > 0) {

            retval = minutes + "m" + seconds + "s";

        } else {

            retval = seconds + "s";

        }

        return retval + ms;

    }

    public static void main(String[] args) {

        RecordChanger rc=new RecordChanger();

        rc.changeOnePencent("hy_test");

    }

    protected class DBParam {

        public final static String Driver = "oracle.jdbc.driver.OracleDriver";

        public final static String DbUrl = "jdbc:oracle:thin:@dev-dm-ufo.dev.un.local:2050/ufo";

        public final static String User = "ufo";

        public final static String Pswd = "test01";

    }

}

--END-- 2020-01-06 16:43

【Oracle】SQL对某字段模糊查询，哪种方案最快？的更多相关文章

Mybatis mysql 一个搜索框多个字段模糊查询几种方法
第一种 or 根据搜索框给定的关键词,模糊搜索用户名和账号都匹配的用户集合 <select id="list" parameterType="com.user.Us ...
sql整型字段模糊查询
select count(*) cnt from vhuiy where CAST(id as text) like'%12%'--id为int类型更详细的链接:http://www.studyof ...
MySQL单表多字段模糊查询
今天工作时遇到一个功能问题:就是输入关键字搜索的字段不只一个字段,比如我输入: 超天才 ,需要检索出包含这个关键字的 name . company.job等多个字段.在网上查询了一会就找到了答案. ...
MySQL简单实现多字段模糊查询
我所做的商城项目前些时提了新需求,要求前台搜索商品除了能通过商品名称搜索到以外,还可以通过别个信息搜索,比如:商品编号.详情内容描述等等,类似于全文搜索了.我首先想到的就是lucene,但是对代码这样 ...
MySQL单表多字段模糊查询解决方法又折磨半天concat(字段不能为空，如为空则用IFNULL(字段，'');
SELECT `id`,`weixin_id`,`user_name`,`sex`,`area_id`,`address_near`,`phone`,`create_time`,`import_use ...
Mysql 之实现多字段模糊查询
在一个table中有省,市,县,期,栋,单元,室几个字段,然后用户输入一个地址从表中的字段拼接起来进行模糊查询. 解决办法: <MySQL权威指南>中CONCAT的使用方法,在书中的对CO ...
Mysql多字段模糊查询
MySQL同一字段多值模糊查询一. 同一字段多值模糊查询,使用多个or进行链接,效率不高,但没有更好的解决方案.(有看到CHARINDEX 关键字,可查询结果并不是模糊,举个栗子例如SELECT ...
通过带参数的Sql语句来实现模糊查询（多条件查询）
#region 通过带参数的Sql语句来实现模糊查询(多条件查询) StringBuilder sb = new StringBuilder("select * from books&quo ...
在SQL Server中用好模糊查询指令LIKE
简介:like在sql中的使用在SQL Server中用好模糊查询指令LIKE 查询是SQL Server中重要的功能,而在查询中将Like用上,可以搜索到一些意想不到的结果和效果,like的神奇之 ...

随机推荐

C#设计模式之10-外观模式
外观模式(Facade Pattern) 该文章的最新版本已迁移至个人博客[比特飞],单击链接 https://www.byteflying.com/archives/407 访问. 外观模式属于结构 ...
Oracle数据库启动及状态等查询
一.监听 1)启动监听: lsnrctl start 2)查看监听状态: lsnrctl status 3)停止监听: lsnrctl stop 4)检查是否可进行网络连接: tnsping ${si ...
【算法•日更•第二十三期】数据结构：two-pointer（尺取法）&莫队
▎引入 ☞『例题』一道十分easy的题: 洛谷P1638 长度为n的序列,m种数找一个最短区间,使得所有数出现一遍 n≤1e6 ,m≤2e3. ☞『分析』这道题非常的简单,但是如果不会two-p ...
Python 批量保存word
from docx import Document path = r'D:\pywork\12' # word信息表所在文件夹 w = Document(path + '/' + 'word模板表.d ...
int ,long , long long , __int64类型的范围
首先见测试代码(在g++/gcc下运行): #include<iostream> using namespace std; int main() { cout<<sizeof( ...
【翻译】Promises/A+规范
目录介绍译文 1. 术语(Terminology) 2. 要求(Requirements) 2.1 Promise状态 2.2 then方法 2.3 Promise解析程序 3. 注释 3.1 p ...
js 常用业务工具方法（es5，es6）持续更新
数组去重数组去重最原始的方法就是使用双层循环. es5: // 使用indexOf function unique(array) { var res = []; for (var i = 0, le ...
使用vim的妙招
使用F1执行文件 Vim是一个类似于Vi的著名的功能强大.高度可定制的文本编辑器. 我们Linux运维经常在Linux中使用到Vim编辑器,当使用Vim写shell脚本或者python脚本的时候,想要 ...
Chrome扩展应用Postman地址（直接搜是搜不到的）
https://chrome.google.com/webstore/detail/postman/fhbjgbiflinjbdggehcddcbncdddomop
国际象棋棋盘输出-PHP代码
<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8&quo ...

【Oracle】SQL对某字段模糊查询，哪种方案最快？

【Oracle】SQL对某字段模糊查询，哪种方案最快？的更多相关文章

随机推荐

热门专题