Prometheus promQL查询语言

Prometheus提供了一种名为PromQL (Prometheus查询语言)的函数式查询语言，允许用户实时选择和聚合时间序列数据。表达式的结果既可以显示为图形，也可以在Prometheus的表达式浏览器中作为表格数据查看，或者通过HTTP API由外部系统使用。

准备工作

在进行查询，这里提供下我的配置文件如下

[root@node00 prometheus]# cat prometheus.yml

# my global config

global:

  scrape_interval:     15s # Set the scrape interval to every  seconds. Default is every  minute.

  evaluation_interval: 15s # Evaluate rules every  seconds. The default is every  minute.

  # scrape_timeout is set to the global default (10s).

# Alertmanager configuration

alerting:

  alertmanagers:

  - static_configs:

    - targets:

      # - alertmanager:

# Load rules once and periodically evaluate them according to the global 'evaluation_interval'.

rule_files:

  # - "first_rules.yml"

  # - "second_rules.yml"

# A scrape configuration containing exactly one endpoint to scrape:

# Here it's Prometheus itself.

scrape_configs:

  # The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.

  - job_name: 'prometheus'

    # metrics_path defaults to '/metrics'

    # scheme defaults to 'http'.

    static_configs:

    - targets: ['localhost:9090']

  - job_name: "node"

    file_sd_configs:

    - refresh_interval: 1m

      files:

      - "/usr/local/prometheus/prometheus/conf/node*.yml"

remote_write:

  - url: "http://localhost:8086/api/v1/prom/write?db=prometheus"

remote_read:

  - url: "http://localhost:8086/api/v1/prom/read?db=prometheus"

[root@node00 prometheus]# cat conf/node-dis.yml

- targets:

  - "192.168.100.10:20001"

  labels:

    __datacenter__: dc0

    __hostname__: node00

    __businees_line__: "line_a"

    __region_id__: "cn-beijing"

    __availability_zone__: "a"

- targets:

  - "192.168.100.11:20001"

  labels:

    __datacenter__: dc1

    __hostname__: node01

    __businees_line__: "line_a"

    __region_id__: "cn-beijing"

    __availability_zone__: "a"

- targets:

  - "192.168.100.12:20001"

  labels:

    __datacenter__: dc0

    __hostname__: node02

    __businees_line__: "line_c"

    __region_id__: "cn-beijing"

    __availability_zone__: "b"

简单时序查询

直接查询特定metric_name

# 节点的forks的总次数

node_forks_total
#结果如下

Element	Value
node_forks_total{instance="192.168.100.10:20001",job="node"}	201518
node_forks_total{instance="192.168.100.11:20001",job="node"}	23951
node_forks_total{instance="192.168.100.12:20001",job="node"}	24127

带标签的查询

node_forks_total{instance="192.168.100.10:20001"}
# 结果如下

Element	Value
node_forks_total{instance="192.168.100.10:20001",job="node"}	201816

多标签查询

node_forks_total{instance="192.168.100.10:20001",job="node"}

# 结果如下

Element	Value
node_forks_total{instance="192.168.100.10:20001",job="node"}	201932

查询2分钟的时序数值

node_forks_total{instance="192.168.100.10:20001",job="node"}[2m]

Element	Value
node_forks_total{instance="192.168.100.10:20001",job="node"}	201932 @1569492864.036 201932 @1569492879.036 201932 @1569492894.035 201932 @1569492909.036 201985 @1569492924.036 201989 @1569492939.036 201993 @1569492954.036

正则匹配

node_forks_total{instance=~"192.168.*:20001",job="node"}

Element	Value
node_forks_total{instance="192.168.100.10:20001",job="node"}	202107
node_forks_total{instance="192.168.100.11:20001",job="node"}	24014
node_forks_total{instance="192.168.100.12:20001",job="node"}	24186

常用函数查询

官方提供的函数比较多，具体可以参考地址如下： https://prometheus.io/docs/prometheus/latest/querying/functions/

这里主要就常用函数进行演示。

irate

irate用于计算速率。

# 通过标签查询，特定实例特定job，特定cpu 在idle状态下的cpu次数速率
irate(node_cpu_seconds_total{cpu="",instance="192.168.100.10:20001",job="node",mode="idle"}[1m])

Element	Value
{cpu="0",instance="192.168.100.10:20001",job="node",mode="idle"}	0.9833988932595507

count_over_time

计算特定的时序数据中的个数。

# 这个数值个数和采集频率有关， 我们的采集间隔是15s，在一分钟会有4个点位数据。
count_over_time(node_boot_time_seconds[1m])

Element	Value
{instance="192.168.100.10:20001",job="node"}	4
{instance="192.168.100.11:20001",job="node"}	4
{instance="192.168.100.12:20001",job="node"}	4

子查询

# 过去的10分钟内， 每分钟计算下过去5分钟的一个速率值。 一个采集10m/1m一共10个值。
rate(node_cpu_seconds_total{cpu="",instance="192.168.100.10:20001",job="node",mode="idle"}[5m])[10m:1m]

Element	Value
{cpu="0",instance="192.168.100.10:20001",job="node",mode="idle"}	0.9865228543057867 @1569494040 0.9862807017543735 @1569494100 0.9861087231885309 @1569494160 0.9864946894550303 @1569494220 0.9863192502430038 @1569494280 0.9859649122807017 @1569494340 0.9859298245613708 @1569494400 0.9869122807017177 @1569494460 0.9867368421052672 @1569494520 0.987438596491273 @1569494580

复杂查询

计算内存使用百分比

node_memory_MemFree_bytes / node_memory_MemTotal_bytes  *

Element	Value
{instance="192.168.100.10:20001",job="node"}	9.927579722322251
{instance="192.168.100.11:20001",job="node"}	59.740727403673034
{instance="192.168.100.12:20001",job="node"}	63.2080982675149

获取所有实例的内存使用百分比前2个

topk(,node_memory_MemFree_bytes / node_memory_MemTotal_bytes  *  )

Element	Value
{instance="192.168.100.12:20001",job="node"}	63.20129636298163
{instance="192.168.100.11:20001",job="node"}	59.50586164125955

实用查询样例

获取cpu核心个数

# 计算所有的实例cpu核心数
count by (instance) ( count by (instance,cpu) (node_cpu_seconds_total{mode="system"}) ) 
# 计算单个实例的
count by (instance) ( count by (instance,cpu) (node_cpu_seconds_total{mode="system",instance="192.168.100.11:20001"})

计算内存使用率

( - (node_memory_MemAvailable_bytes{instance=~"192.168.100.10:20001"} / (node_memory_MemTotal_bytes{instance=~"192.168.100.10:20001"})))* 100

Element	Value
{instance="192.168.100.10:20001",job="node"}	87.09358620413717

计算根分区使用率

 - ((node_filesystem_avail_bytes{instance="192.168.100.10:20001",mountpoint="/",fstype=~"ext4|xfs"} * ) / node_filesystem_size_bytes {instance=~"192.168.100.10:20001",mountpoint="/",fstype=~"ext4|xfs"})

Element	Value
{device="/dev/mapper/centos-root",fstype="xfs",instance="192.168.100.10:20001",job="node",mountpoint="/"}	4.175111443575972

预测磁盘空间

 # 整体分为 2个部分， 中间用and分割， 前面部分计算根分区使用率大于85的， 后面计算根据近6小时的数据预测接下来24小时的磁盘可用空间是否小于0 。
 (-  node_filesystem_avail_bytes{fstype=~"ext4|xfs",mountpoint="/"}

  / node_filesystem_size_bytes{fstype=~"ext4|xfs",mountpoint="/"}) *  >=      and (predict_linear(node_filesystem_avail_bytes[6h], * ) < )

prometheus学习系列七： Prometheus promQL查询语言的更多相关文章

Prometheus学习系列（六）之Prometheus 查询说明
前言本文来自Prometheus官网手册和 Prometheus简介 Prothetheus查询 Prometheus提供一个函数式的表达式语言PromQL (Prometheus Query La ...
Prometheus学习系列（九）之Prometheus 存储
前言本文来自Prometheus官网手册和 Prometheus简介存储 Prometheus是一个本地磁盘时间序列数据库,但也可选择与远程存储系统集成,其本地时间序列数据库以自定义格式在磁盘上 ...
Prometheus学习系列（五）之Prometheus 规则（rule）、模板配置说明
前言本文来自Prometheus官网手册1.2.3.4和 Prometheus简介1.2.3.4 记录规则一.配置规则 Prometheus支持两种类型的规则,这些规则可以定期配置,然后定期评估: ...
Prometheus学习系列（一）之Prometheus简介
前言本文来自Prometheus官网手册和 Prometheus简介什么是prometheus? Prometheus是一个最初在SoundCloud上构建的开源系统监视和警报工具包.自2012 ...
prometheus学习系列一： Prometheus简介
Prometheus简介 prometheus受启发于Google的Brogmon监控系统(相似kubernetes是从Brog系统演变而来), 从2012年开始由google工程师Soundclou ...
Prometheus学习系列（九）之Prometheus 联盟、迁移
前言本文来自Prometheus官网手册和 Prometheus简介 FEDERATION 允许Prometheus服务器从另一台Prometheus服务器抓取选定的时间序列. 一,用例联盟有不 ...
Prometheus学习系列（二）之Prometheus FIRST STEPS
前言本文来自Prometheus官网手册和 Prometheus简介说明 Prometheus是一个监控平台,通过在监控目标上的HTTP端点来收集受监控目标的指标.本指南将向您展示如何使用Pro ...
prometheus学习系列十一： Prometheus 安全
prometheus安全我们这里说的安全主要是基本认证和https2种, 目前这2种安全在prometheus中都没有的, 需要借助第三方软件实现, 这里以nginx为例. 基本认证配置基本认证 ...
prometheus学习系列十一： Prometheus pushgateway的使用
由于网络问题或者安全问题,可能我们的数据无法直接暴露出一个entrypoint 给prometheus采集. 这个时候可能就需要一个pushgateway来作为中间者完成中转工作. promethe ...

随机推荐

Input输入框内容限制
该文百度的嘻嘻,原文:Input输入框内容限制输入大小写字母.数字.下划线: <input type="text" onkeyup="this.value=thi ...
Kafka中的HW、LEO、LSO等分别代表什么？
HW . LEO 等概念和上一篇文章所说的 ISR有着紧密的关系,如果不了解 ISR 可以先看下ISR相关的介绍. HW (High Watermark)俗称高水位,它标识了一个特定的消息偏移量(of ...
L1和L2正则化（转载）
[深度学习]L1正则化和L2正则化在机器学习中,我们非常关心模型的预测能力,即模型在新数据上的表现,而不希望过拟合现象的的发生,我们通常使用正则化(regularization)技术来防止过拟合情况 ...
iptables 常用命令解析
查看当前iptables规则: iptables -n -L --line-numbers该命令会以列表的形式显示出当前使用的 iptables 规则,并不做解析,每一条规则前面的编号可以用来做为其它 ...
pacemaker和keepalived的区别
1.pacemaker Pacemaker 是一款开源的高可用资源管理软件,适合大集群或者小集群. Pacemaker 由Novell支持,SLES HAE就是用Pacemaker来管理集群,并且Pa ...
Log4j之HelloWorld
在编写项目的时候,我们一般都会用到日志记录,方便出错查找原因.首先我们需要了解什么是Log4j 1.使用maven建立工程,在pom.xml中加入如下: <dependency> < ...
Java小学四则运算
本次作业要求来自:https://edu.cnblogs.com/campus/gzcc/GZCC-16SE1/homework/2166 github远程仓库的地址:https://github.c ...
asp.net core api 跨域配置
项目前后端分离,前端请求接口例如使用axios发送请求时浏览器会提示跨域错误,需要后端配置允许接口跨域配置步骤: 1.通过NuGet安装Microsoft.AspNetCore.Cors.dll类库 ...
【C/C++开发】C++11 并发指南二(std::thread 详解)
上一篇博客<C++11 并发指南一(C++11 多线程初探)>中只是提到了 std::thread 的基本用法,并给出了一个最简单的例子,本文将稍微详细地介绍 std::thread 的用 ...
POJ-图论-最小生成树模板
POJ-图论-最小生成树模板 Kruskal算法 1.初始时所有结点属于孤立的集合. 2.按照边权递增顺序遍历所有的边,若遍历到的边两个顶点仍分属不同的集合(该边即为连通这两个集合的边中权值最小的那条 ...

prometheus学习系列七： Prometheus promQL查询语言