ELK7日志分析系统基础(二)

2020-11-23 10:40:47 浏览数 (1)

ELK7日志分析系统基础(二)

版本区别

  • ELK6: 默认对外开放访问,需要xpack之类的插件才能开启认证
  • ELK7: 默认开启安全验证功能

基本环境需求

  • centos7
    • 关闭防火墙
    • 关闭selinux
    • 时间同步
    • 磁盘分区
  • yum源
代码语言:javascript复制
curl -o /etc/yum.repos.d/CentOS-Base.repo https://mirrors.aliyun.com/repo/Centos-7.repo
wget -O /etc/yum.repos.d/epel.repo http://mirrors.aliyun.com/repo/epel-7.repo
yum -y install java-1.8.0-openjdk java-1.8.0-openjdk-devel ntpdate
ntpdate time.windows.com
  • java环境安装
代码语言:javascript复制
- 下载地址: https://www.oracle.com/java/technologies/javase/javase-jdk8-downloads.html
- yum安装: yum -y install java-1.8.0-openjdk java-1.8.0-openjdk-devel
- java环境的验证: java -version
  • ELK下载准备
代码语言:javascript复制
https://www.elastic.co/guide/en/elasticsearch/reference/7.6/index.html

ELK7的基础概念

  • Elasticsearch: 搜索数据库服务器,提供restful web接口,简称ES
  • Logstash:数据采集和过滤分析以及字段提取
  • kibana: 主要是页面展示,ES操作简化等

ElasticSearch集群部署与使用实战

ES数据库单节点部署

官方安装文档参考: https://www.elastic.co/guide/en/elasticsearch/reference/7.6/install-elasticsearch.html

  • 基础环境准备好
  • elasticsearch rpm下载与安装
代码语言:javascript复制
[root@centos7-node1 ~]# wget https://artifacts.elastic.co/downloads/elasticsearch/elasticsearch-7.6.2-x86_64.rpm
[root@centos7-node1 ~]# yum -y localinstall elasticsearch-7.6.2-x86_64.rpm 
  • jvm配置文件修改(默认要填写本机内存的60%)
代码语言:javascript复制
[root@centos7-node1 ~]# vim /etc/elasticsearch/jvm.options
-Xms200M
-Xmx200M
  • ES单实例配置与启动(特别要注意)
代码语言:javascript复制
[root@centos7-node1 ~]# cp /etc/elasticsearch/elasticsearch.yml /usr/local/src/
[root@centos7-node1 ~]# vim /etc/elasticsearch/elasticsearch.yml
path.data: /var/lib/elasticsearch
path.logs: /var/log/elasticsearch
network.host: 192.168.56.11
http.port: 9200
xpack.security.enabled: true
discovery.type: single-node

[root@centos7-node1 ~]# systemctl restart elasticsearch  # 服务启动
  • 访问(需要密码)
  • 设置密码(试验环境: elastic)
代码语言:javascript复制
[root@centos7-node1 ~]# /usr/share/elasticsearch/bin/elasticsearch-setup-passwords interactive
Enter password for [elastic]:    elastic
Reenter password for [elastic]: 
Enter password for [apm_system]: 
Reenter password for [apm_system]: 
Enter password for [kibana]: 
Reenter password for [kibana]: 
Enter password for [logstash_system]: 
Reenter password for [logstash_system]: 
Enter password for [beats_system]: 
Reenter password for [beats_system]: 
Enter password for [remote_monitoring_user]: 
Reenter password for [remote_monitoring_user]: 
Changed password for user [apm_system]
Changed password for user [kibana]
Changed password for user [logstash_system]
Changed password for user [beats_system]
Changed password for user [remote_monitoring_user]
Changed password for user [elastic]
  • 访问
  • 验证是否启动成功
代码语言:javascript复制
[root@centos7-node1 ~]# curl -u elastic:elastic http://192.168.56.11:9200
{
  "name" : "centos7-node1",
  "cluster_name" : "elasticsearch",
  "cluster_uuid" : "WUehKqv3TyudTo_IKMNNlA",
  "version" : {
    "number" : "7.6.2",
    "build_flavor" : "default",
    "build_type" : "rpm",
    "build_hash" : "ef48eb35cf30adf4db14086e8aabd07ef6fb113f",
    "build_date" : "2020-03-26T06:34:37.794943Z",
    "build_snapshot" : false,
    "lucene_version" : "8.4.0",
    "minimum_wire_compatibility_version" : "6.8.0",
    "minimum_index_compatibility_version" : "6.0.0-beta1"
  },
  "tagline" : "You Know, for Search"
}

[root@centos7-node1 ~]# curl -u elastic:elastic http://192.168.56.11:9200/_cat/nodes?v    #查看节点信息
ip            heap.percent ram.percent cpu load_1m load_5m load_15m node.role master name
192.168.56.11           56          89   0    0.00    0.01     0.05 dilm      *      centos7-node1

[root@centos7-node1 ~]# curl -u elastic:elastic http://192.168.56.11:9200/_cat/indices?v    #列出索引信息
health status index       uuid                   pri rep docs.count docs.deleted store.size pri.store.size
green  open   .security-7 ByCOlBeLSfStcZVe8Ne0-Q   1   0          6            0     19.6kb         19.6kb
  • 写入数据
代码语言:javascript复制
[root@centos7-node1 ~]# curl -u elastic:elastic -X POST http://192.168.56.11:9200/test-index/_doc -H 'Content-Type:application/json' -d '{"name": "test-data1","age": 20}'     #插入数据
{"_index":"test-index","_type":"_doc","_id":"1Ip3xnUB0a1lyJIOfMTO","_version":1,"result":"created","_shards":{"total":2,"successful":1,"failed":0},"_seq_no":0,"_primary_term":1}
  • 查看数据
代码语言:javascript复制
[root@centos7-node1 ~]# curl -u elastic:elastic http://192.168.56.11:9200/test-index/_search?q=* | python -m json.tool    #查看数据
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100   284  100   284    0     0   2273      0 --:--:-- --:--:-- --:--:--  2290
{
    "_shards": {
        "failed": 0,
        "skipped": 0,
        "successful": 1,
        "total": 1
    },
    "hits": {
        "hits": [
            {
                "_id": "1Ip3xnUB0a1lyJIOfMTO",
                "_index": "test-index",
                "_score": 1.0,
                "_source": {
                    "age": 20,
                    "name": "test-data1"
                },
                "_type": "_doc"
            }
        ],
        "max_score": 1.0,
        "total": {
            "relation": "eq",
            "value": 1
        }
    },
    "timed_out": false,
    "took": 104
}

ES数据库加密集群部署

ES的分布式集群

索引的分片可以把数据分配到不同节点上

每个分片可设置值0个或者多个副本

副本的功能: 备份,提高查询效率,与集群中任何一个节点的通信结果都是一致的

ES分布式集群的部署
  • 部署节点信息

主机名

ip

软件

centos7-node1

192.168.56.11

java-1.8.0-openjdk java-1.8.0-openjdk-devel elasticsearch7.6

centos7-node2

192.168.56.12

java-1.8.0-openjdk java-1.8.0-openjdk-devel elasticsearch7.6

centos7-node3

192.168.56.13

java-1.8.0-openjdk java-1.8.0-openjdk-devel elasticsearch7.6

  • 软件和基础环境的问题参考第一小节
代码语言:javascript复制
curl -o /etc/yum.repos.d/CentOS-Base.repo https://mirrors.aliyun.com/repo/Centos-7.repo
wget -O /etc/yum.repos.d/epel.repo http://mirrors.aliyun.com/repo/epel-7.repo
yum -y install java-1.8.0-openjdk java-1.8.0-openjdk-devel
wget https://artifacts.elastic.co/downloads/elasticsearch/elasticsearch-7.6.2-x86_64.rpm
yum -y localinstall elasticsearch-7.6.2-x86_64.rpm
  • 置(所有节点)
代码语言:javascript复制
[root@centos7-node1 ~]# vim /etc/elasticsearch/jvm.options
-Xms200M
-Xmx200M
ES加密集群
  1. 创建集群交互证书
代码语言:javascript复制
[root@centos7-node1 ~]# /usr/share/elasticsearch/bin/elasticsearch-certutil ca    #直接回车两次即可
[root@centos7-node1 ~]# /usr/share/elasticsearch/bin/elasticsearch-certutil cert --ca /usr/share/elasticsearch/elastic-stack-ca.p12    #直接回车三次即可
  1. 拷贝证书(拷贝到每台es服务器上)
代码语言:javascript复制
[root@centos7-node1 ~]# cp /usr/share/elasticsearch/elastic-certificates.p12 /etc/elasticsearch/elastic-certificates.p12

# scp 发送证书到其余两个节点上

[root@centos7-node1 ~]# scp /usr/share/elasticsearch/elastic-certificates.p12 192.168.56.12:/etc/elasticsearch/elastic-certificates.p12
[root@centos7-node1 ~]# scp /usr/share/elasticsearch/elastic-certificates.p12 192.168.56.13:/etc/elasticsearch/elastic-certificates.p12
  1. 修改证书权限(三个节点都需要修改)
代码语言:javascript复制
[root@centos7-node1 ~]# chown elasticsearch:elasticsearch /etc/elasticsearch/elastic-certificates.p12

[root@centos7-node1 ~]# ssh 192.168.56.12 "chown elasticsearch:elasticsearch /etc/elasticsearch/elastic-certificates.p12" 

[root@centos7-node1 ~]# ssh 192.168.56.13 "chown elasticsearch:elasticsearch /etc/elasticsearch/elastic-certificates.p12" 
  1. elasticsearch配置

其中node.master用来做数据汇聚

其中node.data是用来做数据存储和查询,压力相对比较大,生产建议将master和data做分离

代码语言:javascript复制
###### centos7-node1配置文件
[root@centos7-node1 ~]# vim /etc/elasticsearch/elasticsearch.yml
cluster.name: cropy
node.name: node1
node.master: true
node.data: true
path.data: /var/lib/elasticsearch
path.logs: /var/log/elasticsearch
network.host: 192.168.56.11
http.port: 9200
discovery.seed_hosts: ["192.168.56.11","192.168.56.12","192.168.56.13"]
cluster.initial_master_nodes: ["192.168.56.11","192.168.56.12","192.168.56.13"]
xpack.security.enabled: true
xpack.monitoring.enabled: true
xpack.security.transport.ssl.enabled: true
xpack.security.transport.ssl.verification_mode: certificate
xpack.security.transport.ssl.keystore.path: /etc/elasticsearch/elastic-certificates.p12
xpack.security.transport.ssl.truststore.path: /etc/elasticsearch/elastic-certificates.p12

###### centos7-node2配置文件
[root@centos7-node2 ~]# vim /etc/elasticsearch/elasticsearch.yml
cluster.name: cropy
node.name: node2
node.master: true
node.data: true
path.data: /var/lib/elasticsearch
path.logs: /var/log/elasticsearch
network.host: 192.168.56.12
http.port: 9200
discovery.seed_hosts: ["192.168.56.11","192.168.56.12","192.168.56.13"]
cluster.initial_master_nodes: ["192.168.56.11","192.168.56.12","192.168.56.13"]
xpack.security.enabled: true
xpack.monitoring.enabled: true
xpack.security.transport.ssl.enabled: true
xpack.security.transport.ssl.verification_mode: certificate
xpack.security.transport.ssl.keystore.path: /etc/elasticsearch/elastic-certificates.p12
xpack.security.transport.ssl.truststore.path: /etc/elasticsearch/elastic-certificates.p12

###### centos7-node3配置文件
[root@centos7-node31 ~]# vim /etc/elasticsearch/elasticsearch.yml
cluster.name: cropy
node.name: node3
node.master: true
node.data: true
path.data: /var/lib/elasticsearch
path.logs: /var/log/elasticsearch
network.host: 192.168.56.13
http.port: 9200
discovery.seed_hosts: ["192.168.56.11","192.168.56.12","192.168.56.13"]
cluster.initial_master_nodes: ["192.168.56.11","192.168.56.12","192.168.56.13"]
xpack.security.enabled: true
xpack.monitoring.enabled: true
xpack.security.transport.ssl.enabled: true
xpack.security.transport.ssl.verification_mode: certificate
xpack.security.transport.ssl.keystore.path: /etc/elasticsearch/elastic-certificates.p12
xpack.security.transport.ssl.truststore.path: /etc/elasticsearch/elastic-certificates.p12
  1. 启动服务
代码语言:javascript复制
[root@centos7-node1 ~]# systemctl enable elasticsearch && systemctl restart elasticsearch
[root@centos7-node1 ~]# tail -f /var/log/elasticsearch/cropy.log    #集群日志查看(最后会有Yello状态)
........
[2020-11-14T21:39:53,107][INFO ][o.e.c.r.a.AllocationService] [node1] Cluster health status changed from [RED] to [YELLOW] (reason: [shards started [[.security-7][0]]]).

[root@centos7-node2 ~]# systemctl enable elasticsearch && systemctl restart elasticsearch
[root@centos7-node3 ~]# systemctl enable elasticsearch && systemctl restart elasticsearch
  1. 状态查询
代码语言:javascript复制
systemctl status elasticsearch
ps -ef | grep elasticsearch
netstat -tanlp | grep 9200
[root@centos7-node1 ~]# curl -u elastic:elastic http://192.168.56.11:9200/_cat/nodes?v 
ip            heap.percent ram.percent cpu load_1m load_5m load_15m node.role master name
192.168.56.11           60          92   1    0.00    0.01     0.05 dilm      *      node1
192.168.56.12           45          93   1    0.00    0.01     0.05 dilm      -      node2
192.168.56.13           60          94   1    0.05    0.03     0.05 dilm      -      node3

[root@centos7-node1 ~]# curl -u elastic:elastic http://192.168.56.11:9200/_cat/indices?v 
  1. 重要事项

ES集群开启xpack的时候只要有一个节点设置密码即可,否则不能访问 /usr/share/elasticsearch/bin/elasticsearch-setup-passwords interactive #参考命令

ES集群安全交互抓包

  • 需要安装软件(集群所有节点)
代码语言:javascript复制
yum -y install ngrep tcpdump
  • 测试
代码语言:javascript复制
[root@centos7-node1 ~]# ngrep -d ens33 port 9200
interface: ens33 (192.168.56.0/255.255.255.0)
filter: ( port 9200 ) and ((ip || ip6) || (vlan && (ip || ip6)))
#
T 192.168.56.1:50907 -> 192.168.56.11:9200 [AF] #1
  ......                                                                                                                                      
##
T 192.168.56.1:50907 -> 192.168.56.11:9200 [A] #3
  ......                                    

9300端口测试结果

代码语言:javascript复制
[root@centos7-node1 ~]# ngrep -d ens33 port 9300

ES数据库的基础操作

ES概念
  • 索引: 类似于数据库,索引在写入数据时会自动创建,可按天
  • 文档: 类似于表数据,存储在ES里面的数据
ES的基础操作
  • curl的方式: 相对比较麻烦
代码语言:javascript复制
### 写入指定ID数据(1 是id)
[root@centos7-node1 ~]# curl -u elastic:elastic -X PUT http://192.168.56.11:9200/test-index/_doc/1 -H 'Content-Type: application/json' -d '{"name":"zhangsan","age":30}'
### 查询数据
[root@centos7-node1 ~]# curl -u elastic:elastic http://192.168.56.13:9200/test-index/_search?q=*
[root@centos7-node1 ~]# curl -u elastic:elastic http://192.168.56.13:9200/test-index/_doc/1

### 写入随机ID数据(_doc/ 之后不添加id)
[root@centos7-node1 ~]# curl -u elastic:elastic -X POST http://192.168.56.11:9200/test-index/_doc -H 'Content-Type: application/json' -d '{"name":"lisi","age":33}' 

### 更新数据(id 为1 的数据更新)
[root@centos7-node1 ~]# curl -u elastic:elastic -X POST  http://192.168.56.11:9200/test-index/_update/1  -H 'Content-Type: application/json' -d '{"age":100}'  

### 删除数据
[root@centos7-node1 ~]# curl -u elastic:elastic -X DELETE http://192.168.56.11:9200/test-index/_doc/1     #删除单条数据
[root@centos7-node1 ~]# curl -u elastic:elastic -X DELETE http://192.168.56.11:9200/test-index      #删除所有数据

### 测试集群数据同步
[root@centos7-node1 ~]# curl  -u elastic:elastic -X POST http://192.168.56.12:9200/test-index/_doc -H 'Content-Type: application/json' -d '{"name": "wanghui", "age": 29}'
  • kibana: 提供简化的操作界面

Kibana的部署与使用实战

Kibana用于做数据展示,es的操作简化

Kibana的安装与配置

  • 找其中的一台ES节点做安装
代码语言:javascript复制
[root@centos7-node1 ~]# wget https://artifacts.elastic.co/downloads/kibana/kibana-7.6.2-x86_64.rpm
[root@centos7-node1 ~]# yum -y localinstall kibana-7.6.2-x86_64.rpm 
  • 配置kibana
代码语言:javascript复制
[root@centos7-node1 ~]# cp /etc/kibana/kibana.yml /usr/local/src/
[root@centos7-node1 ~]# vim /etc/kibana/kibana.yml 
server.port: 5601
server.host: "192.168.56.11"
elasticsearch.hosts: ["http://192.168.56.11:9200","http://192.168.56.12:9200","http://192.168.56.13:9200"]
elasticsearch.username: "elastic"
elasticsearch.password: "elastic"
logging.dest: "/tmp/kibaba.log"
  • 启动kibana
代码语言:javascript复制
[root@centos7-node1 ~]# systemctl start kibana && systemctl enable kibana
[root@centos7-node1 ~]# tail -f /tmp/kibaba.log    #启动日志查看
....
{"type":"log","@timestamp":"2020-11-14T15:07:29Z","tags":["listening","info"],"pid":13907,"message":"Server running at http://192.168.56.11:5601"}
  • 访问kibana 用户密码都是elastic (之前es部署的时候添加的)

Kibana简化ES数据库的操作

  • 简单查询
代码语言:javascript复制
GET /
GET /_cat/nodes?v
GET /_cat/indices?v
  • 数据插入
代码语言:javascript复制
# 插入数据
PUT /nginx-logs/_doc/1
{
  "server_name": "www.baidu.com",
  "IP": "101.11.203.12"
}

## 查询数据
GET /nginx-logs/_doc/1
GET /nginx-logs/_search?q=*

# 插入随机id数据
POST /nginx-logs/_doc
{
  "server_name": "tianyancha.com",
  "IP": "180.21.33.41"
}

# 查询所有数据
GET /nginx-logs/_search?q=*
# 更新数据
POST /nginx-logs/_update/1
{
  "doc": {
    "IP": "111.23.33.23"
  }
}

# 修改所有数据
POST /nginx-logs/_update_by_query
{
  "script": {
    "source": "ctx._source['IP']='111.23.133.123'"
  },
  "query": {
    "match_all": {}
  }
}

GET /nginx-logs/_search?q=*

# 增加字段
POST /nginx-logs/_update_by_query
{
  "script": {
    "source": "ctx._source['city']='beijing'"
  },
  "query": {
    "match_all": {}
  } 
}

# 删除数据
DELETE /nginx-logs/_doc/1

# 插入数据
POST /nginx-logs1/_doc
{
  "server_name": "tianyancha.com",
  "IP": "180.21.33.41"
}

POST /nginx-logs2/_doc
{
  "server_name": "tianyancha.com",
  "IP": "180.21.33.41"
}

# 查询索引
GET /_cat/indices?v

# 删除索引
DELETE /nginx-logs*

ElasticSearch模板使用与Python操作

索引的分片及副本的设置

索引的分片以及副本数的设置: 三台ES,最多两个副本,其余的一个要用来存储主数据

代码语言:javascript复制
# 设置分片和副本
PUT /nginx-log
{
  "settings": {
    "number_of_shards": 3,
    "number_of_replicas": 2
  }
}

GET /_cat/indices?v

#获取分片信息
GET /nginx-log/_search_shards
# 插入数据
POST /nginx-logs/_doc
{
  "server_name": "tianyancha.com",
  "IP": "180.21.33.41"
}

# 查询数据分片所在位置(routing也就是数据的ID)
GET /nginx-log/_search_shards?routing=GUmH0XUBiqEQwQWjL5hD

索引创建完成之后分片不可修改,副本数可以修改

代码语言:javascript复制
# 修改副本数量
PUT /nginx-log/_settings
{
"number_of_replicas": 1
}
索引的模板
代码语言:javascript复制
# 获取索引模板
GET /_template

# 简单索引模板创建
PUT _template/test-nginx
{
  "index_patterns": ["nginx*"],
  "settings": {
    "number_of_shards": 2,
    "number_of_replicas": 2
  }
}

# 插入数据
POST /nginx-logs/_doc
{
  "server_name": "www.tianyancha.com",
  "IP": "180.21.33.41"
}

POST /nginx-logs/_doc/1
{
  "server_name": "shanghai.tianyancha.com",
  "IP": "180.21.33.42"
}

POST /nginx-logs/_doc/2
{
  "server_name": "shanghai.tianyancha.com",
  "IP": "180.21.33.42"
}

python操作ES数据库集群

环境准备
  • 安装python36
代码语言:javascript复制
[root@centos7-node2 ~]# yum -y install python36 python36-devel
  • 升级pip3
代码语言:javascript复制
[root@centos7-node2 ~]# pip3 install --upgrade pip -i https://mirrors.aliyun.com/pypi/simple
  • 安装es7.6 的python库
代码语言:javascript复制
[root@centos7-node2 ~]# pip3 install elasticsearch==7.6.0 -i https://mirrors.aliyun.com/pypi/simple 
  • 检查ES模块是否安装成功
代码语言:javascript复制
[root@centos7-node2 ~]# python3
Python 3.6.8 (default, Apr  2 2020, 13:34:55) 
[GCC 4.8.5 20150623 (Red Hat 4.8.5-39)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import elasticsearch
python操作ES集群
  • 添加数据
代码语言:javascript复制
[root@centos7-node2 ~]# vim add_data.py
#!/use/bin/python3

from elasticsearch import Elasticsearch

es = Elasticsearch(['http://elastic:elastic@192.168.56.11:9200','http://elastic:elastic@192.168.56.12:9200','http://elastic:elastic@192.168.56.13
:9200'])

body = {"server_name":"https://baidu.com","IP":"120.99.220.23"}

es.index(index='nginx-logs',body=body)

print("insert data success!!")
  • 查询数据
代码语言:javascript复制
[root@centos7-node2 ~]# cat search_data.py 
#!/use/bin/python3

from elasticsearch import Elasticsearch

es = Elasticsearch(['http://elastic:elastic@192.168.56.11:9200','http://elastic:elastic@192.168.56.12:9200','http://elastic:elastic@192.168.56.13:9200'])

print(es.search(index='nginx-logs'))

[root@centos7-node2 ~]# python3 search_data.py    #查询数据
{'took': 2, 'timed_out': False, '_shards': {'total': 1, 'successful': 1, 'skipped': 0, 'failed': 0}, 'hits': {'total': {'value': 4, 'relation': 'eq'}, 'max_score': 1.0, 'hits': [{'_index': 'nginx-logs', '_type': '_doc', '_id': 'GUmH0XUBiqEQwQWjL5hD', '_score': 1.0, '_source': {'server_name': 'tianyancha.com', 'IP': '180.21.33.41'}}, {'_index': 'nginx-logs', '_type': '_doc', '_id': 'GkmQ0XUBiqEQwQWjDpiD', '_score': 1.0, '_source': {'server_name': 'www.tianyancha.com', 'IP': '180.21.33.41'}}, {'_index': 'nginx-logs', '_type': '_doc', '_id': '1', '_score': 1.0, '_source': {'server_name': 'shanghai.tianyancha.com', 'IP': '180.21.33.42'}}, {'_index': 'nginx-logs', '_type': '_doc', '_id': 'BA-e0XUB4OFcF5NOHHFn', '_score': 1.0, '_source': {'server_name': 'https://baidu.com', 'IP': '120.99.220.23'}}]}}
  • 删除索引
代码语言:javascript复制
[root@centos7-node2 ~]# vim delete_index.py 
#!/use/bin/python3

from elasticsearch import Elasticsearch

es = Elasticsearch(['http://elastic:elastic@192.168.56.11:9200','http://elastic:elastic@192.168.56.12:9200','http://elastic:elastic@192.168.56.13
:9200'])

print(es.indices.delete(index='nginx-logs'))
  • 循环添加数据
代码语言:javascript复制
#!/use/bin/python3

from elasticsearch import Elasticsearch
import time

es = Elasticsearch(['http://elastic:elastic@192.168.56.11:9200','http://elastic:elastic@192.168.56.12:9200','http://elastic:elastic@192.168.56.13
:9200'])

for i in range(1,10000):
    body = {"server_name":"https://{0}.baidu.com".format(i),"IP":"120.99.220.23","count":i}
    es.index('nginx-logs',body=body)
    time.sleep(0.1)
    print("insert {0}".format(i))

LogStash实用技能实战

Logstash的安装和简单使用

官方文档: https://www.elastic.co/guide/en/logstash/7.6/index.html

Logstash的功能
  1. 对日志进行过滤处理
  2. 也能用于日志收集(一般不这么用)
  • 输入支持: 标准输入,文本日志输入等
  • 输出支持: 标准输出,ES输出等
Logstash的安装部署
  • 节点: 192.168.56.14 centos7-node4
  • 基本环境
    • centos7
    • 时间同步
    • yum源
代码语言:javascript复制
curl -o /etc/yum.repos.d/CentOS-Base.repo https://mirrors.aliyun.com/repo/Centos-7.repo
wget -O /etc/yum.repos.d/epel.repo http://mirrors.aliyun.com/repo/epel-7.repo
yum -y install java-1.8.0-openjdk java-1.8.0-openjdk-devel ntpdate
ntpdate time.windows.com
  • 安装logstash

参考: https://www.elastic.co/guide/en/logstash/7.6/installing-logstash.html

代码语言:javascript复制
[root@centos7-node4 ~]# wget https://artifacts.elastic.co/downloads/logstash/logstash-7.6.2.rpm
[root@centos7-node4 ~]# yum localinstall logstash-7.6.2.rpm -y
  • 配置logstash JVM参数
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/logstash/jvm.options 
-Xms200M
-Xmx200M
  • 配置logstash
代码语言:javascript复制
[root@centos7-node4 ~]# cp /etc/logstash/logstash-sample.conf /etc/logstash/conf.d/logstash.conf
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash.conf
input {
   stdin {}
}

output {
   stdout {
       codec=>rubydebug
   }
}
  • logstash的启动和测试
代码语言:javascript复制
[root@centos7-node4 ~]# /usr/share/logstash/bin/logstash -f /etc/logstash/conf.d/logstash.conf
....
INFO ] 2019-11-09 05:17:20.482 [Api Webserver] agent - Successfully started Logstash API endpoint {:port=>9600}
logstash
/usr/share/logstash/vendor/bundle/jruby/2.5.0/gems/awesome_print-1.7.0/lib/awesome_print/formatters/base_formatter.rb:31: warning: constant ::Fixnum is deprecated
{
      "@version" => "1",
       "message" => "logstash",
          "host" => "centos7-node4",
    "@timestamp" => 2019-11-08T21:17:41.189Z
}
hello word
{
      "@version" => "1",
       "message" => "hello word",
          "host" => "centos7-node4",
    "@timestamp" => 2019-11-08T21:17:47.686Z
    ....

Logstash读取日志文件实战

  • 环境准备: 安装ngixn
代码语言:javascript复制
[root@centos7-node4 ~]# yum -y install nginx
[root@centos7-node4 ~]# vim /usr/lib/systemd/system/nginx.service   修改nginx启动文件
[Unit]
Description=The nginx HTTP and reverse proxy server
After=network.target remote-fs.target nss-lookup.target

[Service]
Type=forking
PIDFile=/run/nginx.pid
# Nginx will fail to start if /run/nginx.pid already exists but has the wrong
# SELinux context. This might happen when running `nginx -t` from the cmdline.
# https://bugzilla.redhat.com/show_bug.cgi?id=1268621
ExecStartPre=/usr/bin/rm -f /run/nginx.pid
ExecStartPre=/usr/sbin/nginx -t
ExecStart=/usr/sbin/nginx
ExecReload=/bin/kill -s HUP $MAINPID

[Install]
WantedBy=multi-user.target

[root@centos7-node4 ~]# systemctl start nginx && systemctl enable nginx     #服务启动
[root@centos7-node4 ~]# curl  localhost  
[root@centos7-node4 ~]# tail  /var/log/nginx/access.log 
  • logstash配置
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash.conf
input {
   file {
      path => "/var/log/nginx/access.log"
   }
}

output {
   stdout {
       codec=>rubydebug
   }
}
  • 测试logstash
代码语言:javascript复制
[root@centos7-node4 ~]# systemctl restart logstash
[root@centos7-node4 ~]# tail -f /var/log/logstash/logstash-plain.log 
[2019-11-09T05:29:03,147][INFO ][logstash.runner          ] Starting Logstash {"logstash.version"=>"7.6.2"}
[2019-11-09T05:29:03,205][INFO ][logstash.agent           ] No persistent UUID file found. Generating new UUID {:uuid=>"93219590-7b15-41b6-b262-c2c4077278f9", :path=>"/var/lib/logstash/uuid"}
[2019-11-09T05:29:05,203][INFO ][org.reflections.Reflections] Reflections took 54 ms to scan 1 urls, producing 20 keys and 40 values 
[2019-11-09T05:29:06,349][WARN ][org.logstash.instrument.metrics.gauge.LazyDelegatingGauge][main] A gauge metric of an unknown type (org.jruby.RubyArray) has been created for key: cluster_uuids. This may result in invalid serialization.  It is recommended to log an issue to the responsible developer/development team.
[2019-11-09T05:29:06,375][INFO ][logstash.javapipeline  

此时logstash不能正常采集日志,因为对nginx日志权限存在问题,所以要改一下nginx日志权限

代码语言:javascript复制
[root@centos7-node4 ~]# chmod 775 -R /var/log/nginx/
  • 查看message日志(一定要有新日志产生)
代码语言:javascript复制
[root@centos7-node4 ~]# tail -f /var/log/messages

Logstash读取日志内容输出到ES

说明:

logstash支持读取日志发送到ES 但是Logstash用来收集日志比较重,后面将对此做优化

操作实战
  1. 清理原先ES集群数据
  1. 配置logstash配置发送日志到ES的配置
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash.conf 
   elasticsearch {
input {
   file {
      path => "/var/log/nginx/access.log"
   }
}

output {
   elasticsearch {
      hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
      user  => "elastic"
      password  => "elastic"
      index  => "nginx-logs-%{ YYYY-MM-dd}"
    }
}
  1. 重启logstash
代码语言:javascript复制
[root@centos7-node4 ~]# systemctl restart logstash
[root@centos7-node4 ~]# ps -ef | grep logstash
  1. 多次访问http://192.168.56.14 (这是我的nginx部署所在)
  2. kibana查看index是否存在
  1. 加入kibana
  1. 数据写入查看

Logstash正则提取Nginx日志

为什么要提取nginx日志?

  • 使用一整行日志无法分析,需要提取单独的字段
  • 分析哪个IP的访问量最大
  • 分析nginx的相应状态码
nginx默认日志的格式与配置
  • 日志格式
代码语言:javascript复制
192.168.56.1 - - [09/Nov/2019:05:24:08  0800] "GET / HTTP/1.1" 200 4833 "-" "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.61 Safari/537.36" "-"
  • 日志格式配置
代码语言:javascript复制
log_format  main  '$remote_addr - $remote_user [$time_local] "$request" '
                  '$status $body_bytes_sent "$http_referer" '
                  '"$http_user_agent" "$http_x_forwarded_for"';
Grok日志提取神器

需要掌握正则表达式,借助kibana的grok工具验证提取

  • 自写正则提取
  • 内置规则提取(提取简化)
代码语言:javascript复制
[root@centos7-node4 ~]# cat /usr/share/logstash/vendor/bundle/jruby/2.5.0/gems/logstash-patterns-core-4.1.2/patterns/grok-patterns 
  1. 普通正则表达式符号

. 表示任意一个字符 * 表示前面一个字符出现0次或者多次 [abc] 表示中括号内的任意一个字符 [^abc] 表示非中括号内的字符 [0-9] 表示数字 [a-z] 表示小写字母 [A-Z] 表示大写字母 [a-zA-Z] 表示所有字母 [a-zA-Z0-9]表示所有字母 数字 [^0-9] 表示非数字 ^xxx 表示以xxx开头 xxx$ 表示以xxx结尾 s 表示空白字符 S 表示非空白字符 d 表示数字

1.1 kibana日志提取操作

  1. 拓展正则表达式,在普通正则表达式的基础上再进行拓展
代码语言:javascript复制
%{IP:remote_addr} - (%{WORD:remote_user}|-) [%{HTTPDATE:time_local}] "%{WORD:method} %{NOTSPACE:request} HTTP/%{NUMBER:http_version}" %{NUMBER:status} %{NUMBER:body_byte_sent} %{QS} %{QS:http_user_agent}
logstash grok将日志写入ES
  • 配置logstash提取日志配置
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash.conf
input {
   file {
      path => "/var/log/nginx/access.log"
   }
}

filter {
  grok {
    match => {
      "message" => '%{IP:remote_addr} - (%{WORD:remote_user}|-) [%{HTTPDATE:time_local}] "%{WORD:method} %{NOTSPACE:request} HTTP/%{NUMBER}" %{NUMBER:status} %{NUMBER:body_bytes_sent} %{QS} %{QS:http_user_agent}'
    }
    remove_field => ["message"]
  }
}

output {
   elasticsearch {
      hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
      user  => "elastic"
      password  => "elastic"
      index  => "nginx-logs-%{ YYYY-MM-dd}"
   }
}
  • 重启logstash
代码语言:javascript复制
[root@centos7-node4 ~]# systemctl restart logstash
[root@centos7-node4 ~]# tail -f /var/log/logstash/logstash-plain.log
[root@centos7-node4 ~]# ps -ef | grep logstash
  • 访问nginx(192.168.56.14)
代码语言:javascript复制
Nginx模拟用户访问
while true;do 
  curl 192.168.56.14/wanghui666
  curl 127.0.0.1
  sleep 2
done
  • 查看kibana
kibana显示感叹号问题的处理

出现感叹号的原因就是重新加入分词,日志字段出现多个的场景

  • kibana索引刷新
  • Kibana索引的操作并不会影响到数据,删除重建也没问题
  • 查看索引

Logstash特殊字段处理与替换

  • 去除字段中的引号
代码语言:javascript复制
mutate {
    gsub => ["http_user_agent",'"',""]
}
  • 数字类型的字符串转换成整型
代码语言:javascript复制
mutate {
    convert => { "status" => "integer" }
    convert => { "body_bytes_sent" => "integer" }
}
  • 完整logstash如下:
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash.conf
input {
   file {
      path => "/var/log/nginx/access.log"
   }
}

filter {
  grok {
    match => {
      "message" => '%{IP:remote_addr} - (%{WORD:remote_user}|-) [%{HTTPDATE:time_local}] "%{WORD:method} %{NOTSPACE:request} HTTP/%{
NUMBER}" %{NUMBER:status} %{NUMBER:body_bytes_sent} %{QS} %{QS:http_user_agent}'
    }
    remove_field => ["message"]
  }
  mutate {
    gsub => ["http_user_agent",'"',""]
    convert => { "status" => "integer" }
    convert => { "body_bytes_sent" => "integer" }
  }
}

output {
   elasticsearch {
      hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
      user  => "elastic"
      password  => "elastic"
      index  => "nginx-logs-%{ YYYY-MM-dd}"
   }
}
  • 重启logstash
代码语言:javascript复制
[root@centos7-node4 ~]# kill -1 $(ps -ef | grep logstash | grep -v grep | awk '{print $2}')
  • 访问nginx
  • 查看kibana结果

Logstash替换时间戳timestamp

存在的问题:

处理过程如下:

  1. 删除index
  1. 修改logstash
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash.conf
input {
   file {
      path => "/var/log/nginx/access.log"
      start_position => "beginning"
      sincedb_path => "/dev/null"
   }
}

filter {
  grok {
    match => {
      "message" => '%{IP:remote_addr} - (%{WORD:remote_user}|-) [%{HTTPDATE:time_local}] "%{WORD:method} %{NOTSPACE:request} HTTP/%{
NUMBER}" %{NUMBER:status} %{NUMBER:body_bytes_sent} %{QS} %{QS:http_user_agent}'
    }
    remove_field => ["message"]
  }
  mutate {
    gsub => ["http_user_agent",'"',""]
    convert => { "status" => "integer" }
    convert => { "body_bytes_sent" => "integer" }
  }
}

output {
   elasticsearch {
      hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
      user  => "elastic"
      password  => "elastic"
      index  => "nginx-logs-%{ YYYY-MM-dd}"
   }
}
  1. 重启logstash
代码语言:javascript复制
[root@centos7-node4 ~]# kill -1 $(ps -ef | grep logstash | grep -v grep | awk '{print $2}')
[root@centos7-node4 ~]# tail -f /var/log/messages
  1. 问题浮现
使用nginx日志中的访问日期覆盖kibana上的时间
  1. logstash配置如下
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash.conf
input {
   file {
      path => "/var/log/nginx/access.log"
      start_position => "beginning"
      sincedb_path => "/dev/null"
   }
}

filter {
  grok {
    match => {
      "message" => '%{IP:remote_addr} - (%{WORD:remote_user}|-) [%{HTTPDATE:time_local}] "%{WORD:method} %{NOTSPACE:request} HTTP/%{
NUMBER}" %{NUMBER:status} %{NUMBER:body_bytes_sent} %{QS} %{QS:http_user_agent}'
    }
    remove_field => ["message"]
  }
  date {
    match => ["time_local", "dd/MMM/yyyy:HH:mm:ss Z"]
    target => "@timestamp"
  }
  mutate {
    gsub => ["http_user_agent",'"',""]
    convert => { "status" => "integer" }
    convert => { "body_bytes_sent" => "integer" }
    remove_field => ["time_local"]
  }
}

output {
   elasticsearch {
      hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
      user  => "elastic"
      password  => "elastic"
      index  => "nginx-logs-%{ YYYY-MM-dd}"
   }
}
  1. 删除日志索引
  1. 重启logstash
代码语言:javascript复制
[root@centos7-node4 ~]# kill -1 $(ps -ef | grep logstash | grep -v grep | awk '{print $2}')
[root@centos7-node4 ~]# tail -f /var/log/messages
  1. kibana展示查看
注意

日志里如果有不同的时间格式,覆盖的时候格式要对应 20/Feb/2019:14:50:06 -> dd/MMM/yyyy:HH:mm:ss 2016-08-24 18:05:39,830 -> yyyy-MM-dd HH:mm:ss,SSS

Logstash正则提取与异常处理

  1. 在nginx日志中加入异常字段
代码语言:javascript复制
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
  1. kibana查看到异常的日志
  1. 将异常数据发送到另外的index
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash.conf                                      
input {
   file {
      path => "/var/log/nginx/access.log"
      start_position => "beginning"
      sincedb_path => "/dev/null"
   }
}
      hosts => ["http://192.168.238.90:9200", "http://192.168.238.92:9200"]
      hosts => ["http://192.168.238.90:9200", "http://192.168.238.92:9200"]
output {
   elasticsearch {
      hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
      user  => "elastic"
      password  => "elastic"
      index  => "nginx-logs-%{ YYYY-MM-dd}"
   }
}

input {
   file {
      path => "/var/log/nginx/access.log"
      start_position => "beginning"
      sincedb_path => "/dev/null"
   }
}

filter {
  grok {
    match => {
      "message" => '%{IP:remote_addr} - (%{WORD:remote_user}|-) [%{HTTPDATE:time_local}] "%{WORD:method} %{NOTSPACE:request} HTTP/%{
NUMBER}" %{NUMBER:status} %{NUMBER:body_bytes_sent} %{QS} %{QS:http_user_agent}'
    }
    remove_field => ["message"]
  }
  date {
    match => ["time_local", "dd/MMM/yyyy:HH:mm:ss Z"]
    target => "@timestamp"
  }
  mutate {
    gsub => ["http_user_agent",'"',""]
    convert => { "status" => "integer" }
    convert => { "body_bytes_sent" => "integer" }
    remove_field => ["time_local"]
  }
}

output {
  if "_grokparsefailure" not in [tags] and "_dateparsefailure" not in [tags] {
    elasticsearch {
      hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
      user => "elastic"
      password => "elastic"
      index => "nginx-logs-%{ YYYY-MM-dd}"
    }
  }
  else{
    elasticsearch {
      hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
      user => "elastic"
      password => "elastic"
      index => "nginx-err-logs-%{ YYYY-MM-dd}"
    }
  }
}
  1. 重启logstash
代码语言:javascript复制
[root@centos7-node4 ~]# kill -1 $(ps -ef | grep logstash | grep -v grep | awk '{print $2}')         
[root@centos7-node4 ~]# tail -f /var/log/messages                               
  1. 再次写入异常数据
代码语言:javascript复制
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
[root@centos7-node4 ~]# echo "sdadsadsdsadsfac sadf fwekfwekwfadfaofcr" >> /var/log/nginx/access.log
  1. 新增kibana 数据

Kibana图形使用简介

  1. 模拟数据(nginx 所在机器)
代码语言:javascript复制
[root@centos7-node4 ~]# while true;do      curl 192.168.56.14/cropy666;      curl 127.0.0.1;      sleep 2;    done
  1. 首页区域 可以根据时间查看访问量:每分钟访问量 可以根据某个字段查询 可以单独看某个字段的统计
  2. Kibana图形有建立,选择terms去查看对应的数据 饼图的创建 pie_remote_addr

表的创建 table_remote_addr

  1. Kibana面板的创建cropy_dash 创建面板 在面板上添加图形
  1. 建议采用Grafana展示

Logstash分析Linux系统日志

  1. 系统日志
代码语言:javascript复制
[root@centos7-node4 ~]# cat /var/log/secure
Nov 21 20:47:54 centos7-node4 polkitd[712]: Registered Authentication Agent for unix-process:2314:2063304 (system bus name :1.177 [/usr/bin/pkttyagent --notify-fd 5 --fallback], object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8)
Nov 21 20:47:56 centos7-node4 polkitd[712]: Unregistered Authentication Agent for unix-process:2314:2063304 (system bus name :1.177, object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus)
  1. 修改系统日志让支持年份
代码语言:javascript复制
[root@centos7-node4 ~]# vim /etc/rsyslog.conf
# Use default timestamp format
# $ActionFileDefaultTemplate RSYSLOG_TraditionalFileFormat
$template tycformat,"%$NOW% %TIMESTAMP:8:15% %hostname% %syslogtag% %msg%n"
$ActionFileDefaultTemplate tycformat

[root@centos7-node4 ~]# systemctl restart rsyslog
  1. 验证是否生效
代码语言:javascript复制
[root@centos7-node4 ~]# tail -f /var/log/secure
Nov 21 20:47:54 centos7-node4 polkitd[712]: Registered Authentication Agent for unix-process:2314:2063304 (system bus name :1.177 [/usr/bin/pkttyagent --notify-fd 5 --fallback], object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8)
Nov 21 20:47:56 centos7-node4 polkitd[712]: Unregistered Authentication Agent for unix-process:2314:2063304 (system bus name :1.177, object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus)
Nov 22 00:05:36 centos7-node4 polkitd[712]: Registered Authentication Agent for unix-process:5177:3249450 (system bus name :1.308 [/usr/bin/pkttyagent --notify-fd 5 --fallback], object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8)
2020-11-22 00:05:36 centos7-node4 polkitd[712]: Unregistered Authentication Agent for unix-process:5177:3249450 (system bus name :1.308, object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus)
2020-11-22 00:06:23 centos7-node4 sshd[5191]: Accepted password for root from 192.168.56.1 port 57635 ssh2
2020-11-22 00:06:23 centos7-node4 sshd[5191]: pam_unix(sshd:session): session opened for user root by (uid=0)
  1. kibana grok debug 解决日志格式问题
代码语言:javascript复制
2020-11-22 00:06:23 centos7-node4 sshd[5191]: Accepted password for root from 192.168.56.1 port 57635 ssh2

%{TIMESTAMP_ISO8601:timestamp} %{NOTSPACE} %{NOTSPACE:procinfo}: (?<secinfo>.*)
  1. logstash采集系统日志(message 日志也类似)
代码语言:javascript复制
[root@centos7-node4 ~]# chmod  r /var/log/secure
[root@centos7-node4 ~]# vim /etc/logstash/conf.d/logstash-sys.conf 
input {
  file {
    path => "/var/log/secure"
  }
}
filter {
  grok {
    match => {
      "message" => '%{TIMESTAMP_ISO8601:timestamp} %{NOTSPACE} %{NOTSPACE:procinfo}: (?<secinfo>.*)'
    }
    remove_field => ["message"]
  }
  date {
    match => ["timestamp", "yyyy-MM-dd HH:mm:ss"]
    target => "@timestamp"
  }
  mutate {
    remove_field => ["timestamp"]
  }
}
output {
  elasticsearch {
    hosts => ["http://192.168.56.11:9200","192.168.56.12:9200","192.168.56.13:9200"]
    user => "elastic"
    password => "elastic"
    index => "system-secure-%{ YYYY.MM.dd}"
  }
}
  1. 重启logstash
代码语言:javascript复制
[root@centos7-node4 ~]# systemctl restart logstash    
[root@centos7-node4 ~]# tail -f /var/log/messages
  1. kibana加入index

0 人点赞