Performance analysis
This directory contains tools for analyzing tapper performance.
Periodic tapper logs
The tapper logs contain periodic lines that show the tapper's internal state and resource consumption.
Internal state example (formatted and commented):
stats - {
"processedBytes":468940592, // how many bytes we read from pcap
"packetsCount":174883, // how many packets we read from pcap
"tcpPacketsCount":174883, // how many tcp packets we read from pcap
"reassembledTcpPayloadsCount":66893, // how many chunks sent to tcp stream
"matchedPairs":24821, // how many request response pairs found
"droppedTcpStreams":2 // how many tcp streams remained stale and dropped
}
Consumed resources example (formatted and commented):
mem: 24441240, // Go heap size, in bytes
goroutines: 29, // number of goroutines
cpu: 91.208791, // CPU usage of the tapper process, as a percentage of a single core
cores: 16, // number of cores on the machine
rss: 87052288 // resident set size of the tapper process, in bytes
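These periodic lines can be pulled out of a log with standard tools. A minimal sketch (the log file name tapper.log is an assumption; the "stats - " and "mem: " prefixes are taken from the examples above):
grep 'stats - ' tapper.log   # internal state lines
grep 'mem: ' tapper.log      # consumed resources lines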
Plot tapper logs
To plot one or more tapper logs into a graph, use the plot_from_tapper_logs.py utility. It takes a list of tapper log files as parameters and outputs an image with a graph.
Log files should be named in the format XX_DESCRIPTION.log, where XX is a number that determines the color of the series in the output graph and DESCRIPTION is the name of the series. This makes it easy to compare runs from various tapper modes.
Example run:
cd $MIZU_HOME/performance_analysis
virtualenv venv
source venv/bin/activate
pip install -r requirements.txt
python plot_from_tapper_logs.py 00_tapper.log
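To compare several runs on one graph, pass multiple log files. The file names below are hypothetical, chosen only to illustrate the XX_DESCRIPTION.log convention:
python plot_from_tapper_logs.py 00_no_pcap.log 01_no_assembler.log 02_regular.log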
Tapper Modes
Every packet seen by the tapper is processed in a pipeline consisting of the following stages:
- Pcap - Read the packet from libpcap
- Assembler - Assemble the packet into a TcpStream
- TcpStream - Hold stream information and TcpReaders
- Dissectors - Read from the TcpReader and recognize the packet's content and protocol
- Emit - Marshal the request-response pair into JSON
- Send - Send the JSON to the API server
The tapper can be run in various debug modes, each cutting the pipeline off at a different stage:
- No Pcap - Start the tapper process, but don't read any packets from pcap
- No Assembler - Read packets from pcap, but don't assemble them
- No TcpStream - Assemble the packets, but don't create TcpStreams for them
- No Dissectors - Create TcpStreams for the packets, but don't dissect their content
- No Emit - Dissect the TcpStreams, but don't emit the matched request-response pairs
- No Send - Emit the request-response pairs, but don't send them to the API server
- Regular mode - Run the full pipeline, with nothing disabled
Run the benchmark with various tapper modes
Prerequisites
In order to run the benchmark you will probably want:
- An up and running API server
- An up and running Basenine
- An up and running UI (optional)
- An up and running test server, like nginx, that can return a known payload at a known endpoint
- The MIZU_HOME environment variable set to point to the mizu directory
- The hey tool installed (see the sketch after this list)
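A minimal setup sketch for the last two prerequisites (the path is a placeholder, and installing hey via go install is one option among several):
export MIZU_HOME=/path/to/mizu        # placeholder; adjust to where you cloned mizu
go install github.com/rakyll/hey@latest   # assumes a Go toolchain is available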
Running the benchmark
In order to run a benchmark, use the run_tapper_benchmark.sh script.
Example run:
cd $MIZU_HOME/performance_analysis
source venv/bin/activate # Assuming you already ran plot_from_tapper_logs.py
./run_tapper_benchmark.sh
Running it without parameters uses the default values. Use the following environment variables to customize the run:
export MIZU_BENCHMARK_OUTPUT_DIR=/path/to/dir # Output directory for tapper logs and graphs
export MIZU_BENCHMARK_CLIENT_PERIOD=1m # How long each test runs
export MIZU_BENCHMARK_URL=http://server:port/path # The URL used for the benchmark (the test server endpoint)
export MIZU_BENCHMARK_RUN_COUNT=3 # How many times each tapper mode should run
export MIZU_BENCHMARK_QPS=250 # How many queries per second each client should send to the test server
export MIZU_BENCHMARK_CLIENTS_COUNT=5 # How many clients should run in parallel during the benchmark
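For example, a customized run matching the long experiment shown below might look like this (the values are illustrative only):
export MIZU_BENCHMARK_CLIENT_PERIOD=15m
export MIZU_BENCHMARK_QPS=1000
./run_tapper_benchmark.sh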
Example output graph
An example output graph from a 15 min run with a 15K payload at 1000 QPS looks like this (see example-graph.png in this directory).