mirror of https://github.com/falcosecurity/falco.git
synced 2025-10-26 14:43:51 +00:00

* Add new json/webserver libs, embedded webserver
Add two new external libraries:
 - nlohmann-json is a better json library that makes stronger use of C++
   features like type deduction, better conversion from STL structures,
   etc. We'll use it to hold generic json objects instead of jsoncpp.
 - civetweb is an embeddable webserver that will allow us to accept
   POSTed json data.
New files webserver.{cpp,h} start an embedded webserver that listens for
POSTs on a configurable url and passes the json data to the falco
engine.
New falco config items are under webserver:
  - enabled: true|false. Whether to start the embedded webserver or not.
  - listen_port: port that the webserver listens on.
  - k8s_audit_endpoint: uri on which to accept POSTed k8s audit events.
(This commit doesn't compile entirely on its own, but we're grouping
these related changes into one commit for clarity).
* Don't use relative paths to find lua code
You can look directly below PROJECT_SOURCE_DIR.
* Reorganize compiler lua code
The lua compiler code is generic enough to work on more than just
sinsp-based rules, so move the parts of the compiler related to event
types and filterchecks out into a standalone lua file
sinsp_rule_utils.lua.
The checks for event types/filterchecks are now done from rule_loader,
and are dependent on a "source" attribute of the rule being
"sinsp". We'll be adding additional types of events next that come from
sources other than system calls.
* Manage separate syscall/k8s audit rulesets
Add the ability to manage separate sets of rules (syscall and
k8s_audit). Stop using the sinsp_evttype_filter object from the sysdig
repo, replacing it with falco_ruleset/falco_sinsp_ruleset from
ruleset.{cpp,h}. It has the same methods to add rules, associate them
with rulesets, and (for syscall) quickly find the relevant rules for a
given syscall/event type.
At the falco engine level, there are new parallel interfaces for both
types of rules (syscall and k8s_audit) to:
  - add a rule: add_k8s_audit_filter/add_sinsp_filter
  - match an event against rules, possibly returning a result:
    process_sinsp_event/process_k8s_audit_event
At the rule loading level, the mechanics of creating filtercheck
objects are handled by two factories (sinsp_filter_factory and
json_event_filter_factory), both of which are held by the engine.
* Handle multiple rule types when parsing rules
Modify the steps of parsing a rule's filter expression to handle
multiple types of rules. Notable changes:
 - In the rule loader/ast traversal, pass a filter api object down,
   which is passed back up in the lua parser api calls like nest(),
   bool_op(), rel_expr(), etc.
 - The filter api object is either the sinsp factory or k8s audit
   factory, depending on the rule type.
 - When the rule is complete, the complete filter is passed to the
   engine using either add_sinsp_filter() or add_k8s_audit_filter().
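
The hand-off described above can be sketched in a few lines of Lua. This is only an illustration, not the rule loader's actual code: the recording api object, the unnest() call, the argument order of rel_expr(), the "syscall"/"k8s_audit" table keys, and the sample field values are all assumptions made for the example. Only nest(), bool_op(), and rel_expr() are named in this commit, and the node types are the ones the parser's AST uses.

```lua
-- Hypothetical stand-in for a filter api object: it only records the calls it gets.
local function make_recording_api(name)
   local api = { name = name, calls = {} }
   local function record(entry) api.calls[#api.calls + 1] = entry end
   function api.nest() record("nest") end
   function api.unnest() record("unnest") end
   function api.bool_op(op) record("bool_op " .. op) end
   function api.rel_expr(field, op, value)
      record("rel_expr " .. field .. " " .. op .. " " .. tostring(value))
   end
   return api
end

-- Walk the AST and forward each node to whichever api object was passed down.
local function install(api, node)
   local t = node.type
   if t == "BinaryBoolOp" then
      api.nest()
      install(api, node.left)
      api.bool_op(node.operator)
      install(api, node.right)
      api.unnest()
   elseif t == "UnaryBoolOp" then
      api.nest()
      api.bool_op(node.operator)
      install(api, node.argument)
      api.unnest()
   elseif t == "BinaryRelOp" then
      api.rel_expr(node.left.value, node.operator, node.right.value)
   else
      error("unhandled node type: " .. tostring(t))
   end
end

-- One api object per rule source (keys are illustrative).
local apis = {
   syscall = make_recording_api("sinsp"),
   k8s_audit = make_recording_api("k8s_audit"),
}

-- Hand-built AST for: ka.user.name = admin and ka.target.name = nginx
local ast = {
   type = "BinaryBoolOp", operator = "and",
   left = { type = "BinaryRelOp", operator = "=",
            left = { type = "FieldName", value = "ka.user.name" },
            right = { type = "BareString", value = "admin" } },
   right = { type = "BinaryRelOp", operator = "=",
             left = { type = "FieldName", value = "ka.target.name" },
             right = { type = "BareString", value = "nginx" } },
}

local api = apis["k8s_audit"]
install(api, ast)
for _, call in ipairs(api.calls) do print(call) end
```

Because the traversal only ever talks to the api object it was given, the same walk serves both rule types; the choice of factory is made once, from the rule's source.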
* Add multiple output formatting types
Add support for multiple output formatters. Notable changes:
 - The falco engine is passed along to falco_formats to gain access to
   the engine's factories.
 - When creating a formatter, the source of the rule is passed along
   with the format string, which controls which kind of output formatter
   is created.
Also clean up exception handling a bit so all lua callbacks catch all
exceptions and convert them into lua errors.
* Add support for json, k8s audit filter fields
With some corresponding changes in sysdig, you can now create general
purpose filter fields and events, which can be tied together with
nesting, expressions, and relational operators. The classes here
represent an instance of these fields devoted to generic json objects as
well as k8s audit events. Notable changes:
 - json_event: holds a json object, used by all of the below
 - json_event_filter_check: Has the ability to extract values out of a
   json_event object and has the ability to define macros that associate
   a field like "group.field" with a json pointer expression that
   extracts a single property's value out of the json object. The basic
   field definition also allows creating an index
   e.g. group.field[index], where a std::function is responsible for
   performing the indexing. This class has virtual void methods so it
   must be overridden.
 - jevt_filter_check: subclass of json_event_filter_check and defines
   the following fields:
     - jevt.time/jevt.rawtime: extracts the time from the underlying json object.
     - jevt.value[<json pointer>]: general purpose way to extract any
       json value out of the underlying object. <json pointer> is a json
       pointer expression
     - jevt.obj: Return the entire object, stringified.
 - k8s_audit_filter_check: implements fields that extract values from
   k8s audit events. Most of the implementation is in the form of macros
   like ka.user.name, ka.uri, ka.target.name, etc. that just use json
   pointers to extract the appropriate value from a k8s audit event. More
   advanced fields like ka.uri.param, ka.req.container.image use
   indexing to extract individual values out of maps or arrays.
 - json_event_filter_factory: used by things like the lua parser api,
   output formatter, etc to create the necessary objects and return
   them.
 - json_event_formatter: given a format string, create the necessary
   fields that will be used to create a resolved string when given a
   json_event object.
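
To make the "field name to json pointer" idea concrete, here is a small Lua sketch that resolves a json pointer against an already-decoded object (a plain Lua table). It is not the implementation in this commit, which is C++ on top of nlohmann-json. The pointer strings in field_pointers, the helper names, and the sample event are assumptions chosen for illustration.

```lua
-- Split a json pointer such as "/a/b~1c" into unescaped reference tokens.
local function pointer_tokens(pointer)
   local tokens = {}
   local pos = 2                               -- skip the leading "/"
   while pos <= #pointer + 1 do
      local slash = pointer:find("/", pos, true)
      local raw = pointer:sub(pos, slash and slash - 1 or #pointer)
      -- "~1" decodes to "/" and "~0" decodes to "~", in that order
      local token = raw:gsub("~1", "/"):gsub("~0", "~")
      tokens[#tokens + 1] = token
      if not slash then break end
      pos = slash + 1
   end
   return tokens
end

-- Follow a json pointer through nested tables; returns nil if any step is missing.
local function resolve_pointer(doc, pointer)
   if pointer == "" then return doc end        -- empty pointer: the whole document
   if pointer:sub(1, 1) ~= "/" then return nil end
   local node = doc
   for _, token in ipairs(pointer_tokens(pointer)) do
      if type(node) ~= "table" then return nil end
      local child = node[token]
      if child == nil and token:match("^%d+$") then
         child = node[tonumber(token) + 1]     -- json arrays are 0-based, Lua's are 1-based
      end
      if child == nil then return nil end
      node = child
   end
   return node
end

-- Hypothetical macro table: field name -> json pointer.
local field_pointers = {
   ["ka.user.name"] = "/user/username",
   ["ka.uri"] = "/requestURI",
}

local function extract(field, event)
   local pointer = field_pointers[field]
   if not pointer then return nil end
   return resolve_pointer(event, pointer)
end

-- Illustrative event, shaped loosely like a k8s audit event.
local event = {
   user = { username = "admin" },
   requestURI = "/api/v1/pods",
   requestObject = { spec = { containers = { { image = "nginx" }, { image = "busybox" } } } },
}

print(extract("ka.user.name", event))
print(resolve_pointer(event, "/requestObject/spec/containers/1/image"))
```

A general-purpose field such as jevt.value[<json pointer>] would correspond to calling resolve_pointer directly with the pointer supplied in the brackets.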
* Add ability to list fields
Similar to sysdig's -l option, add --list (<source>) to list the fields
supported by falco. With no source specified, it prints all
fields. Source can be "syscall" for inspector fields, i.e. those
supported by sysdig, or "k8s_audit" to list fields supported only by the
k8s audit support in falco.
* Initial set of k8s audit rules
Add an initial set of k8s audit rules. They're broken into 3 classes of
rules:
 - Suspicious activity: this includes things like:
    - A disallowed k8s user performing an operation
    - A disallowed container being used in a pod.
    - A pod created with a privileged container.
    - A pod created with a sensitive mount.
    - A pod using host networking
    - Creating a NodePort Service
    - A configmap containing private credentials
    - A request being made by an unauthenticated user.
    - Attach/exec to a pod. (We eventually want to also do privileged
      pods, but that will require some state management that we don't
      currently have).
    - Creating a new namespace outside of an allowed set
    - Creating a pod in either of the kube-system/kube-public namespaces
    - Creating a serviceaccount in either of the kube-system/kube-public
      namespaces
    - Modifying any role starting with "system:"
    - Creating a clusterrolebinding to the cluster-admin role
    - Creating a role that wildcards verbs or resources
    - Creating a role with writable permissions/pod exec permissions.
 - Resource tracking: this includes noting when a deployment, service,
   configmap, cluster role, service account, etc. are created or destroyed.
 - Audit tracking: This tracks all audit events.
To support these rules, add macros/new indexing functions as needed to
support the required fields and ways to index the results.
* Add ability to read trace files of k8s audit evts
Expand the use of the -e flag to cover both .scap files containing
system calls as well as jsonl files containing k8s audit events:
If a trace file is specified, first try to read it using the
inspector. If that throws an exception, try to read the first line as
json. If both fail, return an error.
Based on the results of the open, the main loop either calls
do_inspect(), looping over system events, or
read_k8s_audit_trace_file(), reading each line as json and passing it to
the engine and outputs.
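
The fallback order above can be sketched as follows. The real logic lives in falco's C++ main loop. The two opener callbacks, the returned "syscall"/"k8s_audit" labels, and the stub file-name checks are hypothetical stand-ins so that the control flow can run on its own.

```lua
-- Try the inspector first, then fall back to treating the file as jsonl.
-- Both openers are expected to raise an error on failure.
local function open_trace(path, open_with_inspector, read_first_json_line)
   local ok, scap_err = pcall(open_with_inspector, path)
   if ok then
      return "syscall"       -- the main loop would go on to do_inspect()
   end
   local ok_json, json_err = pcall(read_first_json_line, path)
   if ok_json then
      return "k8s_audit"     -- the main loop would go on to read_k8s_audit_trace_file()
   end
   return nil, string.format("could not open %s: %s; %s",
                             path, tostring(scap_err), tostring(json_err))
end

-- Stubs that decide by file extension only, purely for demonstration.
local function inspector_stub(path)
   if not path:match("%.scap$") then error("not a scap file") end
end

local function json_stub(path)
   if not path:match("%.jsonl?$") then error("first line is not json") end
end

print(open_trace("events.scap", inspector_stub, json_stub))
print(open_trace("audit.jsonl", inspector_stub, json_stub))
print(open_trace("notes.txt", inspector_stub, json_stub))
```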
* Example showing how to enable k8s audit logs.
An example of how to enable k8s audit logging for minikube.
* Add unit tests for k8s audit support
Initial unit test support for k8s audit events. A new multiplex file
falco_k8s_audit_tests.yaml defines the tests. Traces (jsonl files) are
in trace_files/k8s_audit and new rules files are in
test/rules/k8s_audit.
Current test cases include:
- User outside allowed set
- Creating disallowed pod.
- Creating a pod explicitly on the allowed list
- Creating a pod w/ a privileged container (or second container), or a
  pod with no privileged container.
- Creating a pod w/ a sensitive mount container (or second container), or a
  pod with no sensitive mount.
- Cases for a trace w/o the relevant property + the container being
  trusted, and hostnetwork tests.
- Tests that create a Service w/ and w/o a NodePort type.
- Tests for configmaps: one set tries each disallowed string, ensuring each
  is detected, and the other has a configmap with no disallowed string,
  ensuring it is not detected.
- The anonymous user creating a namespace.
- Tests for all kactivity rules e.g. those that create/delete
  resources as compared to suspicious activity.
- Exec/Attach to Pod
- Creating a namespace outside of an allowed set
- Creating a pod/serviceaccount in kube-system/kube-public namespaces
- Deleting/modifying a system cluster role
- Creating a binding to the cluster-admin role
- Creating a cluster role binding that wildcards verbs or resources
- Creating a cluster role with write/pod exec privileges
* Don't manually install gcc 4.8
gcc 4.8 should already be installed by default on the vm we use for
travis.
		
	
		
			
				
	
	
		
362 lines, 9.9 KiB, Lua
		
	
	
	
	
	
			
		
		
	
	
		
	
	
	
	
	
-- Copyright (C) 2016-2018 Draios Inc dba Sysdig.
--
-- This file is part of falco.
--
-- Licensed under the Apache License, Version 2.0 (the "License");
-- you may not use this file except in compliance with the License.
-- You may obtain a copy of the License at
--
--     http://www.apache.org/licenses/LICENSE-2.0
--
-- Unless required by applicable law or agreed to in writing, software
-- distributed under the License is distributed on an "AS IS" BASIS,
-- WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
-- See the License for the specific language governing permissions and
-- limitations under the License.
--

--[[
   Falco grammar and parser.

   Much of the scaffolding and helpers was derived from Andre Murbach Maidl's Lua parser (https://github.com/andremm/lua-parser).

   Parses regular filters following the existing sysdig filter syntax (*), extended to support "macro" terms, which are just identifiers.

   (*) There is currently one known difference with the syntax implemented in libsinsp: In libsinsp, field names cannot start with 'a', 'o', or 'n'. With this parser they can.

--]]

local parser = {}

local lpeg = require "lpeg"

lpeg.locale(lpeg)

local P, S, V = lpeg.P, lpeg.S, lpeg.V
local C, Carg, Cb, Cc = lpeg.C, lpeg.Carg, lpeg.Cb, lpeg.Cc
local Cf, Cg, Cmt, Cp, Ct = lpeg.Cf, lpeg.Cg, lpeg.Cmt, lpeg.Cp, lpeg.Ct
local alpha, digit, alnum = lpeg.alpha, lpeg.digit, lpeg.alnum
local xdigit = lpeg.xdigit
local space = lpeg.space


-- error message auxiliary functions

-- creates an error message for the input string
local function syntaxerror (errorinfo, pos, msg)
  local error_msg = "%s: syntax error, %s"
  return string.format(error_msg, pos, msg)
end

-- gets the farthest failure position
local function getffp (s, i, t)
  return t.ffp or i, t
end

-- gets the table that contains the error information
local function geterrorinfo ()
  return Cmt(Carg(1), getffp) * (C(V"OneWord") + Cc("EOF")) /
  function (t, u)
    t.unexpected = u
    return t
  end
end

-- creates an error message using the farthest failure position
local function errormsg ()
  return geterrorinfo() /
  function (t)
    local p = t.ffp or 1
    local msg = "unexpected '%s', expecting %s"
    msg = string.format(msg, t.unexpected, t.expected)
    return nil, syntaxerror(t, p, msg)
  end
end

-- reports a syntactic error
local function report_error ()
  return errormsg()
end

-- sets the farthest failure position and the expected tokens
local function setffp (s, i, t, n)
  if not t.ffp or i > t.ffp then
    t.ffp = i
    t.list = {} ; t.list[n] = n
    t.expected = "'" .. n .. "'"
  elseif i == t.ffp then
    if not t.list[n] then
      t.list[n] = n
      t.expected = "'" .. n .. "', " .. t.expected
    end
  end
  return false
end

local function updateffp (name)
  return Cmt(Carg(1) * Cc(name), setffp)
end

-- regular combinators and auxiliary functions

local function token (pat, name)
  return pat * V"Skip" + updateffp(name) * P(false)
end

local function symb (str)
  return token (P(str), str)
end

local function kw (str)
  return token (P(str) * -V"idRest", str)
end


local function list (pat, sep)
   return Ct(pat^-1 * (sep * pat^0)^0) / function(elements) return {type = "List", elements=elements} end
end

--http://lua-users.org/wiki/StringTrim
function trim(s)
   if (type(s) ~= "string") then return s end
  return (s:gsub("^%s*(.-)%s*$", "%1"))
end
parser.trim = trim

local function terminal (tag)
   -- Rather than trim the whitespace in this way, it would be nicer to exclude it from the capture...
   return token(V(tag), tag) / function (tok)
      -- val is local so that it does not leak into the global environment
      local val = tok
      if tag ~= "String" then val = trim(tok) end
      return { type = tag, value = val }
   end
end

local function unaryboolop (op, e)
  return { type = "UnaryBoolOp", operator = op, argument = e }
end

local function unaryrelop (e, op)
  return { type = "UnaryRelOp", operator = op, argument = e }
end

local function binaryop (e1, op, e2)
  if not op then
     return e1
  else
     return { type = "BinaryBoolOp", operator = op, left = e1, right = e2 }
  end
end

local function bool (pat, sep)
  return Cf(pat * Cg(sep * pat)^0, binaryop)
end

local function rel (left, sep, right)
   return left * sep * right / function(e1, op, e2) return { type = "BinaryRelOp", operator = op, left = e1, right = e2 } end
end

local function fix_str (str)
  str = string.gsub(str, "\\a", "\a")
  str = string.gsub(str, "\\b", "\b")
  str = string.gsub(str, "\\f", "\f")
  str = string.gsub(str, "\\n", "\n")
  str = string.gsub(str, "\\r", "\r")
  str = string.gsub(str, "\\t", "\t")
  str = string.gsub(str, "\\v", "\v")
  str = string.gsub(str, "\\\n", "\n")
  str = string.gsub(str, "\\\r", "\n")
  str = string.gsub(str, "\\'", "'")
  str = string.gsub(str, '\\"', '"')
  str = string.gsub(str, '\\\\', '\\')
  return str
end

-- grammar


local function filter(e)
   return {type = "Filter", value=e}
end

local function rule(filter)
   return {type = "Rule", filter = filter}
end

local G = {
   V"Start", -- Entry rule

   Start = V"Skip" * (V"Comment" + V"Rule" / rule)^-1 * -1 + report_error();

  -- Grammar
   Comment = P"#" * P(1)^0;

   Rule = V"Filter" / filter * ((V"Skip")^-1 );

   Filter = V"OrExpression";
  OrExpression =
     bool(V"AndExpression", V"OrOp");

  AndExpression =
     bool(V"NotExpression", V"AndOp");

  NotExpression =
     V"UnaryBoolOp" * V"NotExpression" / unaryboolop +
     V"ExistsExpression";

  ExistsExpression =
     terminal "FieldName" * V"ExistsOp" / unaryrelop +
     V"MacroExpression";

  MacroExpression =
     terminal "Macro" +
     V"RelationalExpression";

  RelationalExpression =
     rel(terminal "FieldName", V"RelOp", V"Value") +
     rel(terminal "FieldName", V"InOp", V"InList") +
     rel(terminal "FieldName", V"PmatchOp", V"InList") +
     V"PrimaryExp";

  PrimaryExp = symb("(") * V"Filter" * symb(")");

  FuncArgs = symb("(") * list(V"Value", symb(",")) * symb(")");

  -- Terminals
  Value = terminal "Number" + terminal "String" + terminal "BareString";

  InList = symb("(") * list(V"Value", symb(",")) * symb(")");


  -- Lexemes
  Space = space^1;
  Skip = (V"Space")^0;
  idStart = alpha + P("_");
  idRest = alnum + P("_");
  Identifier = V"idStart" * V"idRest"^0;
  Macro = V"idStart" * V"idRest"^0 * -P".";
  Int = digit^1;
  PathString = (alnum + S'.-_/*?')^1;
  Index = V"Int" + V"PathString";
  FieldName = V"Identifier" * (P"." + V"Identifier")^1 * (P"[" * V"Index" * P"]")^-1;
  Name = C(V"Identifier") * -V"idRest";
  Hex = (P("0x") + P("0X")) * xdigit^1;
  Expo = S("eE") * S("+-")^-1 * digit^1;
  Float = (((digit^1 * P(".") * digit^0) +
          (P(".") * digit^1)) * V"Expo"^-1) +
          (digit^1 * V"Expo");
  Number = C(V"Hex" + V"Float" + V"Int") /
           function (n) return tonumber(n) end;
  String = (P'"' * C(((P'\\' * P(1)) + (P(1) - P'"'))^0) * P'"' +  P"'" * C(((P"\\" * P(1)) + (P(1) - P"'"))^0) * P"'")  / function (s) return fix_str(s) end;
  BareString = C(((P(1) - S' (),='))^1);

  OrOp = kw("or") / "or";
  AndOp = kw("and") / "and";
  Colon = kw(":");
  -- "==" is tried before "=": PEG choice is ordered, so "=" would otherwise
  -- consume the first character of "==" and the rest of the match would fail.
  RelOp = symb("==") / "==" +
          symb("=") / "=" +
          symb("!=") / "!=" +
          symb("<=") / "<=" +
          symb(">=") / ">=" +
          symb("<") / "<" +
          symb(">") / ">" +
          symb("contains") / "contains" +
          symb("icontains") / "icontains" +
          symb("glob") / "glob" +
          symb("startswith") / "startswith" +
          symb("endswith") / "endswith";
  InOp = kw("in") / "in";
  PmatchOp = kw("pmatch") / "pmatch";
  UnaryBoolOp = kw("not") / "not";
  ExistsOp = kw("exists") / "exists";

  -- for error reporting
  OneWord = V"Name" + V"Number" + V"String" +  P(1);
}

--[[
   Parses a single filter and returns the AST. On a syntax error, returns
   nil plus an error message built from the farthest failure position.
--]]
function parser.parse_filter (subject)
  local errorinfo = { subject = subject }
  lpeg.setmaxstack(1000)
  local ast, error_msg = lpeg.match(G, subject, nil, errorinfo)
  return ast, error_msg
end

function print_ast(ast, level)
   local t = ast.type
   level = level or 0
   local prefix = string.rep(" ", level*4)
   level = level + 1

   if t == "Rule" then
      print_ast(ast.filter, level)
   elseif t == "Filter" then
      print_ast(ast.value, level)

   elseif t == "BinaryBoolOp" or t == "BinaryRelOp" then
      print(prefix..ast.operator)
      print_ast(ast.left, level)
      print_ast(ast.right, level)

   elseif t == "UnaryRelOp" or t == "UnaryBoolOp" then
      print (prefix..ast.operator)
      print_ast(ast.argument, level)

   elseif t == "List" then
      for i, v in ipairs(ast.elements) do
         print_ast(v, level)
      end

   elseif t == "FieldName" or t == "Number" or t == "String" or t == "BareString" or t == "Macro" then
      print (prefix..t.." "..ast.value)

   elseif t == "MacroDef" then
      -- don't print for now
   else
      error ("Unexpected type in print_ast: "..t)
   end
end
parser.print_ast = print_ast

-- Traverse the provided ast and call the provided callback function
-- for any nodes of the specified type. The callback function should
-- have the signature:
--     cb(ast_node, ctx)
-- ctx is optional.
function traverse_ast(ast, node_types, cb, ctx)
   local t = ast.type

   if node_types[t] ~= nil then
      cb(ast, ctx)
   end

   if t == "Rule" then
      traverse_ast(ast.filter, node_types, cb, ctx)

   elseif t == "Filter" then
      traverse_ast(ast.value, node_types, cb, ctx)

   elseif t == "BinaryBoolOp" or t == "BinaryRelOp" then
      traverse_ast(ast.left, node_types, cb, ctx)
      traverse_ast(ast.right, node_types, cb, ctx)

   elseif t == "UnaryRelOp" or t == "UnaryBoolOp" then
      traverse_ast(ast.argument, node_types, cb, ctx)

   elseif t == "List" then
      for i, v in ipairs(ast.elements) do
         traverse_ast(v, node_types, cb, ctx)
      end

   elseif t == "MacroDef" then
      traverse_ast(ast.value, node_types, cb, ctx)

   elseif t == "FieldName" or t == "Number" or t == "String" or t == "BareString" or t == "Macro" then
      -- do nothing, no traversal needed

   else
      error ("Unexpected type in traverse_ast: "..t)
   end
end
parser.traverse_ast = traverse_ast

return parser