Efficient SIEM and Detection Engineering in 10 steps (Mar 25, 2023): SIEM systems and detection engineering are not just about data and detection rules. Planning and processes are becoming increasingly…
10 most important MITRE ATT&CK sources in one click using Pandas (Apr 5, 2023): MITRE ATT&CK is a source of knowledge about adversarial tactics and techniques. It is a common domain language in the world of cyber…
How To Clean Data with Python Pandas — Vehicles registered in Poland (Mar 19, 2023): Thanks to the Open Data project, we have sources made available by Polish public entities. In this article, we will prepare and clean the…
ksqlDB — real-time SQL magic in the cybersecurity scenario — part 1 (Feb 4, 2022): ksqlDB is a solution from the Apache Kafka and Confluent family. It allows you to use SQL to define stream processing jobs. This story…
Change Data Capture — Convert your database into a stream with Debezium (Jan 30, 2021): Have you ever thought about creating a stream from database operations? In this story, you will learn what Change Data Capture is and how…
How to use Variables and XCom in Apache Airflow? (Dec 11, 2020): It is said that Apache Airflow is CRON on steroids. It is gaining popularity among tools for ETL orchestration (scheduling, managing and…
Readable Scale Code in Apache Spark (4 attempts) (Oct 31, 2020): Jupyter and Apache Zeppelin are good places to experiment with data. Unfortunately, the specifics of notebooks do not encourage to…
Twitter Data Analysis for the Lazy in Elastic Stack (Xbox VS PlayStation) (Oct 18, 2020, published in The Startup): Twitter data can be obtained in many ways, but who wants to write the code? 😉 Especially code that will work 24/7. In Elastic Stack you…
Kafka Connect in a nutshell (Oct 6, 2020, published in ITNEXT): Kafka Connect is part of the Apache Kafka platform. It is used to connect Kafka with external services such as file systems and databases…