Philipp Brunenberg (@p_brunenberg) 's Twitter Profile
Philipp Brunenberg

@p_brunenberg

Co-creator of @DayCaptainApp - developer productivity tool with vim-keybindings, clean UI, open API | Developer | Data Engineer

ID: 714896942799712256

linkhttp://philipp-brunenberg.de calendar_today29-03-2016 19:28:08

636 Tweet

446 Followers

160 Following

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

A naive bayes classifier can yield similar results as a neural network. Learn how to implement one in plain Cypher: youtu.be/YbHbsaJhhKM

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

To materialize a physical plan, #ApacheSpark schedules tasks on the workers. In fact, the workers have one single functionality: to execute a task. Let's explore how a physical plan is transformed into schedulable tasks. youtu.be/-iF7_tDtc3k

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Naive Bayes classification is naive because it assumes independence of features. The maths are very simple. However, it works astonishingly well for our recommendation engine. youtu.be/YbHbsaJhhKM

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

How to build a recommendation engine in #neo4j. ⏫ Naive Bayes classification 🚀 Performs as good as a deep neural network ✅ Just a handful lines of code youtu.be/YbHbsaJhhKM

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

What's the fundamental principle enabling Apache Spark to parallelize workload? I'm creating a video series on the internals of Spark. The first video is a deep dive into Spark's most profound principle: MapReduce. youtu.be/BKecoKKs91A

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

A sampling strategy enables us to find the structural contexts of two nodes in a graph. We can use these contexts to find their similarity - for example using the Jaccard set similarity metric. node2vec uses contexts to map nodes to vectors. youtu.be/pS_POUVFXvk

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Consequently, Spark core’s optimization strategy is limited to pipeline narrow transformations into stages. towardsdatascience.com/the-bigger-pic…

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

I'm creating a free Apache Spark video course on YouTube. 🚀 ✅ Now pre-knowledge required 💡 Step-by-step explanations 🏋️‍♀️ Hands-on exercises Check out Video 1: Setting up Spark (IntelliJ, SBT, Scala, Spark) youtu.be/4kb5Ju3TJLs #apachespark

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

In part 2️⃣ of the #ApacheSpark with #Scala course, we will use our development setup to run our first Spark application. 💯 - Read csv file and explore documentation - Run local standalone Spark (3.5.0 with Java 17) - Explore IntelliJ IDEA, a JetBrains IDE run templates youtu.be/i_HyEzEOW-E?si…

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

🚨 New video in Apache Spark with Scala Course - Video 10: What is Spark? 🔨 By now, we have only talked about how to use Spark. 🕵️‍♀️ Today, we take the first glance at what it actually is and what we use it for ... youtu.be/KT0pkhAXDBw

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 3 ✅ Inspect the schema of a DataFrame 📖 Explore Spark SQL data types in the official documentation 🔎 Inferring the schema while reading a csv file youtu.be/as6F8TRmyl8 #ApacheSpark #Scala #Course

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 4 💡 What's the Spark SQL API? ❓ How do Datasets and DataFrames relate? 🔎 What's the Spark DSL? youtu.be/jQBYL_zpbhs

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 5️⃣ How can we reference columns in the Spark DSL? When working with the SparkSQL DSL, we mostly would like to do something with the columns of a Dataset. youtu.be/b_WX3QPCUOI

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 6️⃣ ❓ What can we do with Columns in the SparkSQL API? 💡 Have you ever looked at the Column Class in SparkSQL? ✅ We'll explore the source code and the documentation. youtu.be/9udk6AcgrLU

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 7️⃣ 💡 There exist more functions to transform columns in Spark SQL. 🧚‍♂️ They offer a wide world of data transformations ... We'll explore their implementation and documentation. youtu.be/k-UBdSLlEOs

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 8️⃣ 🚀 There is also a SQL API in Spark ... 🧐 Should we use it or not? 💡 Which functions can we use in our SQL expressions? youtu.be/3GrfsF_095o

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 9️⃣ Let's plug everything we learned together in our first assignment ... Loading data, inferring schemas, accessing, renaming and transforming columns. youtu.be/nxJHiPIe9SU

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 10 🔨 By now, we have only talked about how to use Spark. 🕵️‍♀️ Today, we take the first glance at what it actually is and what we use it for ... youtu.be/KT0pkhAXDBw