Philipp Brunenberg (@p_brunenberg) 's Twitter Profile
Philipp Brunenberg

@p_brunenberg

Co-creator of @DayCaptainApp - developer productivity tool with vim-keybindings, clean UI, open API | Developer | Data Engineer

ID: 714896942799712256

linkhttp://philipp-brunenberg.de calendar_today29-03-2016 19:28:08

636 Tweet

446 Followers

160 Following

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

A naive bayes classifier can yield similar results as a neural network. Learn how to implement one in plain Cypher: youtu.be/YbHbsaJhhKM

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

To materialize a physical plan, #ApacheSpark schedules tasks on the workers. In fact, the workers have one single functionality: to execute a task. Let's explore how a physical plan is transformed into schedulable tasks. youtu.be/-iF7_tDtc3k

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Naive Bayes classification is naive because it assumes independence of features. The maths are very simple. However, it works astonishingly well for our recommendation engine. youtu.be/YbHbsaJhhKM

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

How to build a recommendation engine in #neo4j. ⏫ Naive Bayes classification πŸš€ Performs as good as a deep neural network βœ… Just a handful lines of code youtu.be/YbHbsaJhhKM

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

What's the fundamental principle enabling Apache Spark to parallelize workload? I'm creating a video series on the internals of Spark. The first video is a deep dive into Spark's most profound principle: MapReduce. youtu.be/BKecoKKs91A

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

A sampling strategy enables us to find the structural contexts of two nodes in a graph. We can use these contexts to find their similarity - for example using the Jaccard set similarity metric. node2vec uses contexts to map nodes to vectors. youtu.be/pS_POUVFXvk

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Consequently, Spark core’s optimization strategy is limited to pipeline narrow transformations into stages. towardsdatascience.com/the-bigger-pic…

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

I'm creating a free Apache Spark video course on YouTube. πŸš€ βœ… Now pre-knowledge required πŸ’‘ Step-by-step explanations πŸ‹οΈβ€β™€οΈ Hands-on exercises Check out Video 1: Setting up Spark (IntelliJ, SBT, Scala, Spark) youtu.be/4kb5Ju3TJLs #apachespark

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

In part 2️⃣ of the #ApacheSpark with #Scala course, we will use our development setup to run our first Spark application. πŸ’― - Read csv file and explore documentation - Run local standalone Spark (3.5.0 with Java 17) - Explore IntelliJ IDEA, a JetBrains IDE run templates youtu.be/i_HyEzEOW-E?si…

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

🚨 New video in Apache Spark with Scala Course - Video 10: What is Spark? πŸ”¨ By now, we have only talked about how to use Spark. πŸ•΅οΈβ€β™€οΈ Today, we take the first glance at what it actually is and what we use it for ... youtu.be/KT0pkhAXDBw

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 3 βœ… Inspect the schema of a DataFrame πŸ“– Explore Spark SQL data types in the official documentation πŸ”Ž Inferring the schema while reading a csv file youtu.be/as6F8TRmyl8 #ApacheSpark #Scala #Course

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 4 πŸ’‘ What's the Spark SQL API? ❓ How do Datasets and DataFrames relate? πŸ”Ž What's the Spark DSL? youtu.be/jQBYL_zpbhs

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 5️⃣ How can we reference columns in the Spark DSL? When working with the SparkSQL DSL, we mostly would like to do something with the columns of a Dataset. youtu.be/b_WX3QPCUOI

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 6️⃣ ❓ What can we do with Columns in the SparkSQL API? πŸ’‘ Have you ever looked at the Column Class in SparkSQL? βœ… We'll explore the source code and the documentation. youtu.be/9udk6AcgrLU

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 7️⃣ πŸ’‘ There exist more functions to transform columns in Spark SQL. πŸ§šβ€β™‚οΈ They offer a wide world of data transformations ... We'll explore their implementation and documentation. youtu.be/k-UBdSLlEOs

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 8️⃣ πŸš€ There is also a SQL API in Spark ... 🧐 Should we use it or not? πŸ’‘ Which functions can we use in our SQL expressions? youtu.be/3GrfsF_095o

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 9️⃣ Let's plug everything we learned together in our first assignment ... Loading data, inferring schemas, accessing, renaming and transforming columns. youtu.be/nxJHiPIe9SU

Philipp Brunenberg (@p_brunenberg) 's Twitter Profile Photo

Apache Spark with Scala Course - Video 10 πŸ”¨ By now, we have only talked about how to use Spark. πŸ•΅οΈβ€β™€οΈ Today, we take the first glance at what it actually is and what we use it for ... youtu.be/KT0pkhAXDBw