PinnedPublished inTowards DevCaching(): A common mistake made by the data peopleWHAT IS CACHING()?Jul 22, 202218Jul 22, 202218
PinnedGaming Fair: Unraveling the Importance of Skill-Based Matchmaking in Esports with Apache FlinkA Deep Dive into the Pros and Cons, and a Fun Apache Flink Repository for Solutions.Jan 30, 2024Jan 30, 2024
PinnedA great example of inefficient code would be using collect() (using when not needed and also…Another scenario could be loading a huge unsplittable file format which could bring up the load on a single executor and will definitely…Sep 30, 20232Sep 30, 20232
PinnedHow K.F.C can eat your data one day…Data Quality Framework using Kafka, Flink, and Cassandra.Jan 14, 20227Jan 14, 20227
After you cache, you can also use Spark UI -> Storage section to check how many partitions are…Apr 19, 2024Apr 19, 2024
There are several points that should have been included in your article:1) Instead of increasing the values for cores, memory and so on, you should have focused on calculating the minimal number of executors…Jan 31, 2024Jan 31, 2024
Although this post is old enough, Kubernetes operator solves this problem with the latest Flink…Nov 8, 2023Nov 8, 2023
I would have really loved it if you had talked something about design patterns.Yes, refactoring would help but if a person doesn't have the knowledge of the design patterns, the person won’t be able to refactor the…Sep 9, 2023Sep 9, 2023
Now, Athena does support ACID transactions its just that you have to use table_type="Iceberg" and…Sep 7, 20231Sep 7, 20231