PinnedPublished inTowards DevCaching(): A common mistake made by the data peopleWHAT IS CACHING()?Jul 22, 2022Jul 22, 2022
PinnedGaming Fair: Unraveling the Importance of Skill-Based Matchmaking in Esports with Apache FlinkA Deep Dive into the Pros and Cons, and a Fun Apache Flink Repository for Solutions.Jan 30Jan 30
PinnedA great example of inefficient code would be using collect() (using when not needed and also…Another scenario could be loading a huge unsplittable file format which could bring up the load on a single executor and will definitely…Sep 30, 2023Sep 30, 2023
PinnedHow K.F.C can eat your data one day…Data Quality Framework using Kafka, Flink, and Cassandra.Jan 14, 2022Jan 14, 2022
After you cache, you can also use Spark UI -> Storage section to check how many partitions are…Apr 19Apr 19
There are several points that should have been included in your article:1) Instead of increasing the values for cores, memory and so on, you should have focused on calculating the minimal number of executors…Jan 31Jan 31
Although this post is old enough, Kubernetes operator solves this problem with the latest Flink…Nov 8, 2023Nov 8, 2023
I would have really loved it if you had talked something about design patterns.Yes, refactoring would help but if a person doesn't have the knowledge of the design patterns, the person won’t be able to refactor the…Sep 9, 2023Sep 9, 2023
Now, Athena does support ACID transactions its just that you have to use table_type="Iceberg" and…Sep 7, 20231Sep 7, 20231