PinnedSiddhesh KinTowards DevCaching(): A common mistake made by the data peopleWHAT IS CACHING()?Jul 22, 2022Jul 22, 2022
PinnedSiddhesh KGaming Fair: Unraveling the Importance of Skill-Based Matchmaking in Esports with Apache FlinkA Deep Dive into the Pros and Cons, and a Fun Apache Flink Repository for Solutions.Jan 30Jan 30
PinnedSiddhesh KA great example of inefficient code would be using collect() (using when not needed and also…Another scenario could be loading a huge unsplittable file format which could bring up the load on a single executor and will definitely…Sep 30, 2023Sep 30, 2023
PinnedSiddhesh KHow K.F.C can eat your data one day…Data Quality Framework using Kafka, Flink, and Cassandra.Jan 14, 2022Jan 14, 2022
Siddhesh KAfter you cache, you can also use Spark UI -> Storage section to check how many partitions are…Apr 19Apr 19
Siddhesh KThere are several points that should have been included in your article:1) Instead of increasing the values for cores, memory and so on, you should have focused on calculating the minimal number of executors…Jan 31Jan 31
Siddhesh KAlthough this post is old enough, Kubernetes operator solves this problem with the latest Flink…Nov 8, 2023Nov 8, 2023
Siddhesh KI would have really loved it if you had talked something about design patterns.Yes, refactoring would help but if a person doesn't have the knowledge of the design patterns, the person won’t be able to refactor the…Sep 9, 2023Sep 9, 2023
Siddhesh KNow, Athena does support ACID transactions its just that you have to use table_type="Iceberg" and…Sep 7, 20231Sep 7, 20231