Installing pandas 1.3.3 on EMR 6.3.0 — Part 2So if you have read Part 1 of this series, lets try to automate what we did there in this part. Should be easy right? Based on the…Sep 28, 2021Sep 28, 2021
Installing pandas 1.3.3 on EMR 6.3.0 — Part 1I was playing around with Sagemaker notebook and EMR functionality recently. I was looking for some ready made cloudformation examples…Sep 27, 20211Sep 27, 20211
Published inlazyycoderInstalling python 3, pip and boto3 on centos dockerHere are the steps to install python 3 on docker.Jan 19, 2021Jan 19, 2021
Speeding up S3 processing — Part 3As I mentioned in part 2 of this story, the async-parallel version is actually not really async. Really the speedup is more due to using…Nov 13, 2020Nov 13, 2020
Speeding up S3 processing — Part 2In Part 1 of this story , I shared with you how I managed to get a nice speedup of s3 processing (with a running example of s3 copy) using…Nov 12, 2020Nov 12, 2020
Speeding up S3 object processing — Part 1AWS S3 is the go to cloud based object storage widely used in the industry today. But as you become a power user of s3, some of the…Nov 6, 2020Nov 6, 2020
Event driven Context Free Grammar (CFG) parsingAs a software engineer and a semi regular computer science guy, I like to take various software challenges that get presented at work and…Nov 3, 2020Nov 3, 2020
Practicing Scala — Writing a basic boolean expression parserI happened to research on how arbitrary boolean expressions can be parsed at runtime to compute true / false values. Since I wanted to…Jun 11, 2020Jun 11, 2020
Generating Session ids for user event ids — Part 2In part 1 I discussed the use case of assigning session id to closely occurring events generated by a particular user’s activity. We…May 8, 2020May 8, 2020