Josiah Berkebile

Josiah Berkebile

Mentor
Rising Codementor
US$10.00
For every 15 mins
5
Sessions/Jobs
free badge
First 15 mins free for your first session
ABOUT ME
Big Data Cloud Engineer
Big Data Cloud Engineer

I have specialized in Big Data technologies like Apache Spark, Flume, Flink, HBase, HDFS, Hive LLAP, Impala, Athena, EMR, DynamoDB, Airflow, etc. This career has lead me into developing applications that implement Machine Learning models, predictive algorithms, NLP algorithms, and ingest large datasets. I'm very well versed in concurrent and parallel programming and am really good with both Object Oriented as well as Functional programming approaches.

I really love teaching people and sharing my knowledge to the degree that I'm transitioning into making teaching and mentorship my full-time and primary job. I promise that in the time that I spend mentoring you, I will pour into you as much of my knowledge as I can to give you the best chance possible in the industry.

English
Central Time (US & Canada) (-05:00)
Joined October 2016
EXPERTISE
4 years experience
I worked in a fast-paced, goal-oriented Ruby on Rails Agile web app development team at Healthcare IT company for 6-9 months, and then si...
I worked in a fast-paced, goal-oriented Ruby on Rails Agile web app development team at Healthcare IT company for 6-9 months, and then single-handedly maintained a large Ruby on Rails app for a large financial institution for about 2 years. I've also done a few hackathons that involved building a web app for a non-profit in less than 24 hours. For 2 of the 3 hackathons, I organized and lead the team. Here's the end result of some of those challenges: http://principalsconnect.com/ http://adoption.kvc.org/
Software architectsRvmRuby on Rails
View more
8 years experience
I have 8 years of extensive experience writing Java applications that run in a Hadoop environment. I've developed data pipelines in Apac...
I have 8 years of extensive experience writing Java applications that run in a Hadoop environment. I've developed data pipelines in Apache Crunch, created custom Flume Clients, and also Flume Sources that plugin to the running Flume Agents per the Flume API, and written MapReduce algorithms that process data in the petabytes scale. Some of these Java applications ran as services and used Jersey to expose a web API and Solr to index data sitting in NoSQL storage. Spring is a bit of a heavy-weight. As a result, the various Hadoop engineering teams I've been on have chosen Dagger, rather than Spring, as the dependency injection framework for our applications since it's faster and more lightweight.
6 years experience
I have six (6) years of experience using Scala in Hadoop and data engineering environments. I've written a LOT of Scala. Some of the Fl...
I have six (6) years of experience using Scala in Hadoop and data engineering environments. I've written a LOT of Scala. Some of the Flume plugins I've written have been written in Scala. I've written a Flume Client that used Scalatra to listen for events using Webhooks and then distributed them to an array of Flume Clients using the Akka concurrency framework. I've extended and refactored a Scala Akka web scraper. I've leveraged ScalaTest and ScalaCheck in practically all of my Scala projects. In my projects as a Scala consultant a year ago I've leveraged the Typelevel stack to develop streaming applications using a pure functional approach. I've also written a few Spark applications in Scala. Some of my smaller projects have involved implementing a PageRank and Collaborative Filtering algorithms, and the most ambitious project I've done on Spark was an NLP classifier. I have a decent amount of experience architecting, profiling, and tuning Spark applications. All of the applications I've written in Scala have been highly concurrent, some of them have leveraged Software Transactional Memory libraries like ScalaSTM to simplify multi-threaded interactions with shared memory.
Cats EffectScalastmScalacheck
View more
8 years experience
Python is a versatile language, and my experience in it is also very versatile. I've used Python to do data exploration using NumPy and ...
Python is a versatile language, and my experience in it is also very versatile. I've used Python to do data exploration using NumPy and Pandas, I've taken Python code from data scientists and translated it into PySpark or into Scala Spark applications, I've written some systems automation scripts in Python, and written scripts for moving or ingesting data between databases and systems on the petabytes scale. I have about equal experience using legacy Python 2 as well as Python 3 in production environments.
2 years experience
I used Haskell as my vehicle for learning Functional Programming. Since Haskell is a pure functional language, it would not let me fall ...
I used Haskell as my vehicle for learning Functional Programming. Since Haskell is a pure functional language, it would not let me fall back on my old procedural object-oriented habits. I succeeded in learning enough Haskell on my own to become productive in the language. This learning enabled me to pick-up Scala more easily than most other members of the engineering teams I've been a part of. If you are learning Functional Programming or just getting started with Haskell, I can certainly get you to a level where you will be productive in the language.
2 years experience
6 years experience

REVIEWS FROM CLIENTS

Josiah's profile has been carefully vetted and approved as a Codementor. Connect with Josiah now, and leave a review for them once you're done!
EMPLOYMENTS
Sr Big Data Engineer
FINRA
2023-07-01-Present
Working in FINRA's big data processing environment. I cannot discuss which languages or technologies I use, here.
Working in FINRA's big data processing environment. I cannot discuss which languages or technologies I use, here.
Big Data
Big Data
Senior Scala Engineer
47 Degrees
2021-07-01-2023-07-01
Mentored Jr. Developers Developed and maintained a streaming categorization application for Jack Henry using a pure functional approach ...
Mentored Jr. Developers Developed and maintained a streaming categorization application for Jack Henry using a pure functional approach via the Typelevel stack in Scala.
Scala
Docker
Kubernetes
View more
Scala
Docker
Kubernetes
AWS Kinesis
Helm
Doobie
Cats Effect
View more
Information Architect II
Blue Cross Blue Shield of Kansas City
2020-05-01-2021-07-01
Lead the company shift to modern cloud-based data management from classical on-premises solutions. This involved the introduction of Kube...
Lead the company shift to modern cloud-based data management from classical on-premises solutions. This involved the introduction of Kubernetes, Helm, and Terraform to automate and codify deployment environments, the containerization of Apache Airflow for deployment to AKS, and coordinating the conversations and interactions necessary between the Ops, Security, and Engineering/Development teams to facilitate the shift.
Docker
Kubernetes
Terraform
View more
Docker
Kubernetes
Terraform
Agile
Snowflake
Apache Airflow
Azure Functions
Helm
Prisma
Azure Data Factory
View more
PROJECTS
Overnight Website ChallengeView Project
KVC Health Systems, The Nerdery
2014
Built a new website for KVC Health Systems in 24 hours.
Built a new website for KVC Health Systems in 24 hours.
HTML/CSS
Ruby on Rails
PostgreSQL
View more
HTML/CSS
Ruby on Rails
PostgreSQL
Heroku
JavaScript
View more
Overnight Website ChallengeView Project
PrincipalsConnect
2017
Built a website in 24 hours for PrincipalsConnect
Built a website in 24 hours for PrincipalsConnect
HTML/CSS
Ruby on Rails
PostgreSQL
View more
HTML/CSS
Ruby on Rails
PostgreSQL
Heroku
Continuous Integration
Docker
React
JavaScript
Continuous Deployment
Redux
View more