Posts
Maximizing Efficiency: A Guide to Optimizing Large Language Model (LLM) Inference with AWS Inferentia2
Understanding AWS Inferentia2 AWS Inferentia2 is a custom-built ML inference chip designed to deliver high performance at low cost. It supports all major ML frameworks, including TensorFlow, PyTorch, and MXNet, and offers flexible options for instances and type of processors (CPUs or GPUs). This flexibility makes it an excellent choice for running LLM inference tasks.
The Inf2 instance fundamentally integrates up to 12 Inferentia2 devices, with each Inferentia2 unit consisting of 2 NeuronCore-v2.
Posts
Harnessing the Power of SageMaker Pipeline: Ten Benefits That Can't Be Ignored
In an era where businesses are driven by data, maintaining a streamlined and efficient machine learning (ML) workflow is paramount. As a Solutions Architect, desinging efficient MLOps solutions for ease of model training, deployments and inference testing is critical. This is where Amazon SageMaker Pipeline comes into play. It serves as a lifesaver, enabling data engineers to automate, manage, and scale ML workflows with ease and efficiency.
Let’s deep dive into the multiple benefits that Amazon SageMaker Pipeline brings to the table.
Posts
From Sargeant to Solutions Architect: How I Used Military Training to Land My First Tech Job
Changing a new career can be intimidating and very stressful… Stationed at 2-7 Infantry Battalion, 1st Armored Brigade Combat Team, 3rd Infantry Division, Fort Stewart GA, my pride and joy was my title of being an Army Infantryman. Serving to directly protect my country, meeting people from around the world, and getting paid to workout and shoot guns….It was heaven. I had purpose, my mission was clear and I was good at my job.
Posts
Like Ogres and Onions, Security Has Layers: AWS Services That Will Help You Achieve Defense in Depth
What is Defense in Depth and why is it important? I am so glad you asked! Defense in Depth (Layered Security) is the critical concept of achieving a thorough security posture by implementing protection at multiple resource levels. Think of the White House for example…There are multiple components in place securing the building and the personnel inside. Would it be sufficient to only have a lock on the front door?