Born in Brazil and relocated to the US at age 14, I am a Site Reliability Engineer with over 10 years of experience. Skilled in both startup and established corporate environments, I am always thrilled to take on new challenges. Known for my excellent communication skills, I thrive as both a team player and independent contributor.
Leading the development and implementation of strategies for Kubernetes reliability and scalability, including multi-cluster deployments, the adoption of node-problem-detector for automatic node issue remediation addressing critical incidents, implementation of node-local-dns-cache to address DNS issues and the introduction of multi-instance types per node group to circumvent AWS resource limits.
Spearheaded IPv6 compatibility for airtable.com, ensuring compliance with customer demands.
Implemented a Kubernetes upgrade process standardizing the upgrade path and cadence.
Designing and automating Kubernetes upgrade testing suite, reducing the risk of system incidents and ensuring smooth version transitions.
Tesla
Site Reliability Engineer, FleetNet Team
Contributed to upstream Concourse CI developments to support multi-architecture Docker image builds, facilitating broader application deployment capabilities.
Successfully migrated a critical application from a VM to Kubernetes, which is essential for the cryptographic signing of software updates dispatched to all Tesla vehicles. This migration significantly improved deployment efficiency and security.
Participated in a team effort for a major Kubernetes cluster upgrade and transitioned to new Container Network Interfaces (CNIs), contributing to enhanced network performance and reliability.
Played a pivotal role in preparing Kubernetes clusters for migration to ARM architecture, with the strategic objective of reducing operational costs.
Pinterest
Site Reliability Engineer
Worked with a team to create the infrastructure for migrating most stateless services to Docker, enhancing service scalability and deployment efficiency.
Built and maintained a Kubernetes infrastructure, significantly reducing operational burdens for developers and ensuring a stable, up-to-date platform.
Implemented continuous end-to-end integration testing for Kubernetes, guaranteeing system reliability during updates.
Designed and implemented chaos engineering strategies to validate application resilience and compliance with Kubernetes standards.
Contributed as a major owner to the Kops OSS project within the Kubernetes organization, aligning project goals with Pinterest's needs and fostering community engagement.
Facilitated the onboarding process for new hires through interviews, mentorship, and training.
Pinterest
Site Reliability Engineer ( Contract )
Supported development teams and assisted in migration to a new deployment tooling.
Hive
DevOps Engineer
Managed scaling operations for multiple applications, achieving rapid growth in user base and daily active users.
Developed and deployed a CI/CD infrastructure, doubling the previous deployment rate and enhancing release testing capabilities.
Oversaw a Docker-centric environment, leading efforts in configuration, deployment, and automation with Chef.
Transitioned AWS hosted environments to an in-house data center, optimizing infrastructure efficiency.
LiveVox
Linux Systems Engineer
Supported development and QA teams, interfacing directly with customers when required.
Spearheaded virtualization initiatives, enhancing system scalability and reliability.
Managed AWS infrastructure across 2 different regions
Cogent Solutions & Supplies
Systems Administrator
Implemented comprehensive monitoring solutions.
Virtualized entire infrastructure improving systems reliability and flexibility.
Managed Windows and Linux infrastructure across 3 locations.