Data Engineering
Design and operation of ETL pipelines in Python, Java and SQL. Relational modelling, query optimization and spatial indexing in PostgreSQL/PostGIS. Layered raw/staging/production architecture with full governance and traceability.
◎ Available · Sanlúcar la Mayor, Andalusia, Spain
I'm Jean Michael, a Data Engineer with over 10 years building ETL pipelines, geospatial systems and automation workflows in production. I write about what I build and learn: data management, GIS, analysis, systems development and engineering best practices.
Languages, libraries, GIS tools and technologies I use in real projects.
I combine data engineering, geospatial systems, automation and backend development to build solutions where data has traceability and purpose.
Design and operation of ETL pipelines in Python, Java and SQL. Relational modelling, query optimization and spatial indexing in PostgreSQL/PostGIS. Layered raw/staging/production architecture with full governance and traceability.
Territorial analysis, cartographic production and topological validation with QGIS, Google Earth Engine and FME Safe. PyQGIS plugin development and integration of heterogeneous sources with GDAL, PostGIS and GeoPandas.
REST APIs with FastAPI and Spring Boot, container orchestration with Docker and Kubernetes, workflow automation with n8n. Systems designed for maintainability, scalability and operational integrity.
Over 10 years building production data platforms, geospatial systems and territorial analysis tools in demanding environments.
Exército Brasileiro ·
Design and operation of ETL pipelines in Python and SQL on high-demand production platforms. Relational modelling and optimization of PostgreSQL/PostGIS databases with spatial indexing. Implementation of a layered architecture (raw → staging → production) that reduced integration errors and improved per-record traceability. Development of backend services with stable, versioned and auditable data contracts.
Engprat · Contract · Rio de Janeiro, hybrid
Designed a relational schema in PostgreSQL/PostGIS from scratch to consolidate field data from thousands of properties and residents. Python ETL in raw/staging/production layers, automated spatial validations (boundary overlaps, attribute coherence), territorial density analysis via neighborhood analysis, and results exposed through a REST API consumed by external systems.
Exército Brasileiro
Design and implementation of automated validation workflows in Python and PostgreSQL. Definition of formal spatial rules, systemic topological consistency controls and structured error logging at every pipeline stage. The system enabled scaling cartographic production while maintaining verifiable technical standards and reducing rework.
COBE Engenharia e Geotecnia · Freelance · São Paulo
Processing and integration of bathymetric sensor data for the Porto Primavera Hydroelectric Plant. Digital Elevation Model generation, volume calculation per elevation and hydraulic flow analysis. Full pipeline with Python, SQL and FME Safe for conversion, transformation and standardization of heterogeneous spatial datasets.
Exército Brasileiro
Logical and physical design of PostgreSQL/PostGIS databases for complex geospatial data. Spatial query optimization with GiST/SP-GiST indexing. ETL process development with FME Safe and Python for integration of heterogeneous sources — vector files, rasters, spreadsheets and geospatial web services.
A selection of systems and tools representing my hands-on experience in data engineering, geospatial analysis and automation.
Territorial analysis system integrating geospatial data to support public policy decisions.
Capacity, scheduling, production and indicator management system with a focus on operational traceability.
PyQGIS tools for spatial validation, production control and productivity in geospatial workflows.
Technical notes on automation, data management, geospatial analysis, systems development and software engineering best practices.
How to divide rasters and vectors into meaningful regions using SLIC, DBSCAN and PostGIS — with code examples in Python.
From Pods and Services to Deployments and scaling — a practical guide with real YAML manifests and Minikube examples.
Why a well-designed pipeline needs clear rules, validation and traceability from source to final product.
Available for opportunities in Spain — data engineering, geospatial data, backend, ETL and automation.