DXC Technology

Python Pyspark Data Engineer

Bengaluru, Karnataka, India

Not SpecifiedCompensation
Mid-level (3 to 4 years), Senior (5 to 8 years)Experience Level
Full TimeJob Type
UnknownVisa
Information Technology, IT ServicesIndustries

Python PySpark Data Engineer

Employment Type: Full-time

Location: Hyderabad / Bangalore / Chennai / Kolkata / Noida / Gurgaon / Pune / Indore / Mumbai


Position Overview

We are seeking a skilled Python PySpark Data Engineer to join our team. The ideal candidate will be proficient in Python and experienced in designing, developing, and maintaining robust data pipelines and ETL workflows. This role requires strong database knowledge, data analysis skills, and familiarity with cloud platforms and big data tools.


Requirements

  • Strong Python Skills: Proficient in Python for data manipulation, automation, and building reusable components.
  • Data Pipeline Development: Experience designing and maintaining ETL/data pipelines using tools like Airflow or custom scripts.
  • Database Knowledge: Hands-on experience with SQL and NoSQL databases such as PostgreSQL, MySQL, and MongoDB.
  • Data Analysis: Skilled in data wrangling and analysis using Pandas, NumPy, and similar libraries.
  • ETL Expertise: Capable of building scalable ETL workflows for large structured and unstructured datasets.
  • Cloud Platform Exposure: Familiarity with AWS, Azure, or GCP for data storage, processing, and deployment.
  • API Integration: Experience consuming and integrating RESTful APIs for data exchange.
  • Big Data Tools (Preferred): Exposure to PySpark, Hadoop, or Kafka for handling large-scale data.
  • CI/CD & DevOps: Proficient with Git, Jenkins, and Docker for code versioning and automated deployment.

Responsibilities

  • Design, develop, and maintain scalable data pipelines and ETL processes.
  • Perform data wrangling and analysis using Python libraries.
  • Integrate and manage data from various SQL and NoSQL databases.
  • Utilize cloud platforms for data storage and processing.
  • Integrate RESTful APIs for data exchange.
  • Implement CI/CD practices for automated deployment.

Company Information

At DXC Technology, we believe strong connections and community are key to our success. Our work model prioritizes in-person collaboration while offering flexibility to support wellbeing, productivity, individual work styles, and life circumstances. We’re committed to fostering an inclusive environment where everyone can thrive.


Recruitment Fraud Alert

Recruitment fraud is a scheme in which fictitious job opportunities are offered to job seekers typically through online services, such as false websites, or through unsolicited emails claiming to be from the company. These emails may request recipients to provide personal information or to make payments as part of their illegitimate recruiting process.

DXC does not make offers of employment via social media networks and DXC never asks for any money or payments from applicants at any point in the recruitment process, nor ask a job seeker to purchase IT or other equipment on our behalf. More information on employment scams is available here.

Skills

Python
PySpark
ETL
SQL
NoSQL
PostgreSQL
MySQL
MongoDB
Pandas
NumPy
AWS
Azure
GCP
RESTful APIs
Hadoop
Kafka
Git
Jenkins
Docker

DXC Technology

IT services for enterprise modernization and management

About DXC Technology

DXC Technology provides IT services to large enterprises, focusing on modernizing their critical systems and operations. The company uses the Enterprise Technology Stack to enhance IT infrastructure, optimize data architectures, and ensure security across various cloud environments, including public, private, and hybrid. DXC operates on a contractual basis, offering consulting, system integration, and managed services to help clients improve their IT operations. What sets DXC apart from competitors is its strong commitment to innovation, sustainability, and corporate responsibility, which has earned it recognition as one of the Most Responsible Companies. The goal of DXC Technology is to be a trusted partner for enterprises, helping them achieve scalable and secure IT solutions while promoting inclusion and diversity within its workforce.

McLean, VirginiaHeadquarters
2017Year Founded
$14.6MTotal Funding
IPOCompany Stage
Consulting, Enterprise SoftwareIndustries
10,001+Employees

Risks

Emerging IT service providers offer cost-effective solutions, threatening DXC's market share.
Rapid technological changes may outpace DXC's innovation, risking service obsolescence.
Economic downturns could reduce IT spending, impacting DXC's long-term contract revenue.

Differentiation

DXC Technology is a Fortune 500 global IT services leader.
The company specializes in modernizing mission-critical systems for large enterprises.
DXC's Enterprise Technology Stack ensures security and scalability across cloud environments.

Upsides

DXC is recognized as a leader in the 2024 Magic Quadrant for Outsourced Digital Workplace Services.
The Quercus AI platform collaboration with Ferrovial and Microsoft enhances DXC's innovation capabilities.
DXC's role in transforming Italy's healthcare sector showcases its expertise in digital transformation.

Land your dream remote job 3x faster with AI