Proficiency in Python (must), SQL; familiarity with NoSQL is a plus
Experience with libraries/frameworks: Scikit-learn, TensorFlow, PyTorch, Hugging Face Transformers, SpaCy, LangChain
Knowledge of big data tools: Spark, Hadoop, Dask or Ray
Experience with cloud platforms
Familiarity with data pipeline tools: Apache Airflow, MLflow
Proficiency in version control & DevOps: Git, Docker; optional Kubernetes, CI/CD tools
Responsibilities
Design and implement anomaly detection systems; develop models to identify unusual patterns in structured and unstructured data; monitor and refine detection systems for accuracy and scalability
Architect and optimize data pipelines; design and execute data strategy, architecture, and management programs aligned with organizational priorities; lead the design of scalable, fault-tolerant data pipelines for batch and streaming data
Develop and optimize machine learning algorithms to enhance model performance, accuracy, and efficiency
Handle large-scale data; ensure efficient data ingestion, transformation, and storage across diverse sources; work with high-volume datasets using distributed computing frameworks; ensure data quality, consistency, and accessibility for downstream analytics
Build and deploy NLP solutions; apply NLP techniques for text classification, entity recognition, summarization, and sentiment analysis; fine-tune transformer-based models (e.g., BERT, GPT) for domain-specific tasks
Develop generative AI applications; work with LLMs and diffusion models to build intelligent assistants, content generators, or summarization tools; optimize prompt engineering and model fine-tuning for performance and relevance
Collaborate across teams; partner with engineering, product, and business teams to translate requirements into data-driven solutions; communicate findings and recommendations clearly to technical and non-technical stakeholders
Collaborate with project stakeholders to identify product and technical requirements; conduct analysis to determine integration needs
Design new software and web applications; support applications under development and customize current applications; develop software update process for existing applications; assist in the roll-out of software releases
Train junior Software Development Engineers on internally developed software applications
Oversee the researching, writing, and editing of documentation and technical requirements, including evaluation plans, test results, technical manuals, and formal recommendations and reports
Keep current with technological developments within the industry; monitor and evaluate competitive applications and products; review literature, patents, and current practices relevant to the solution of assigned projects
Provide technical leadership throughout the design process
Skills
Anomaly Detection
Machine Learning
Data Pipelines
Distributed Computing
Data Ingestion
Data Transformation
Streaming Data
Batch Processing
Scalable Architecture
Fault-Tolerant Systems
Comcast
Comcast Corporation is a global media and technology company.