Vector DB Engineer – Data Scientist at Caterpillar Inc.

Bengaluru, Karnataka, India

Compensation: Not Specified
Experience Level: Mid-level (3 to 4 years), Senior (5 to 8 years)
Job Type: Full Time
Visa: Unknown
Industries: Technology, Manufacturing

Requirements

  • Deep understanding and hands-on experience with vector databases, including their architecture, query languages, and optimization techniques
  • Strong programming skills in languages such as Python, C++, or Java, with experience in developing and optimizing database operations
  • Solid understanding of data structures, algorithms, and computational geometry, particularly related to vector search and similarity measures
  • Experience with cloud platforms (e.g., AWS, GCP, Azure) and managed database services
  • Understanding of machine learning concepts, particularly those related to embedding vectors and similarity searches
  • Strong problem-solving skills with a focus on performance optimization and scalability
  • Excellent communication skills, with the ability to articulate complex technical concepts to non-technical stakeholders
  • Ability to work a 5-day-a-week schedule in the office (preference: Bangalore – Caterpillar PSN)
  • Shift timing: 1:00 PM to 10:00 PM IST
  • Knowledge of business statistics, statistical tools, processes, and practices (desired)

Responsibilities

  • Design, implement, and manage vector databases to support large-scale data storage and retrieval, ensuring low latency and high availability
  • Develop efficient data models that facilitate fast vector operations such as similarity search, nearest neighbor search, and other vector-based queries
  • Optimize database performance through indexing, partitioning, sharding, and other techniques to handle large-scale datasets
  • Integrate vector databases with existing systems and applications, ensuring seamless data flow and accessibility
  • Design and implement solutions that scale with growing data volumes, ensuring the database infrastructure can handle increased load and complexity
  • Implement security best practices to protect data at rest and in transit, including encryption, access controls, and audit logging
  • Monitor database performance and troubleshoot issues as they arise, ensuring system reliability and availability
  • Work closely with data scientists, machine learning engineers, and software developers to understand their needs and provide database solutions that meet their requirements
  • Maintain comprehensive documentation for database schemas, configurations, and procedures to support operational excellence and knowledge sharing

Skills

Vector Databases
Data Modeling
Similarity Search
Nearest Neighbor Search
Database Indexing
Data Partitioning
Low Latency Optimization
High Availability
Machine Learning
Data Science

Caterpillar Inc.

Manufactures heavy machinery for various industries

About Caterpillar Inc.

Caterpillar Inc. designs and manufactures heavy machinery and equipment for industries such as construction, mining, energy, and rail. Its products include a wide range of machinery and engines that help clients complete large-scale projects by performing tasks such as digging, lifting, and transporting materials. Caterpillar sets itself apart from competitors through strong aftermarket support, including maintenance and repair services that keep its machinery efficient and reliable over time. The company aims to deliver high-quality products while also focusing on sustainability and community development through initiatives that improve education and reduce poverty.

Headquarters: Irving, Texas
Year Founded: 1925
Total Funding: $143.5K
Company Stage: IPO
Industries: Industrial & Manufacturing, Social Impact, AI & Machine Learning
Employees: 10,001+

Benefits

Annual incentive bonus plan
Medical, dental, and vision coverage
Paid time off plan (vacation, holiday, volunteer, etc.)
401(k) savings plan
Health savings account (HSA)
Flexible spending accounts (FSAs)
Disability benefits
Life Insurance
Parental leave
Healthy Lifestyle Programs
Employee Assistance Programs
Voluntary Benefits and Employee Discounts
Tuition Reimbursement
Career Development

Risks

Closure of Aurora office may impact regional economy and Caterpillar's reputation.
Partnership with Anti Social Social Club could dilute Caterpillar's industrial brand focus.
Advanced technology in Cat D8 dozer may face resistance from traditional customers.

Differentiation

Caterpillar's century-long history underscores its reliability and industry leadership.
The company integrates AI and IoT to enhance machinery performance and customer satisfaction.
Caterpillar's strong aftermarket services ensure product longevity and operational efficiency.

Upsides

Growing demand for autonomous equipment boosts Caterpillar's innovation in heavy machinery.
Expansion of 5G networks enhances Caterpillar's remote operation capabilities, improving safety.
Caterpillar's commitment to sustainability aligns with the global shift towards eco-friendly practices.
