About PredictX
Make a real difference at one of London’s foremost SaaS scale-ups: Be ready to pioneer the future of AI, data analytics, and technology. Step into PredictX, where we don't just see AI as a fashionable bandwagon but have lived and breathed AI & ML in every aspect of our product for the past decade.
As an Enterprise SaaS provider, we're revolutionising critical decision-making for many of the world’s largest businesses, including 3 FAANGs, seeking empowerment through our integrative AI technology and Predictive Analytics.
We pride ourselves on our commitment to staying at the forefront of technological advancements. You'll be joining a team that actively explores and integrates the latest innovations to maintain our competitive edge.
The Role
As a Senior/Expert ML/Data Engineer, you will be at the forefront of our data science and machine learning initiatives, actively contributing to the evolution of our AI-powered solutions. You will be instrumental in designing, building, and maintaining our cutting-edge data infrastructure and machine learning pipelines, with a growing focus on leveraging the power of Large Language Models (LLMs) and other emerging AI technologies.
This role demands a strong blend of data engineering prowess, machine learning understanding (including LLMs), and the ability to translate complex business needs into robust technical solutions. You will be expected to lead projects, mentor junior team members, and drive innovation within our rapidly evolving data and AI landscape. Candidates will be employed on a B2B basis.
Key Responsibilities:
- Design, develop, and maintain scalable and efficient data pipelines using technologies such as Spark, Python, and relevant ETL tools to support our machine learning models, including those leveraging LLMs, and analytical needs.
- Architect and implement robust data warehousing solutions and data models that ensure data quality, integrity, and performance, catering to the specific data requirements of advanced AI models.
- Lead the development, testing, and deployment of machine learning models, including exploration and integration of Large Language Models (LLMs) and other novel AI architectures, collaborating closely with Data Scientists to productionize innovative solutions.
- Engineer approaches for storing, transforming, transporting, synchronising, archiving, and securing large and complex datasets, including unstructured and semi-structured data crucial for training and deploying advanced AI models.
- Participate in the evaluation and testing of new machine learning models and frameworks, including LLMs, to assess their potential and applicability to our products.
- Identify and resolve performance bottlenecks, data quality issues, and other pain points within our data and ML infrastructure. Proactively recommend and implement solutions for optimization and improvement, especially in the context of deploying large-scale AI models.
- Define and govern data modelling and design standards, best practices, and development methodologies within the team, considering the unique challenges and opportunities presented by LLMs and other advanced AI.
- Create and maintain comprehensive technical documentation for data pipelines, data models, and machine learning workflows, including details specific to LLM integration and testing.
- Collaborate effectively with Business Analysts, Data Scientists, and other engineering teams to understand data requirements and deliver impactful data and AI solutions.
- Stay abreast of the latest advancements in data engineering, machine learning (including LLMs and generative AI), and big data technologies, and actively participate in the evaluation and integration of promising new technologies.
- Mentor and guide junior Data Engineers and Data Scientists within the team, sharing knowledge about new AI developments and best practices.