Applied Scientist (AI Evaluation) - #1776361

Trismik


Date: 16 hours ago
City: Bradford
Contract type: Full time
Work schedule: Full day
Trismik

Why join us? At Trismik we're a team of tech geeks from the University of Cambridge, Salesforce, and Amazon looking to push the boundaries of AI through science-led evaluations. If you're ambitious to make a difference to the future of AI, have a PhD in NLP, and like to turn ideas into reality, we'd love to hear from you.


RoleWe are developing adversarial tests for Large Language Models (LLMs). We are looking for a passionate, talented, and innovative applied scientist with a strong background in algorithms to help build an industry-leading evaluation engine for LLMs and help us bring this to market.


As one of our first hires this is a high-impact and high-ownership role. You will have a strong say in how our science and product roadmap evolves. Our mission is to provide the fastest and most accurate testing environment to add value to AI engineers wishing to deploy AI applications using LLMs.


We do this to push forward the SoTA in Artificial Intelligence and to achieve the best possible chance of a human-aligned AGI.Key job responsibilities



As an Applied Scientist in a startup you will have a key role in our team. You will work with the Chief Scientist to design, develop and deploy evaluation technologies that will create high value insights for AI engineers. These will involve providing assessments for several technologies that involve


Large Language Models, including retrieval augmented systems (RAGs), recommender systems, and agentive systems. You will:Invent, experiment with, and launch new features, products and systems based on machine learning and MLLM.Perform hands-on construction and analysis of large-scale multi-modal datasets to be deployed as part of our product offering. Build algorithms, perform offline and A/B test experiments, optimise and deploy your models into production, working closely with software engineers.


Establish automated processes for large-scale data analysis and generation, machine-learning model development, model validation and serving. Communicate results and insights to both technical and non-technical audiences, including through presentations and written reports, and publish your work for internal and external audiences, e.g. through white papers and conference papers. Collaborate with engineers, product partners locally and abroad, and join conversations with customers led by our marketing team.


Essential skillsPhD in Computer Science or related field (Natural Language Processing focus only)Experience in state-of-the-art deep learning models architecture design and deep learning training and optimization and model pruning


Experience with LLM benchmarking

Strong programming skills in Python, Java, C++, or a related language

Strong skills in algorithm development to solve optimisation problems

Familiarity with core tools for a typical ML-focused work environment (e.g. Git, IDE, common libraries, HuggingFace).Ability to work as part of a team

Be comfortable with a fast-paced, high-risk, and a high reward environment

Desirable skills

Experience creating data sets

Experience coordinate development of products with multiple stakeholders

Publications in top-tier peer-reviewed journals and conferences (e.g. NeurIPS, EMNLP, ACL)Experience deploying solutions to AWS or other cloud platforms

Excellent communication skills, solid work ethic, a strong desire to write production-quality code

Our offer

Competitive salary, bonus, and share options in our startup;Company issued laptop, workstation, and tech allowance;Flexible/remote work environment. We are based in Cambridge/London but open to outstanding remote work candidates based in the UK or EU. We offer travel allowance to enable meet ups;ContactFor informal enquiries please contact *****@trismik.com.

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

Field Service Engineer - Bradford

Brsk,
10 hours ago
Field Service Engineer – Telecommunications About The Role We are looking for knowledgeable and customer-focused Wi-Fi Engineers to join our team. As a Wi-Fi Engineer, you will be responsible for the installation, configuration, maintenance, and troubleshooting of Wi-Fi networks for...
Brsk

Sales Engineer-EMEA

JumpCloud,
17 hours ago
All roles at JumpCloud are Remote unless otherwise specified in the Job Description. About JumpCloudJumpCloud’s mission is to Make Work Happen, providing simple, secure access to corporate technology resources from any device, or any location. The JumpCloud Directory Platform gives...

Junior Security Operations Center Analyst

Ventula Consulting,
17 hours ago
Junior SOC Analyst – Infrastructure - Hull - £35,000One of the UK’s leading infrastructure clients now requires a Junior SOC Analyst to help drive robust cyber and infrastructure security across their organisation.The Information Security Systems Engineer will work across multiple...