Artificial Analysis is an independent AI benchmarking and insights provider. Our benchmarks help engineers and companies understand AI and make informed decisions on AI technologies.
We are hiring for two roles:
1. Full Stack Engineer: Full Stack Engineer to support our benchmarking of AI and with communicating these benchmarks to our users. Proficiency in Typescript & Python required. Familiarity with LLM APIs preferred.
Tech stack: Javascript/Typescript, Node.js, React/Next.js, Python.
2. ML Engineer: ML Engineer to support our benchmarking and evaluation of AI software stack. You will design and run benchmarks and evaluations of different AI models.
Strong analytical skills and proficiency in Python required.
Apply at hiring (-at-) artificialanalysis.ai with your resume, github and dot points on relevant experience.
Artificial Analysis is an independent AI benchmarking and insights provider. Our benchmarks help engineers and companies understand AI and make informed decisions on AI technologies and providers.
We are hiring for two roles:
1. Full Stack Engineer: Full Stack Engineer to support our benchmarking of AI and with communicating these benchmarks to our users. Proficiency in Typescript & Python required. Familiarity with LLM APIs preferred.
Tech stack: Javascript/Typescript, Node.js, React/Next.js, Python.
2. ML Engineer: ML Engineer to support our benchmarking and evaluation of AI software stack. You will design and run benchmarks and evaluations of different AI models.
Strong analytical skills and proficiency in Python required.
Apply at hiring (-at-) artificialanalysis.ai with your resume, github and dot points on relevant experience.
The tables are very similar - though you've added a custom calculator which is a nice touch.
Also for the Versus Comparison, it might be nice to have a checkbox that when clicked highlights the superlative fields of each LLM at a glance.
This page has up to date information of all models and providers: https://artificialanalysis.ai/leaderboards/providers We also on other pages cover Speech to Text, Text to Speech, Text to Image, Text to Video.
Note I'm one of the creators of Artificial Analysis.
Artificial Analysis is an independent benchmarking, evaluation and insights provider for AI. Our benchmarks let engineers and companies make the best decisions on which technologies and providers to use, empowering them to build the next generation of AI applications.
We are looking for a full stack developer to support us in our analysis of AI and presenting it to the world at https://artificialanalysis.ai/.
Strong analytical skills and proficiency in Typescript & Python required. Familiarity with LLMs and AI scaling laws preferred.
Tech stack: Javascript/Typescript, Node.js, React/Next.js, Python.
Apply here: https://artificialanalysis.ai/careers
We're seeking a Senior AI Research Analyst to support with benchmarking and evaluation of AI. Role involves analyzing AI systems, visualizing data, and supporting people in understanding the capabilities of AI (across different modalities).
Strong analytical skills, AI/ML research experience, and proficiency in Python/data analysis required (Typescript a nice to have). Familiarity with LLMs and AI scaling laws preferred.
Apply here: https://artificialanalysis.ai/careers
We also have pricing, long/medium/short prompt lengths (decode time can vary between providers) & parallel query benchmarking + model details (ctx window, etc)
Artificial Analysis is an independent AI benchmarking and insights provider. We benchmark AI to help engineers and companies understand AI and make informed decisions regarding which AI technologies to use. We are fast growing with a team of 20 and have backing from investors including Nat Friedman, Daniel Gross & Andrew Ng.
We are hiring for four roles: 1. Full Stack Engineer: Full Stack Engineer to support our benchmarking of AI and with communicating these benchmarks to our users. Proficiency in Typescript & Python required. Familiarity with LLM APIs preferred. Tech stack: Javascript/Typescript, Node.js, React/Next.js, Python.
2. ML Engineer: ML Engineer to support our benchmarking and evaluation of AI software stack. You will design and run benchmarks and evaluations of different AI models. Strong analytical skills and proficiency in Python required.
3. Member of Technical Staff: You will design evaluations of different AI models and work to translate technical insights to analysis that helps companies navigate AI.
4. Product Manager (AI Media Generation): You'll work closely with us to develop and enhance our media generation (Image, Video, Speech, Music) arenas and leaderboards, contributing to product strategy and execution at the intersection of creativity and AI.
Apply at hiring (-at-) artificialanalysis.ai with your resume, github and dot points on relevant experience (including anything you've built). Add | HackerNews to email subject line.