Senior Artificial Intelligence Engineer – Voice & LLMs

  • Paris, 75014

  • Permanent contract (CDI)

  • 13/10/2025

Description

At Vocads, we’re revolutionizing customer interaction through our no-code AI voice agents. Our technology automates inbound and outbound calls, boosts team productivity, and delivers a seamless 24/7 customer experience.

We are a fast-growing startup, accelerated by Station F and Microsoft GenAI Studio, already adopted by ambitious companies and currently expanding into the United States.


Missions

LLM Development and Fine-Tuning

  • Fine-tuning large language models (OpenAI, Anthropic, Meta) for natural and contextual voice interactions.

  • Designing NLP pipelines tailored to real-time dialogue and production constraints.

  • Customizing models for specific use cases, optimizing for accuracy, consistency, and performance.

  • Monitoring the field continuously to stay up to date with the latest AI advancements.

Advanced Prompting & Real-Time AI Content Generation

  • Creating and optimizing prompts for precise and contextually appropriate responses.

  • Implementing prompt chaining strategies and few-shot learning when needed.

Python Development, APIs & Deployment

  • Building and maintaining scripts, notebooks, and APIs to integrate models into our voice applications.

  • Using modern Python tools such as Pydantic for data validation and structuring, and Instructor for structured, schema-validated LLM outputs.

  • Optimizing algorithms and models to enhance accuracy and efficiency.

  • Deploying models and pipelines in production, ensuring performance, scalability, and reliability.

  • Collaborating with development teams to integrate AI models into existing products and services.

  • Working with backend and DevOps teams to ensure continuous monitoring, maintenance, and optimization of models.

Performance and Latency Optimization

  • Analyzing and improving the response times of AI models and pipelines for voice interactions.

  • Profiling and tuning models for smooth real-time operation on LiveKit, Twilio, and other voice platforms.

Integration of External Features and Data Sources

  • Designing pipelines that allow voice agents to leverage various features or data sources: vector databases, web search, SMS sending, third-party API integrations, etc.

  • Developing modular and scalable solutions to enrich model responses based on context and business needs.

  • Managing consistency, latency, and security when accessing external data or performing AI-triggered actions.

Real-Time Machine Learning & NLP

  • Training, deploying, and optimizing ML and NLP models for low-latency systems.

  • Monitoring model performance and continuously improving production pipelines.

Profile

  • Degree in Computer Science, Applied Mathematics, AI, or a related field.

  • Minimum of 5 years of experience in AI, ML, or NLP, with a strong track record in production and model deployment.

  • Expertise in Python and ML/NLP libraries (PyTorch, TensorFlow, HuggingFace, LangChain, etc.), as well as modern tools (Pydantic, Instructor).

  • Proven experience in LLM fine-tuning and prompt engineering.

  • Knowledge of pipelines integrating external functionalities (RAG, APIs, third-party tools, web scraping, SMS, etc.).

  • Experience with real-time or interactive systems (latency-critical, performance optimization).

  • Ability to design, deploy, and maintain models in production.

  • Autonomy, rigor, and a passion for tackling complex technical challenges.

Additional Assets

  • Experience with voice platforms (LiveKit, Twilio, WebRTC).

  • Experience with MLOps and CI/CD pipelines for ML workflows.

  • Familiarity with deployment frameworks (FastAPI, BentoML, TorchServe, etc.).

  • Knowledge of cloud environments (GCP).