It’s been so inspiring to hear about the amazing things people are working on in the AI & compute space at #GTC2024!
- Christian Szegedy: who’s currently working on reasoning for xAI’s Grok. He believes that solving math is the best path towards intelligence. Math is more than just computing numbers; it is essential for many tasks: logic, coding, scientific discovery… His team is working on making Grok a superhuman mathematician. Fun fact: at xAI, they don’t have data annotators but they have AI tutors.
- Clément Farabet: VP of Research at DeepMind who’s leading the development of Gemma. He got me access to Gemini 1.5 Pro, a small mixture-of-experts model that can process a context length of 1 million tokens. Let me know what tasks you’d like me to experiment with!
- Sameer Raheja: senior director of engineering at NVIDIA. His team, RAPIDS AI, works on the challenging problem of making data processing cheaper and faster on GPUs. Companies like PayPal have already seen up to 70% cost savings by moving data processing from CPUs to GPUs. Check out Ilay Chen’s talk on it today!
- Kyle Kranen: engineering manager at NVIDIA who works on optimization for NIM, NVIDIA’s Inference Microservices. NIM is a suite of pre-built, optimized containers that let users deploy AI models anywhere.
- Adel El Hallak: senior director of product at NVIDIA who leads the development of NVIDIA’s API catalog, which lets users quickly try out open-source and closed-source models before deploying them with NIM.
If you want to learn more about data processing on GPUs, our team at Voltron Data will be presenting at the Lambda Labs booth today (Wed) at 2pm and tomorrow (Thu) at 11am!
#gpu #aiengineering #llmops