Mike Conover from Databricks was a guest speaker on the Latent Space podcast. He's a really smart integration engineer who understands the trends and knows how to use algorithms built by others as tools. Summary of the episode:

  • Infrastructure needs for Large Language Models (LLMs).
  • How Dolly was built on Databricks, with one hour of training at a cost of $30.
  • What infrastructure was missing that Mike wished existed to simplify this kind of LLM development.
  • He calls that type of infrastructure “LLMOps”.
  • Example - we’re missing good automation for checking the accuracy of question/response interactions; another model could be built to grade them automatically (see the first sketch after this list).
  • Example - computer vision can use thumbnails to quickly summarize large images or videos. With text, there is no notion of a thumbnail yet.
  • Example - it would be nice to have infrastructure that runs multiple models in parallel and displays their results in a spreadsheet-style grid, where each cell is a ‘thumbnail for text’, if such a thing could be invented (see the second sketch after this list).
  • Enumeration of products that could be built with LLMs
  • The state of the art in closed-source LLMs
  • How open source datasets and models are catching up
  • How companies like HuggingFace and Databricks are successful because they share information and build things as open source
  • Why there is also a need for closed-source products
  • Why closed-source companies drive the success and excitement around open-source businesses
  • Trends to expect for LLMs over the next 9 months
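
As a rough illustration of the automated accuracy-checking idea above, here is a minimal Python sketch of using a second model as a judge of question/response pairs. The `judge` callable, the grading prompt, and the `fake_judge` stub are assumptions for illustration, not anything Mike described; in practice the judge would be a real model endpoint.

```python
from typing import Callable, List, Tuple

# Hypothetical judge: any callable that maps a prompt string to a response string.
# In practice this would wrap a hosted LLM or a locally served open-source model.
JudgeFn = Callable[[str], str]

GRADING_PROMPT = """You are grading a question/answer pair.
Question: {question}
Answer: {answer}
Reply with exactly one word: CORRECT or INCORRECT."""


def grade_interactions(pairs: List[Tuple[str, str]], judge: JudgeFn) -> float:
    """Return the fraction of question/answer pairs the judge model marks CORRECT."""
    correct = 0
    for question, answer in pairs:
        verdict = judge(GRADING_PROMPT.format(question=question, answer=answer))
        if verdict.strip().upper().startswith("CORRECT"):
            correct += 1
    return correct / len(pairs) if pairs else 0.0


if __name__ == "__main__":
    # Stub judge for demonstration only; swap in a real model call.
    def fake_judge(prompt: str) -> str:
        return "CORRECT"

    sample = [("What is 2 + 2?", "4"), ("Capital of France?", "Paris")]
    print(f"accuracy: {grade_interactions(sample, fake_judge):.0%}")
```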
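
And for the parallel-models idea, a minimal sketch of fanning one prompt out to several models at once and truncating each response into a short snippet, a stand-in for the ‘thumbnails for text’ a spreadsheet view might show. The model names and lambda stubs are placeholders; only the Python standard library is used.

```python
import concurrent.futures
import textwrap
from typing import Callable, Dict, List, Tuple

# Hypothetical model interface: a callable from prompt string to response string.
ModelFn = Callable[[str], str]


def run_models_side_by_side(prompt: str, models: Dict[str, ModelFn],
                            width: int = 40) -> List[Tuple[str, str]]:
    """Call every model on the same prompt in parallel and return one
    (model name, truncated 'thumbnail' snippet) row per model."""
    with concurrent.futures.ThreadPoolExecutor() as pool:
        futures = {name: pool.submit(fn, prompt) for name, fn in models.items()}
        return [(name, textwrap.shorten(f.result(), width=width, placeholder="…"))
                for name, f in futures.items()]


if __name__ == "__main__":
    # Stub models standing in for real LLM endpoints.
    models = {
        "model-a": lambda p: "A long, detailed answer about " + p,
        "model-b": lambda p: "A terse answer: " + p,
    }
    for name, snippet in run_models_side_by_side("summarize LLMOps", models):
        print(f"{name:10} | {snippet}")
```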