Machine Learning & Large Language Models

Large Language Models (LLMs)

Large Language Models have become a major topic of discussion since ChatGPT, then based on GPT-3.5, was released to the public in late 2022. LLMs have come a long way since then: ChatGPT itself has moved to improved models such as GPT-4o, and open source alternatives have been introduced. While it is unfortunate that many programs such as ChatGPT are based on closed source models, recent months have seen a surge of interest in new open source models.

Finding Open Source LLMs

Currently, a wide variety of LLMs are available for download. Some are open source, some are not, and some (such as Meta's popular Llama models) are arguably somewhere in between. To find a model with license terms that are right for you, we recommend visiting https://huggingface.co/models and filtering by "License". Quite a few models are listed there under explicitly open source licenses such as MIT, alongside models with their own pseudo-open-source licenses, such as Meta's Llama series. In addition, the OSSIG plans to provide students with AI training resources in the upcoming Spring 2025 semester.
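
The license search described above can also be scripted. The following is a minimal sketch, not an official recipe: it assumes the Hub tags models as "license:<id>" (for example "license:mit") and accepts that tag as a "license" query parameter on the models page, which matches the site's behavior as far as we know but may change. The helper names (license_filter_tag, models_url, list_model_ids) are our own, and list_model_ids additionally assumes the third-party huggingface_hub package is installed and that network access is available.

```python
from urllib.parse import urlencode

HUB_MODELS_URL = "https://huggingface.co/models"


def license_filter_tag(license_id: str) -> str:
    """Build the tag form of a license identifier, e.g. 'mit' -> 'license:mit'."""
    # Assumption: the Hub stores license metadata as lowercase "license:<id>" tags.
    return f"license:{license_id.strip().lower()}"


def models_url(license_id: str) -> str:
    """Return a browser URL for the models page filtered to one license."""
    # safe=":" keeps the colon readable instead of percent-encoding it.
    query = urlencode({"license": license_filter_tag(license_id)}, safe=":")
    return f"{HUB_MODELS_URL}?{query}"


def list_model_ids(license_id: str, limit: int = 5) -> list[str]:
    """Ask the Hub for a few model ids under a license (needs network access)."""
    # Imported here so the URL helpers above work without the package installed.
    from huggingface_hub import HfApi

    models = HfApi().list_models(filter=license_filter_tag(license_id), limit=limit)
    return [m.id for m in models]


if __name__ == "__main__":
    # Print a link that can be opened in a browser; no network call is made.
    print(models_url("mit"))
```

Calling list_model_ids("apache-2.0") would return a handful of model ids carrying that license tag. A license tag only reflects what the uploader declared, so read the model's license file before relying on it.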