Training large AI models requires vast datasets, which are often scraped from the internet without the explicit consent of the people who created that content.
Reference: US Copyright Office AI policy