Bill Text: CA AB2013 | 2023-2024 | Regular Session | Introduced

NOTE: There are more recent revisions of this legislation.
Bill Title: Generative artificial intelligence: training data transparency.

Spectrum: Partisan Bill (Democrat 1-0)

Status: (Passed) 2024-09-28 - Chaptered by Secretary of State - Chapter 817, Statutes of 2024.



CALIFORNIA LEGISLATURE— 2023–2024 REGULAR SESSION

Assembly Bill
No. 2013


Introduced by Assembly Member Irwin

January 31, 2024


An act to add Title 15.2 (commencing with Section 3110) to Part 4 of Division 3 of the Civil Code, relating to artificial intelligence.


LEGISLATIVE COUNSEL'S DIGEST


AB 2013, as introduced, Irwin. Artificial intelligence: training data transparency.
Existing law requires the Department of Technology, in coordination with other interagency bodies, to conduct, on or before September 1, 2024, a comprehensive inventory of all high-risk automated decision systems, as defined, that have been proposed for use, development, or procurement by, or are being used, developed, or procured by, state agencies, as defined.
This bill would require, on or before January 1, 2026, a developer, as defined, of an artificial intelligence system or service, as defined, made available to Californians for use, regardless of whether the terms of that use include compensation, to post on the developer’s internet website documentation regarding the data used to train the artificial intelligence system or service, as specified.
Vote: MAJORITY   Appropriation: NO   Fiscal Committee: NO   Local Program: NO  

The people of the State of California do enact as follows:


SECTION 1.

 Title 15.2 (commencing with Section 3110) is added to Part 4 of Division 3 of the Civil Code, to read:

TITLE 15.2. Artificial Intelligence Training Data Transparency

3110.
 For purposes of this title, the following definitions shall apply:
(a) “Artificial intelligence system or service” means a machine-based system or service that can, for a given set of human-defined objectives, generate content and make predictions, recommendations, or decisions influencing a real or virtual environment.
(b) “Developer” means a person, partnership, state or local government agency, or corporation that designs, codes, or produces an artificial intelligence system or service, or substantially modifies an artificial intelligence system or service for use by a third party for free or for a fee.
(c) “Synthetic data generation” means a process in which seed data are used to create artificial data that have some of the statistical characteristics of the seed data.

3111.
 On or before January 1, 2026, a developer of an artificial intelligence system or service made available to Californians for use, regardless of whether the terms of that use include compensation, shall post on the developer’s internet website documentation regarding the data used to train the artificial intelligence system or service, including, but not limited to, all of the following:
(a) A description of each dataset used in the development of the system or service, including, but not limited to:
(1) The source or owner of the dataset.
(2) A clear definition of each category associated with data points within the dataset.
(3) The time period during which the data in the dataset was collected.
(4) The dates the dataset was first and last used during the development of the system or service.
(5) Whether the dataset was purchased by the developer, licensed by the developer, or is in the public domain.
(b) A disclosure of whether the system or service used or continuously uses synthetic data generation in its development.
