The Text and Data Mining (TDM) Licence supports organisations in extracting valuable insights from unstructured content, improving decision-making and business outcomes while remaining copyright compliant. This licence will be offered as a bolt-on to existing CLA Business and Public Sector Licences and as a stand-alone licence where an organisation is currently unlicensed.

What is text and data mining?

Text and data mining is the process of transforming unstructured content into a structured format to analyse, extract and identify meaningful information and insights. By using TDM, organisations can harness the power of vast volumes of information and data, capturing and revealing key concepts, trends, and hidden relationships. Organisations use TDM for market research, sentiment analysis, text classification and customer analysis.

This computational technique provides valuable information to organisations for studies and research and to aid decision-making.
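
As a rough illustration of what "transforming unstructured content into a structured format" can mean in practice, the minimal Python sketch below turns a few free-text comments into structured records of term counts and a simple sentiment score. The sample texts, the word lists and the names `tokenize` and `mine` are assumptions invented for this example; real TDM workflows typically rely on far richer language-processing tools.

```python
import re
from collections import Counter

# Hypothetical sample texts standing in for unstructured content.
DOCUMENTS = [
    "Customers love the new pricing, but delivery was slow.",
    "Delivery was fast and the support team was excellent.",
    "Poor support and slow delivery made customers unhappy.",
]

# Toy sentiment lexicon, invented for this illustration only.
POSITIVE = {"love", "fast", "excellent"}
NEGATIVE = {"slow", "poor", "unhappy"}

# Very small stop-word list; a real project would use a fuller one.
STOP_WORDS = {"the", "and", "but", "was", "a", "new", "made"}


def tokenize(text):
    """Lower-case the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def mine(text):
    """Turn one unstructured text into a structured record."""
    tokens = [t for t in tokenize(text) if t not in STOP_WORDS]
    # Sentiment is the count of positive words minus the count of negative words.
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    return {"terms": Counter(tokens), "sentiment": score}


records = [mine(doc) for doc in DOCUMENTS]

# Aggregate term counts across all documents to surface recurring themes.
totals = Counter()
for record in records:
    totals.update(record["terms"])

if __name__ == "__main__":
    for doc, record in zip(DOCUMENTS, records):
        print(record["sentiment"], doc)
    print(totals.most_common(3))
```

Each record pairs a text with its term counts and a score, and the aggregated counts show which terms recur across the whole sample. This is the kind of structured output that could then feed the market research or sentiment analysis mentioned above.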

TDM Licence permissions

  • The right to download, extract from and format the licensed content, using computational technical means, on the licensee’s computer servers (including cloud-based servers), so that the licensed content can be used for the permitted purposes.
  • The right to create one’s own digital copy from print publications for the purpose of text and data mining.
  • The right to create a central repository with retention of mined licensed content (for the duration of the term of the licence only) – subject to the licensee agreeing to industry-standard information security obligations.

TDM use cases

  1. Media evaluation
  2. Financial analysis
  3. Image identification
  4. Scientific discovery
  5. Anti-plagiarism

TDM Licence FAQs

The licence covers employees and individual contractors of the licensee, as well as organisations subcontracted to undertake TDM on the licensee’s behalf.

The licence will cover print and digital publications from participating publishers, including magazines, journals, books and websites. This includes free-to-view websites and publications that the licensee has purchased or to which it subscribes.

The licence does not permit the use of licensed content for the purposes of training generative AI, or developing datasets or outputs that may be used to train generative AI, including Large Language Models. The use of licensed content and TDM outputs as inputs (prompts) to generative AI systems is also not permitted.