Text and Data Mining (TDM) Licence | Copyright Licensing Agency

Text and Data Mining (TDM) is the process of using computational techniques to extract insights and patterns from large volumes of information. Organisations use TDM to gain competitive advantage and drive efficiencies by uncovering hidden trends and extracting facts and concepts from text and data.

CLA’s TDM licence extension includes rights covering use of published content for TDM purposes. This does not cover the use of content in training or prompting Generative AI models. We already have over 150 participating publishers, and these new rights are available to all UK businesses and public sector organisations.

What is text and data mining?

Text and data mining is the process of transforming unstructured content into a structured format to analyse, extract and identify meaningful information and insights. By using TDM, organisations can harness the power of vast volumes of information and data, capturing and revealing key concepts, trends, and hidden relationships. Organisations use TDM for market research, sentiment analysis, text classification and customer analysis.

This computational technique provides valuable information to organisations for studies and research and to aid decision-making.
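As a rough illustration of what "transforming unstructured content into a structured format" can mean in practice, the sketch below turns free text into rows of term counts per document. It is a minimal example under stated assumptions, not a description of any particular TDM product: the function names, the stop-word list and the sample text are all hypothetical, and real pipelines typically add steps such as entity extraction or sentiment scoring.

```python
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only; real pipelines use larger lists.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "for", "on", "by"}


def tokenize(text: str) -> list[str]:
    """Lower-case the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def mine_terms(documents: dict[str, str], top_n: int = 3) -> list[dict]:
    """Turn unstructured documents into structured rows of doc_id, term and count."""
    rows = []
    for doc_id, text in documents.items():
        # Count every token that is not a stop word.
        counts = Counter(t for t in tokenize(text) if t not in STOP_WORDS)
        # Keep only the most frequent terms for each document.
        for term, count in counts.most_common(top_n):
            rows.append({"doc_id": doc_id, "term": term, "count": count})
    return rows


if __name__ == "__main__":
    # Made-up sample text; in practice this would be licensed content.
    sample = {"d1": "Prices rose. Prices fell and prices rose again."}
    for row in mine_terms(sample, top_n=2):
        print(row)
```

The structured rows can then be loaded into a spreadsheet or database for the kinds of analysis described above, such as spotting trends across many documents.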

TDM Licensing permissions

The right to download, extract from, and format, using computational technical means, the licensed content on the licensee’s computer servers (including cloud-based servers) to enable the use of licensed content for the permitted purposes.
The right to create one’s own digital copy from print publications for the purpose of text and data mining.
The right to create a central repository with retention of mined licensed content (for the duration of the term of the licence only) – subject to the licensee agreeing to industry-standard information security obligations.

TDM use cases

Media evaluation
Financial analysis
Image identification
Scientific discovery
Anti-plagiarism
