The culture sector dataset has been created by Northumbria Culture Connect Data Observatory pilot
project to provide a view of the organisations operating across the culture sector in Newcastle.
Stage 1 of the pipeline breaks down the raw funding data into a candidate longlist of organisations.
The resulting data is broken down by local authority and source.
Stage 2 of the pipeline performs various matches, including simple matches,
and fuzzy matches to both Companies House and Charity Commission data.
It also attempts to correct any typographical errors in the names.
Stage 3 combines the candidate organisations and the matches from the prior stage
to produce the culture sector dataset. This is augmented with data from the Companies House
and Charity Commission references, and adds in other companies that do not appear in the
funding data sources, but which share SIC codes with the matched companies.