This dataset reports the number of new book titles published per year per million inhabitants in each country for the period 1500-2010. Data has been gathered and organized in 1-year intervals, using the current list of world countries as the geographical reference. Colours indicate which source is used for a given number; a more elaborate list of sources can be found in this document.


Jonathan Fink-Jensen

Production date



New book titles per capita


Human capital, book production, book titles, world countries

Time period


Geographical coverage

Entire World

Methodologies used for data collection and processing

Data on the number of new book titles has been gathered from a number of other datasets and publications that contain data on book production for different regions and periods (see below). This data has been combined with population numbers as given by Maddison (2003) and CLIO Infra in order to calculate the number of new book titles per million inhabitants.

Europe, Canada, USA, 1500-1800: Buringh & Van Zanden.

Europe, 1800-2010: Dataset 'Book production 1800-2010', mainly based on national bibliographies:

* Belgium, 1875-2004: Belgische Bibliografie. Numbers from 1966 onwards are more reliable, since an official depot was opened in that year and all publications had to be filed. However, there may still be small differences with the real number of new publications, since the numbers given indicate the number of publications processed in the Belgian bibliography, not the actual publications;
* Sweden, 1801-2009: LIBRIS catalogue, containing a substantial part of all publications, but real numbers will probably be higher;
* Netherlands, 1801-2009: Picarta;
* United Kingdom, 1801-1978: Copac; 1979-2009: British Library Legal Deposit. Copac data has the disadvantage that quite a number of titles are 'unspecified', which makes it harder to assess its reliability. Therefore, from the earliest date possible (1979) the data is taken from the British Library Legal Deposit, since these numbers for a later period correspond to the data gathered by Plopeanu et al. based on UNESCO statistics (see below);
* Denmark, 1800-2009: Danish catalogue;
* Norway, 1801-2009: Bibsys catalogue;
* Switzerland, 1915-2009: statistics published by the Federal Office of Statistics.

The total population per country is given by Maddison (2003) from 1820 onwards; CLIO Infra gives population numbers for 1800 and 1810. For the years in between, total population is taken from the closest year known (1801-1805 is the same as 1800, 1806-1815 the same as 1810, etc.).
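The closest-year rule and the per-million calculation described above can be sketched as follows. This is a minimal illustration, not the procedure actually used to build the dataset: the function names and the population figures are hypothetical, and the tie-breaking toward the earlier year is inferred from the examples given (1805 uses 1800, 1815 uses 1810).

```python
def nearest_benchmark_year(year, benchmark_years):
    """Return the benchmark year closest to `year`.

    Ties go to the earlier benchmark, matching the examples in the text
    (1805 uses 1800, 1815 uses 1810).
    """
    return min(benchmark_years, key=lambda b: (abs(b - year), b))


def titles_per_million(new_titles, year, population_by_year):
    """New book titles per million inhabitants, using the closest known population."""
    benchmark = nearest_benchmark_year(year, population_by_year.keys())
    return new_titles / (population_by_year[benchmark] / 1_000_000)


# Hypothetical population benchmarks (inhabitants), for illustration only.
population = {1800: 2_000_000, 1810: 2_200_000, 1820: 2_400_000}

print(nearest_benchmark_year(1805, population))   # 1800
print(nearest_benchmark_year(1806, population))   # 1810
print(titles_per_million(500, 1803, population))  # 250.0
```

Because a whole block of years shares one population figure, the per-million series within that block moves only with the number of titles.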
Germany, Finland, Switzerland, France, Poland, USSR, USA, 1800-1945: Dataset 'Book production before 1900', based on The History of the Book in the West 1800-1914, Vol. IV (2010) and The Oxford Companion to the Book, Vol. I (2010).

USSR, 1924-2010: Didenko, Földvári & Van Leeuwen (2013). For the years 1924-1939 and 1941-1949, population numbers for Armenia, Azerbaijan, Belarus, Georgia, Kazakhstan, Kyrgyzstan, Tajikistan, Turkmenistan, Ukraine, and Uzbekistan are unknown. The same goes for Moldova, Estonia, Latvia, and Lithuania for the years 1941-1949. However, the total population in these countries is given for 1940 (CLIO Infra) and 1950 onwards (Maddison). For the years 1924-1939 and 1941-1945 the data for 1940 is used, and for 1946-1949 the data for 1950.

Peru, Mexico, 1539-1799: Arroyo Abad & Van Zanden (2014). The annual average of new book titles per million inhabitants up to 1700 has been calculated by dividing the average annual production of new book titles by the average annual population (based on Maddison). For the 18th century, the annual average per million inhabitants is given by the authors themselves.

India, Indonesia, 1750-1799: Baten & Van Zanden (2014). The numbers given are annual averages, as given by the authors.

Japan, 1505-1911: Derived from Van Zanden (2009), based on Kornicki (2001) and Hayami and Kito (2004). All numbers are annual averages. Kornicki (2001) covers the longest periods, but since he gives annual averages for a very long period (1720s-1815), no growth is visible. According to his figures, the annual average production of new book titles per million inhabitants is 10.5 for the whole period, but this was probably lower in the earlier years and higher in later periods. Van Zanden states, for instance, that the annual production for the period 1750-1800 was 7 new titles per million inhabitants. In order to show some growth, numbers given by Hayami & Kito are shown for the 1720s and 1750s, and from 1754-1815 Kornicki's figures are given.

Ottoman Empire, 16th century: Van Zanden (2009). The sultan of the Ottoman Empire banned the possession of printed materials, and the fact that new technology could easily be suppressed suggests that the market for books was limited.

World, 1930s-1996: Plopeanu et al. (2014), based on UNESCO data. Population data taken from Maddison.
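The period averages described for Peru and Mexico up to 1700 (average annual production divided by average annual population) can be sketched as below. The function name and all figures are hypothetical and serve only to illustrate the arithmetic; they are not values from the dataset or from Arroyo Abad & Van Zanden.

```python
from statistics import mean


def average_titles_per_million(titles_by_year, population_by_year):
    """Average annual titles divided by average annual population (in millions)."""
    avg_titles = mean(titles_by_year.values())
    avg_population_millions = mean(population_by_year.values()) / 1_000_000
    return avg_titles / avg_population_millions


# Hypothetical annual figures for a three-year period, for illustration only.
titles = {1601: 10, 1602: 14, 1603: 12}
population = {1601: 1_900_000, 1602: 2_000_000, 1603: 2_100_000}

print(average_titles_per_million(titles, population))  # 6.0
```

This is a ratio of two period averages, which is not in general the same as averaging the yearly per-million values.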

Period of collection

June 2015

Data collectors

Jonathan Fink-Jensen

Anguilla: [No Data]

Antigua and Barbuda: 1500 (5)-2013 (21)

Aruba: [No Data]

Bahamas: 1500 (5)-2013 (23)

Barbados: 1500 (5)-2013 (27)

Bonaire, Sint Eustatius and Saba: [No Data]

British Virgin Islands: [No Data]

Cayman Islands: [No Data]

Cuba: 1500 (8)-2012 (34)

Curaçao: [No Data]

Dominica: 1500 (5)-2013 (20)

Dominican Republic: 1500 (6)-2013 (40)

Grenada: 1500 (5)-2013 (21)

Guadeloupe: [No Data]

Haiti: 1500 (6)-2013 (36)

Jamaica: 1500 (6)-2013 (38)

Martinique: [No Data]

Montserrat: [No Data]

In 2010, the Netherlands Organisation for Scientific Research (NWO) awarded a subsidy to the Clio Infra project, of which Jan Luiten van Zanden was the main applicant and which is hosted by the International Institute of Social History (IISH). Clio Infra has set up a number of interconnected databases containing worldwide data on social, economic, and institutional indicators for the past five centuries, with special attention to the past 200 years. These indicators allow research into long-term development of worldwide economic growth and inequality.

Global inequality is one of the key problems of the contemporary world. Some countries have (recently) become wealthy, while others have remained poor. New theoretical developments in economics, such as new institutional economics, new economic geography, and new growth theory, together with the rise of global economic and social history, require such processes to be studied on a worldwide scale. Clio Infra provides datasets for the most important indicators. Economic and social historians from around the world have been working together in thematic collaboratories in order to collect and share their knowledge concerning the relevant indicators of economic performance and its causes. The collected data have been standardized, harmonized, and stored for future use. New indicators to study inequality have been developed. The datasets are accessible through the Clio Infra portal, which also offers possibilities for visualization of the data. Clio Infra offers the opportunity to greatly enhance our understanding of the origins, causes, and character of the process of global inequality.