5. FINDINGS AND RESULTS

1                        Findings and results

The experiment’s data was analysed using ANOVA (analysis of variance) in order to test the various hypotheses. Using ANOVA, the effect of the independent variable on the dependant variable is measured. For each hypothesis, the dependant and independent variables are listed in combination to the results of the test. As a second analysis, several other variables are included in order to test their influence on the outcome, or in other words: the between subject effects. This is called ANCOVA (analysis of covariance). Furthermore, the influence of title properties on the dependant variables is assessed using multiple regression analysis.

 

5.1                    Hypothesis 1: The discovery of fully accessible titles is significantly higher, compared to titles which are not fully accessible.

The dependant variable is discovery, which is measured as the number of monthly Book visits a title received during the period from April 2009 to December 2009 in the Google Book search program. The independent variable is accessibility, which is measured as the set to which a title belongs. The variances of the independent variable were found to be significantly different; therefore the values of the Welch robust test of equality of means are reported here. There was a significant effect of accessibility on discovery, F(3, 1808) = 7.164, p < .01, ω = .07.

Planned contrasts revealed that the number of monthly Book visits a title received was increased by being disseminated through an Open Access channel (books in Set 2, Set 3 or Set 4), compared to titles which were not (books in Set 1), t(2117) = 1.99, p < .05 (one tailed), r = .04.

As the assumption of homogeneity of variance has not been met, when carrying out post-hoc tests, the Games-Howell procedure must be used. The multiple comparisons carried out by the Games-Howell test appear to show that there is a significant difference in means between Set 2 and Set 3, Set 2 and Set 4. In other words, the post-hoc test revealed a difference in Book visits between the titles disseminated through the AUP repository and titles disseminated through the Google Book Search program or through the AUP repository combined with the Google Book Search program.

 

Table 7.1 Hypothesis 1: Descriptive statistics of Book visits

 

N

Mean

Std. Deviation

Std. Error

95% Confidence Interval for Mean

Minimum

Maximum

 

Lower Bound

Upper Bound

Set 1

857

78.54

134.939

4.609

69.50

87.59

1

1322

Set 2

860

63.03

145.908

4.975

53.27

72.80

1

1845

Set 3

839

98.12

231.384

7.988

82.44

113.79

0

4522

Set 4

868

124.08

542.741

18.422

87.93

160.24

1

13804

Total

3424

90.99

313.270

5.354

80.49

101.49

0

13804

 

5.2                    Hypothesis 2: The online consultation (e.g. pages read or number of downloads) of fully accessible titles is significantly higher, compared to titles which are not fully accessible.

The dependant variable is online consultation, which is measured as the number of monthly page views a title received in the Google Book search program during the period from April 2009 to December 2009. Here the values from the AUP repository are not used, as they only apply to Set 2 and Set 3. The independent variable is accessibility, which is measured as the set to which a title belongs. The variances of the independent variable were found to be significantly different; therefore the values of the Welch robust test of equality of means are reported here. There was a significant effect of accessibility on online consultation, F(3, 1894) = 37.705, p < .01, ω = .18.

Planned contrasts revealed that the number of monthly page views a title received was increased by being disseminated through an Open Access channel (books in Set 2, Set 3 or Set 4), compared to titles which were not (books in Set 1), t(3420) = 4.92, p < .01 (one tailed), r = .08.

As the assumption of homogeneity of variance has not been met, when carrying out post-hoc tests, the Games-Howell procedure must be used. The multiple comparisons carried out by the Games-Howell test appear to show that there is a significant difference in means between Set 1 and Set 3, Set 1 and Set 4, Set 2 and Set 3, Set 2 and Set 4. In other words, the post-hoc test revealed a difference in page views between the control group and the titles disseminated through the AUP repository and titles disseminated through the Google Book Search program or through the AUP repository combined with the Google Book Search program.

 

Table 7.2 Hypothesis 2: Descriptive statistics of page views

 

N

Mean

Std. Deviation

Std. Error

95% Confidence Interval for Mean

Minimum

Maximum

 

Lower Bound

Upper Bound

Set 1

857

706.16

1469.756

50.206

607.62

804.70

3

18171

Set 2

860

574.09

1498.728

51.106

473.79

674.40

1

17163

Set 3

839

1123.25

1435.594

49.562

1025.97

1220.53

0

10051

Set 4

868

1302.97

1937.376

65.759

1173.90

1432.03

3

26607

Total

3424

926.48

1627.238

27.809

871.96

981.01

0

26607

 

5.3                    Hypothesis 3: The citation rate of fully accessible titles is significantly higher, compared to titles which are not fully accessible.

The dependant variable is citation rate, which is measured as the difference in citation rate per title during the period from April 2009 to December 2009 as found in the Google Scholar search engine. The independent variable is accessibility, which is measured as the set to which a title belongs. The variances of the independent variable were not significantly different, and there was no significant effect of accessibility on citation, F(3, 396) = .785, p < .51, ω = .99.

Planned contrasts revealed that the number of citations a title received was not increased or decreased by being disseminated through an Open Access channel (books in Set 2, Set 3 or Set 4), compared to titles which were not (books in Set 1), t(396) =  .99, p < .17 (one tailed), r = .05, and that the number of citations was not being increased or decreased by being disseminated through both the AUP repository and the Google Book Search program (books in Set 3), compared to titles which were disseminated through one of those channels (books in Set 2 and 4), t(396) = 1.12, p < .14, r = .06.

The assumption of homogeneity of variance has been met; therefore the Tukey and REGWQ post-hoc tests have been used. The multiple comparisons of both tests appear to show that there is no significant difference in means between any of the sets.

 

Table 7.3 Hypothesis 3: Descriptive statistics of citations

 

N

Mean

Std. Deviation

Std. Error

95% Confidence Interval for Mean

Minimum

Maximum

 

Lower Bound

Upper Bound

Set 1

100

1.60

3.864

.386

.83

2.37

0

33

Set 2

100

3.51

22.960

2.296

-1.05

8.07

0

230

Set 3

100

1.92

4.014

.401

1.12

2.72

0

24

Set 4

100

4.23

15.837

1.584

1.09

7.37

0

145

Total

400

2.82

14.210

.711

1.42

4.21

0

230

 

5.4                    Hypothesis 4: The sales figures of fully accessible titles are significantly higher, compared to titles which are not fully accessible.

The dependant variable is sales figures, which is measured as the monthly number of sales per title during the period from April 2009 to December 2009. The independent variable is accessibility, which is measured as the set to which a title belongs. From the collected data the following was omitted:

Table 7.4 Hypothesis 4: Omitted sales figures

ISBN

Sales

Set

Remarks

9789053568385

1643

1

remaindered

9789053569030

1238

2

remaindered

9789053568651

1210

1

remaindered

9789053565315

1048

1

remaindered

 

These were unusually high monthly sales, caused by remaindering of those titles. When a title is remaindered, the complete stock is sold to a specialised vendor, causing a major effect on the sales figures.

The variances of the independent variable were not significantly different, and there was no significant effect of accessibility on sales, F(3, 396) = .554, p < .65, ω = .99.

Planned contrasts revealed that the number of sales a title received was not increased or decreased by being disseminated through an Open Access channel (books in Set 2, Set 3 or Set 4), compared to titles which were not (books in Set 1), t(396) =  1.21, p < .12 (one tailed), r = .06.

The assumption of homogeneity of variance has been met; therefore the Tukey and REGWQ post-hoc tests have been used. The multiple comparisons of both tests appear to show that there is no significant difference in means between any of the sets.

 

Table 7.5 Hypothesis 4: Descriptive statistics of sales

 

N

Mean

Std. Deviation

Std. Error

95% Confidence Interval for Mean

Minimum

Maximum

 

Lower Bound

Upper Bound

Set 1

100

40.10

112.748

11.275

17.73

62.47

-2

1026

Set 2

100

63.59

151.826

15.183

33.46

93.72

-1

1088

Set 3

100

55.04

118.951

11.895

31.44

78.64

-1

724

Set 4

100

58.31

152.855

15.285

27.98

88.64

-5

1208

Total

400

54.26

135.122

6.756

40.98

67.54

-5

1208

 

5.5                    Hypothesis 5: The discovery of titles disseminated through both the institutional repository and the Google Book Search program is significantly higher, compared to titles disseminated through one of those channels.

The dependant variable is discovery, which is measured as the number of monthly Book visits a title received in the Google Book search program during the period April 2009 to December 2009. The independent variable is accessibility through a single channel or through multiple channels, which is measured as the set to which a title belongs. Titles in Set 2 and Set 4 are disseminated through one channel; titles in Set 3 are disseminated through multiple channels.

Testing Hypothesis 5 is an extension of the tests performed for Hypothesis 1. For Hypothesis 5, the focus is not on all titles in Open Access channels – the books in Set 2, Set 3 and Set 4 – versus the books in Set 1, but on the differences in discovery of the books in Set 2 and Set 4 compared to the discovery of titles in Set 3. In order to find these results, a planned contrast test is performed. Furthermore, as the descriptive statistics are the same as for Hypothesis 1, they are not repeated here.

Planned contrasts revealed that the number of monthly Book visits was not increased by being disseminated through both the AUP repository and the Google Book Search program (books in Set 3), compared to titles which were disseminated through one of those channels (books in Set 2 and Set 4), t(1816) = -.37, p < .36, r = .01. In this case, dissemination through multiple channels does not have a significant effect on discovery.

 

5.6                    Hypothesis 6: The online consultation (e.g. pages read or number of downloads) of titles disseminated through both the institutional repository and the Google Book Search program is significantly higher, compared to titles disseminated through one of those channels.

In order to test Hypothesis 6, several dependant variables were used. The first variable to be used is the number of monthly page views a title received in the Google Book search program, which is an extension of the tests performed for Hypothesis 2. Similar to Hypothesis 1 and Hypothesis 5, the focus is not on all titles in Open Access channels – the books in Set 2, Set 3 and Set 4 – versus the books in Set 1, but on the differences in online consultation of the books in Set 2 and Set 4 compared to the online consultation of titles in Set 3. In order to find these results, a planned contrast test is performed. Furthermore, as the descriptive statistics are the same as for Hypothesis 2, they are not repeated here.

The second dependant variable is the number of monthly page views a title received in the AUP repository. This only applies to books in Set 2 and Set 3. The third dependant variable is the number of monthly downloads a title received in the AUP repository. Of course, this is only measured for titles in Set 2 and Set 3.

 

5.6.1             Dependant variable: page views in the Google Book Search program

The dependant variable is online consultation, which is measured as the number of monthly page views a title received in the Google Book search program during the period April 2009 to December 2009. The independent variable is accessibility through a single channel or through multiple channels, which is measured as the set to which a title belongs. Titles in Set 2 and Set 4 are disseminated through one channel; titles in Set 3 are disseminated through multiple channels.

Planned contrasts revealed that the number of monthly page views was decreased by being disseminated through both the AUP repository and the Google Book Search program (books in Set 3), compared to titles which were disseminated through one of those channels (books in Set 2 and 4), t(1941) = -2.85, p < .05, r = .06. In this case, dissemination through multiple channels does have a significant negative effect on online consultation, which is most likely caused by the very low number of online consultations through the AUP repository, compared with the number of online consultations of titles through the Google Book Search program.

 

5.6.2             Dependant variable: page views in the AUP repository

The dependant variable is online consultation, which is measured as the number of monthly page views during the period April 2009 to December 2009 a title received in the AUP repository. The independent variable is accessibility through a single channel or through multiple channels, which is measured as the set to which a title belongs to. Titles in Set 2 are disseminated through one channel; titles in Set 3 are disseminated through multiple channels.

The variances of the independent variable were not significantly different, and dissemination through multiple channels does not have a significant effect on page views in the AUP repository, F(1, 468) = .333, p < .57, ω = .99.

Planned contrasts or post-hoc tests were not performed because there are fewer than three groups.

Table 7.6 Hypothesis 6: Descriptive statistics of repository page views

 

N

Mean

Std. Deviation

Std. Error

95% Confidence Interval for Mean

Minimum

Maximum

 

Lower Bound

Upper Bound

Set 2

255

1.60

2.088

.131

1.34

1.86

0

29

Set 3

215

1.50

1.459

.100

1.31

1.70

0

13

Total

470

1.56

1.826

.084

1.39

1.72

0

29

 

5.6.3             Dependant variable: downloads from the AUP repository

The dependant variable is online consultation, which is measured as the number of monthly downloads a title received in the AUP repository during the period from April 2009 to December 2009. The independent variable is accessibility through a single channel or through multiple channels, which is measured as the set to which a title belongs. Titles in Set 2 are disseminated through one channel; titles in Set 3 are disseminated through multiple channels.

The variances of the independent variable were not significantly different, and dissemination through multiple channels does not have a significant effect on monthly downloads from the AUP repository, F(1, 468) = .209, p < .65, ω = .99.

Planned contrasts or post-hoc tests were not performed because there are fewer than three groups.

 

Table 7.7 Hypothesis 6: Descriptive statistics of repository downloads

 

N

Mean

Std. Deviation

Std. Error

95% Confidence Interval for Mean

Minimum

Maximum

 

Lower Bound

Upper Bound

Set 2

255

3.82

10.739

.673

2.50

5.15

0

58

Set 3

215

4.37

15.192

1.036

2.33

6.41

0

122

Total

470

4.07

12.955

.598

2.90

5.25

0

122

 

5.7                    Hypothesis 7: The sales figures of titles disseminated through both the institutional repository and the Google Book Search program is significantly higher, compared to titles disseminated through one of those channels.

Testing Hypothesis 7 is an extension of the tests performed for Hypothesis 4. As we have seen for Hypothesis 1 paired to Hypothesis 5 and Hypothesis 2 paired to Hypothesis 6, the focus is not on all titles in Open Access channels – the books in Set 2, Set 3 and Set 4 – versus the books in Set 1, but on the differences in the dependant variable of the books in Set 2 and Set 4 compared to Set 3. In order to find these results, a planned contrast test is performed. Furthermore, as the descriptive statistics are the same as for Hypothesis 4, they are not repeated here.

The dependant variable is sales figures, which is measured as the monthly number of sales per title during the period from April 2009 to December 2009. The independent variable is accessibility, which is measured as the set to which a title belongs. Similar to Hypothesis 4, from the collected data the following was omitted:

Table 7.8 Hypothesis 7: Omitted sales figures

ISBN

Sales

Set

Remarks

9789053568385

1643

1

remaindered

9789053569030

1238

2

remaindered

9789053568651

1210

1

remaindered

9789053565315

1048

1

remaindered

 

These were unusually high monthly sales, caused by remaindering of those titles. When a title is remaindered, the complete stock is sold to a specialised vendor, causing a major effect on the sales figures.

Planned contrasts revealed that the number of sales was not being increased or decreased by being disseminated through both the AUP repository and the Google Book Search program (books in Set 3), compared to titles which were disseminated through one of those channels (books in Set 2 and 4), t(396) = .36, p < .37, r = .02.

 

5.8                    Analysis of covariance

The previous paragraphs described the hypotheses, where one dependant variable – discovery, citations, online consultation and sales – was tested against accessibility. In the following paragraphs, a test is performed to find if other variables – the covariables – have an effect on the dependant variable. For instance, if discovery – measured as the number of Book visits – is the dependant variable, the covariables tested will be online usage – measured as the number of page views in the Google Book program – combined with sales and citations. Measures of online usage tied to the AUP repository are not used here, as they are only relevant for books in Set 2 and Set 3.

ANCOVA tests are based on two assumptions: firstly, independence of the covariate and treatment effect and secondly, homogeneity of regression slopes. Independence of the covariate and treatment effect means that the effects of the covariate do not ‘interfere’ with the effects caused by the dependant variable. This can be tested using a customized ANCOVA model. For instance, choosing the number of repository downloads as a covariate would be a violation of this assumption, as it is closely related to the dependant variable: the dissemination channels used. The second assumption can be tested statistically using Levene’s test of equality of error variances, combined with a test on the ratio – or critical value – between the highest and lowest variances, called Hartley’s test. The critical value associated with datasets containing more than 60 items is 1.00. In order to meet the assumption of homogeneity of regression slopes, the found critical value must be lower (Field, 2009).

 

5.9                    Discovery: ANCOVA

The dependant variable is discovery, which is measured as the number of monthly Book visits a title received in the Google Book search program during the period April 2009 to December 2009. The independent variable is accessibility, which is measured as the set to which a title belongs. The covariates used are page views, citations and sales.

While the assumption of independence of the covariate and treatment effect has been met, the assumption of homogeneity of regression slopes was violated. Therefore, no results will be reported.

Table 7.9 Discovery: Between-Subjects effects

Source

Type III Sum of Squares

df

Mean Square

F

Sig.

Corrected Model

7.025E8

10

7.025E7

147.090

.000

Intercept

1625844.113

1

1625844.113

3.404

.066

Set

3570179.010

3

1190059.670

2.492

.060

Citations

351656.733

1

351656.733

.736

.391

PageViews

6.107E8

1

6.107E8

1278.573

.000

Sales

191481.715

1

191481.715

.401

.527

Set * Citations * PageViews * Sales

788434.536

4

197108.634

.413

.800

Error

1.858E8

389

477605.511

 

 

Total

1.131E9

400

 

 

 

Corrected Total

8.883E8

399

 

 

 

R2 = .791 (Adjusted R2 = .785)

 

5.10                Online consultation: ANCOVA

The dependant variable is online consultation, which is measured as the number of monthly page views a title received in the Google Book search program during the period April 2009 to December 2009. The independent variable is accessibility, which is measured as the set to which a title belongs. The covariates used are Book visits, citations and sales.

While the assumption of independence of the covariate and treatment effect has been met, the assumption of homogeneity of regression slopes was violated. Therefore, no results will be reported.

 

Table 7.10 Online consultation: Between-Subjects effects

Source

Type III Sum of Squares

df

Mean Square

F

Sig.

Corrected Model

4.645E10

10

4.645E9

156.686

.000

Intercept

1.009E9

1

1.009E9

34.033

.000

Set

5.071E8

3

1.690E8

5.701

.001

Citations

1.540E8

1

1.540E8

5.194

.023

Sales

3.030E7

1

3.030E7

1.022

.313

Bookvisits

3.267E10

1

3.267E10

1101.837

.000

Set * Citations * Bookvisits * Sales

1.358E8

4

3.396E7

1.145

.335

Error

1.153E10

389

2.965E7

 

 

Total

8.315E10

400

 

 

 

Corrected Total

5.799E10

399

 

 

 

R2 = .801 (Adjusted R2 = .796)

 

5.11                Citation: ANCOVA

The dependant variable is citation rate, which is measured as the difference in citation rate per title during the period from April 2009 to December 2009 as found in the Google Scholar search engine.  The independent variable is accessibility, which is measured as the set to which a title belongs. The covariates used are Book visits, page views and sales.

While the assumption of homogeneity of regression slopes has been met, the assumption of independence of the covariate and treatment effect was violated. Therefore, no results will be reported.

Table 7.11 Citation: Between-Subjects effects

Source

Type III Sum of Squares

df

Mean Square

F

Sig.

Corrected Model

4801.296

10

480.130

2.465

.007

Intercept

30.236

1

30.236

.155

.694

Set

656.814

3

218.938

1.124

.339

Sales

13.863

1

13.863

.071

.790

Bookvisits

649.728

1

649.728

3.336

.069

PageViews

3103.757

1

3103.757

15.935

.000

Set * PageViews * Bookvisits * Sales

1884.880

4

471.220

2.419

.048

Error

75769.014

389

194.779

 

 

Total

83740.000

400

 

 

 

Corrected Total

80570.310

399

 

 

 

R2 = .060 (Adjusted R2 = .035)

5.12                Sales: ANCOVA

The dependant variable is sales figures, which is measured as the monthly number of sales per title during the period from April 2009 to December 2009. The independent variable is accessibility, which is measured as the set to which a title belongs. The covariates used are Book visits, page views and citations.

The covariate Book visits was not significantly related to the sales figures, F(1,389) = .23, p < .64, partial η2 = .00, the covariate page views was not significantly related to the sales figures, F(1,389) = 1.12, p < .29, partial η2 = .00, the covariate citations was not significantly related to the sales figures, F(1,389) = .30, p < .86, partial η2 = .00. There was also no significant effect of accessibility on sales figures, F(3,389) = .52, p < .68, partial η2 = .00.

 

Table 7.12 Sales: Between-Subjects effects

Source

Type III Sum of Squares

df

Mean Square

F

Sig.

Corrected Model

64958.257

10

6495.826

.350

.966

Intercept

507337.952

1

507337.952

27.335

.000

Set

28970.806

3

9656.935

.520

.669

Bookvisits

4292.245

1

4292.245

.231

.631

PageViews

20684.399

1

20684.399

1.114

.292

Citations

565.743

1

565.743

.030

.861

Set * PageViews * Citations * Bookvisits

13218.190

4

3304.548

.178

.950

Error

7219956.703

389

18560.300

 

 

Total

8462574.000

400

 

 

 

Corrected Total

7284914.960

399

 

 

 

R2 = .009 (Adjusted R2 = -.017)

 

5.13                Multiple regression

As described in the previous chapter, several intrinsic properties of the publications were used to place them into different sets. Those properties – year of publication, print run, language and subject of the publication – may influence the usage by scientists. If – for instance – a researcher is not able to read Dutch, relevant publications in that language are not likely to be used. Therefore, several analyses were carried out to measure whether these effects occur. Furthermore, discovery of a title is crucial to usage; this is the reason for taking it into account as well. In the following paragraphs the results for discovery, online consultation, citation and sales are described.

 

5.14                Discovery: multiple regression

The dependant variable is discovery, which is measured as the number of monthly Book visits a title received in the Google Book search program during the period from April 2009 to December 2009. A multiple regression analysis is carried out, using a hierarchical model containing the following variables:

1.       Accessibility, which is measured as the set to which a title belongs; Year of publication; Print run of the publication; Language of the publication;

2.       Subject of the publication.

Table 7.13 Discovery: Multiple regression model summary

Model

R

R2

Adjusted R2

Std. Error of the Estimate

Change Statistics

Durbin-Watson

R2 Change

F Change

df1

df2

Sig. F Change

1

.356

.127

.112

1406.438

.127

8.153

7

392

.000

 

2

.481

.232

.144

1380.680

.105

1.434

34

358

.059

2.059

 

Table 7.14 Discovery: Multiple regression results

 

 

B

SE B

ß

Step 1

(Constant)

112521.79

59306.40

 

Set 2 vs. Set 1

-119.54

199.39

-.03

Set 3 vs. Set 1

183.81

199.19

.05

Set 4 vs. Set 1

334.07

200.08

.10

Dutch vs. English

-356.45

146.58

-.12*

German vs. English

-929.67

511.83

-.09

PrintRun

.55

.09

.30**

Year

-55.87

29.57

-.09

Step 2

(Constant)

98101.34

62822.44

 

Set 2 vs. Set 1

-83.98

196.80

-.02

Set 3 vs. Set 1

188.41

196.55

.05

Set 4 vs. Set 1

340.88

197.68

.10

Dutch vs. English

-80.86

174.82

-.03

German vs. English

-1944.18

1103.86

-.18

PrintRun

.44

.10

.24**

Year

-48.82

31.33

-.08

Art_History

2011.78

631.41

.27**

Culture

291.04

721.01

.03

Culture_History_Culture

-92.43

687.81

-.01

Dutch_Language

307.61

636.10

.04

Dutch_Language_Study

-310.00

973.16

-.02

Dutch_Literature

180.70

721.62

.02

Dutch_Literature_History

-182.38

657.55

-.02

Dutch_Literature_Education

-330.00

718.17

-.03

Economics

-103.72

866.05

-.01

Education

-309.03

812.43

-.02

History

265.68

578.59

.05

Information_Technology

273.43

955.30

.02

Japan_Culture_History

-535.15

784.97

-.04

Law

256.86

689.35

.03

Law_History

1341.59

1116.11

.14

Literature

-64.52

885.57

.00

Mathematics

60.55

870.14

.00

Medicine

-338.37

650.16

-.04

Motion_Pictures

563.60

581.44

.10

Music

1577.42

774.95

.13*

Philosophy

-9.16

742.44

.00

Political_Science

350.47

617.37

.05

Political_Science_Law

202.30

757.00

.02

Political_Science_Public_Administration_Sociology

-38.66

603.92

-.01

Psychology

251.56

651.64

.03

Public_Administration

-70.22

740.49

-.01

Public_Administration_Economics_Law

62.59

744.27

.01

Public_Administration_Political_Science

97.01

718.69

.01

Public_Administration_Political_Science_Sociology

-384.46

653.67

-.05

Public_Administration_Sociology

403.27

691.93

.04

Religion

-41.66

954.70

.00

Science

-184.88

669.68

-.02

Sociology

43.29

582.70

.01

Theatre

-1.98

875.54

.00

R2 = .13 for Step 1, ΔR2 = .11 for Step 2 (p < .05). * p < .05, ** p < .001.

 

Print run, publication subject “Art – History” and publication subject “Music” were significant predictors of Book visits.

 

5.15                Online consultation: multiple regression

Online consultation can be measured by the number of page views a title received in the Google Book Search Program, and by the usage through the AUP repository. As downloading a document from the repository points to actual usage whereas opening a page in the repository does not always imply usage, only the number of repository downloads are analysed.

 

5.15.1         Page views: multiple regression

The dependant variable is page views, which is measured as the number of monthly page views a title received in the Google Book search program during the period from April 2009 to December 2009. A multiple regression analysis is carried out, using a hierarchical model containing the following variables:

1.       Accessibility, which is measured as the set to which a title belongs; Book visits, which is measured as the total number of book visits a publication received during the during the period from April 2009 to December 2009; Year of publication; Print run of the publication; Language of the publication;

2.       Subject of the publication.

 

Table 7.15 Page views: Multiple regression model summary

Model

R

R2

Adjusted R2

Std. Error of the Estimate

Change Statistics

Durbin-Watson

R2 Change

F Change

df1

df2

Sig. F Change

1

.892

.796

.792

5502.176

.796

190.555

8

391

.000

 

2

.904

.817

.795

5451.681

.021

1.214

34

357

.197

2.128

 

Table 7.16 Page views: Multiple regression results

 

 

B

SE B

ß

Step 1

(Constant)

-66260.96

233077.60

 

 

Set 2 vs. Set 1

-208.35

780.39

-.01

 

Set 3 vs. Set 1

2371.49

780.11

.09**

 

Set 4 vs. Set 1

2450.96

785.50

.09**

 

Book visits

7.02

0.20

.87**

 

Dutch vs. English

-504.82

577.75

-.02

 

German vs. English

-2933.27

2010.77

-.03

 

PrintRun

0.10

0.38

.01

 

Year

33.82

116.19

.01

Step 2

(Constant)

-214834.94

248900.84

 

 

Set 2 vs. Set 1

-67.19

777.26

.00

 

Set 3 vs. Set 1

2424.08

777.10

.09**

 

Set 4 vs. Set 1

2481.44

783.78

.09**

 

Book visits

6.80

0.21

.84**

 

Dutch vs. English

-7.95

690.48

.00

 

German vs. English

-9434.25

4377.51

-.11*

 

PrintRun

-0.19

0.41

-.01

 

Year

108.24

124.11

.02

 

Art_History

4446.84

2528.26

.08

 

Culture

1276.19

2847.60

.02

 

Culture_History_Culture

-1368.04

2715.92

-.02

 

Dutch_Language

-820.23

2512.51

-.01

 

Dutch_Language_Study

-2838.36

3843.10

-.02

 

Dutch_Literature

-1908.75

2849.62

-.02

 

Dutch_Literature_History

-918.05

2596.63

-.01

 

Dutch_Literature_Education

-1309.94

2836.56

-.02

 

Economics

-1091.87

3419.73

-.01

 

Education

-2535.76

3208.59

-.02

 

History

674.22

2285.26

.02

 

Information_Technology

-4959.93

3772.47

-.04

 

Japan_Culture_History

-1763.20

3101.51

-.02

 

Law

-846.75

2722.46

-.01

 

Law_History

6012.06

4415.90

.08

 

Literature

-2948.11

3496.76

-.02

 

Mathematics

-3096.90

3435.80

-.03

 

Medicine

-1957.86

2568.16

-.03

 

Motion_Pictures

1349.63

2298.87

.03

 

Music

-6400.85

3077.58

-.07*

 

Philosophy

-865.35

2931.55

-.01

 

Political_Science

-2758.68

2438.82

-.05

 

Political_Science_Law

-89.16

2989.37

.00

 

Political_Science_Public_Administration_Sociology

-1315.89

2384.60

-.03

 

Psychology

965.16

2573.58

.01

 

Public_Administration

-2021.93

2923.92

-.02

 

Public_Administration_Economics_Law

-2978.31

2938.83

-.03

 

Public_Administration_Political_Science

496.81

2837.86

.01

 

Public_Administration_Political_Science_Sociology

-1552.63

2582.31

-.02

 

Public_Administration_Sociology

-600.57

2733.41

-.01

 

Religion

-2080.52

3769.70

-.02

 

Science

-734.96

2644.54

-.01

 

Sociology

-909.37

2300.84

-.02

 

Theatre

-1743.28

3457.11

-.01

 

R2 = .8 for Step 1, ΔR2 = .02 for Step 2 (p < .05). * p < .05, ** p < .001.

 

Set 3, Set 4 and book visits were significant predictors of page views. Publication language “German” and publication subject “Music” have a negative relationship.

 

5.15.2         Repository downloads: multiple regression

The dependant variable is downloads, which is measured as the number of monthly downloads a title received in the AUP repository during the period from April 2009 to December 2009. A multiple regression analysis is carried out, using a hierarchical model containing the following variables:

1.       Accessibility, which is measured as the set to which a title belongs – Set 2 or Set 3; Book visits, which is measured as the total number of book visits a publication received during the during the period from April 2009 to December 2009; Year of publication and print run of the publication; Language of the publication; Page views, which is measured as the total number of page views during the period April 2009 to December 2009 a title received in the AUP repository;

2.       Subject of the publication.

Table 7.17 Downloads: Multiple regression model summary

Model

R

R2

Adjusted R2

Std. Error of the Estimate

Change Statistics

Durbin-Watson

R2 Change

F Change

df1

df2

Sig. F Change

1

.377

.142

.111

34.167

.142

4.554

7

192

.000

 

2

.550

.303

.127

33.860

.160

1.106

33

159

.332

2.411

 

Table 7.18 Downloads: Multiple regression results

 

 

B

SE B

ß

Step 1

(Constant)

-1370.58

2358.10

 

 

Set 3 vs. Set 2

1.55

4.91

.02

 

Book visits

0.00

0.00

.10

 

Dutch vs. English

-3.58

5.13

-.05

 

German vs. English

-1.80

16.04

-.01

 

PrintRun

0.00

0.00

.02

 

Year

0.68

1.18

.04

 

RepositoryViews

2.86

0.55

.36*

Step 2

(Constant)

-524.18

2632.77

 

 

Set 3 vs. Set 2

1.05

4.91

.02

 

Book visits

0.00

0.00

.08

 

Dutch vs. English

-6.02

6.12

-.08

 

German vs. English

-2.62

23.31

-.01

 

PrintRun

0.00

0.00

.09

 

Year

0.26

1.31

.02

 

RepositoryViews

2.42

0.58

.30*

 

Art_History

-11.46

21.69

-.06

 

Culture

93.68

24.21

.36*

 

Culture_History_Culture

16.42

24.36

.06

 

Dutch_Language

-2.98

21.25

-.02

 

Dutch_Language_Study

1.11

38.54

.00

 

Dutch_Literature

2.80

25.11

.01

 

Dutch_Literature_History

3.77

21.63

.02

 

Dutch_Literature_Education

-7.30

24.82

-.03

 

Economics

-4.60

29.57

-.01

 

Education

-2.47

26.20

-.01

 

History

7.09

19.21

.06

 

Information_Technology

-2.21

29.74

-.01

 

Japan_Culture_History

2.40

27.31

.01

 

Law

4.78

23.19

.02

 

Literature

6.94

30.40

.02

 

Mathematics

-5.53

29.61

-.02

 

Medicine

-3.98

21.59

-.02

 

Motion_Pictures

7.47

19.47

.05

 

Music

-7.20

26.49

-.02

 

Philosophy

-4.52

24.50

-.02

 

Political_Science

-12.14

20.21

-.07

 

Political_Science_Law

7.91

26.44

.03

 

Political_Science_Public_Administration_Sociology

-2.03

19.99

-.01

 

Psychology

-5.86

22.19

-.03

 

Public_Administration

-0.07

24.16

.00

 

Public_Administration_Economics_Law

-2.36

24.18

-.01

 

Public_Administration_Political_Science

-3.50

24.01

-.01

 

Public_Administration_Political_Science_Sociology

-7.80

21.89

-.04

 

Public_Administration_Sociology

1.55

23.03

.01

 

Religion

17.56

29.69

.05

 

Science

-11.29

22.62

-.05

 

Sociology

2.84

19.37

.02

 

Theatre

12.25

29.73

.03

 

R2 = .14 for Step 1, ΔR2 = .16 for Step 2 (p < .05). * p < .001.

 

Repository views and publication subject “Culture” were significant predictor of downloads.

 

5.16                Citations: multiple regression

The dependant variable is citations, which is measured as the difference in citation rate per title during the period from April 2009 to December 2009 as found in the Google Scholar search engine. A multiple regression analysis is carried out, using a hierarchical model containing the following variables:

1.       Accessibility, which is measured as the set to which a title belongs; Book visits, which is measured as the total number of book visits a publication received during the during the period from April 2009 to December 2009; Year of publication; Print run of the publication; Language of the publication;

2.       Subject of the publication.

 

Table 7.19 Citations: Multiple regression model summary

Model

R

R2

Adjusted R2

Std. Error of the Estimate

Change Statistics

Durbin-Watson

R2 Change

F Change

df1

df2

Sig. F Change

1

.191

.037

.017

14.089

.037

1.859

8

391

.065

 

2

.351

.123

.020

14.070

.086

1.032

34

357

.422

2.158

 

Table 7.20 Citations: Multiple regression results

 

 

B

SE B

ß

Step 1

(Constant)

-136.98

596.84

 

 

Set 2 vs. Set 1

1.88

2.00

.06

 

Set 3 vs. Set 1

0.14

2.00

.00

 

Set 4 vs. Set 1

2.21

2.01

.07

 

Book visits

0.00

0.00

.13*

 

Dutch vs. English

-2.49

1.48

-.09

 

German vs. English

-3.67

5.15

-.04

 

PrintRun

0.00

0.00

-.09

 

Year

0.07

0.30

.01

Step 2

(Constant)

98.85

642.36

 

 

Set 2 vs. Set 1

1.96

2.01

.06

 

Set 3 vs. Set 1

0.17

2.01

.01

 

Set 4 vs. Set 1

1.91

2.02

.06

 

Book visits

0.00

0.00

.15*

 

Dutch vs. English

-3.59

1.78

-.13*

 

German vs. English

-10.94

11.30

-.11

 

PrintRun

0.00

0.00

-.10

 

Year

-0.05

0.32

-.01

 

Art_History

-1.30

6.53

-.02

 

Culture

5.03

7.35

.05

 

Culture_History_Culture

1.99

7.01

.02

 

Dutch_Language

5.17

6.48

.07

 

Dutch_Language_Study

4.27

9.92

.03

 

Dutch_Literature

3.45

7.35

.04

 

Dutch_Literature_History

3.84

6.70

.05

 

Dutch_Literature_Education

3.21

7.32

.03

 

Economics

-0.38

8.83

.00

 

Education

0.87

8.28

.01

 

History

2.92

5.90

.06

 

Information_Technology

-1.80

9.74

-.01

 

Japan_Culture_History

3.03

8.00

.03

 

Law

2.99

7.03

.03

 

Law_History

10.27

11.40

.11

 

Literature

2.78

9.02

.02

 

Mathematics

-1.06

8.87

-.01

 

Medicine

1.17

6.63

.02

 

Motion_Pictures

1.40

5.93

.03

 

Music

0.14

7.94

.00

 

Philosophy

0.53

7.57

.01

 

Political_Science

8.72

6.29

.13

 

Political_Science_Law

1.85

7.72

.02

 

Political_Science_Public_Administration_Sociology

3.98

6.15

.07

 

Psychology

12.16

6.64

.15

 

Public_Administration

1.23

7.55

.01

 

Public_Administration_Economics_Law

0.25

7.58

.00

 

Public_Administration_Political_Science

1.21

7.32

.01

 

Public_Administration_Political_Science_Sociology

5.11

6.66

.07

 

Public_Administration_Sociology

2.31

7.05

.03

 

Religion

1.82

9.73

.01

 

Science

22.15

6.83

.27*

 

Sociology

2.83

5.94

.06

 

Theatre

3.11

8.92

.02

 

R2 = .04 for Step 1, ΔR2 = .09 for Step 2 (p < .05). * p < .05.

 

Book visits, publication language “English” and publication subject “Science” were significant predictors of citations.

 

5.17                Sales: multiple regression

The dependant variable is sales, which is measured as the total number of sales of a publication during the period from April 2009 to December 2009. A multiple regression analysis is carried out, using a hierarchical model containing the following variables:

1.       Accessibility, which is measured as the set to which a title belongs; Book visits, which is measured as the total number of book visits a publication received during the during the period from April 2009 to December 2009; Year of publication; Print run of the publication; Language of the publication;

2.       Subject of the publication.

 

Table 7.21 Sales: Multiple regression model summery

Model

R

R2

Adjusted R2

Std. Error of the Estimate

Change Statistics

Durbin-Watson

R2 Change

F Change

df1

df2

Sig. F Change

1

.351

.123

.105

127.805

.123

6.874

8

391

.000

 

2

.518

.268

.182

122.199

.145

2.079

34

357

.001

2.138

 

Table 7.22 Sales: Multiple regression results

 

 

B

SE B

ß

Step 1

(Constant)

-9217.18

5413.93

 

Set 2 vs. Set 1

24.82

18.13

.08

Set 3 vs. Set 1

17.41

18.12

.06

Set 4 vs. Set 1

22.89

18.25

.07

Book visits

-0.01

0.00

-.06

Dutch vs. English

30.62

13.42

.11*

German vs. English

-21.64

46.71

-.02

PrintRun

0.05

0.01

.31**

Year

4.59

2.70

.08

Step 2

(Constant)

-14948.18

5579.10

 

Set 2 vs. Set 1

22.02

17.42

.07

Set 3 vs. Set 1

14.40

17.42

.05

Set 4 vs. Set 1

23.33

17.57

.07

Book visits

-0.01

0.00

-.06

Dutch vs. English

19.24

15.48

.07

German vs. English

-14.99

98.12

-.02

PrintRun

0.05

0.01

.28**

Year

7.46

2.78

.14*

Art_History

16.54

56.67

.02

Culture

-16.58

63.83

-.02

Culture_History_Culture

-7.66

60.88

-.01

Dutch_Language

33.97

56.32

.05

Dutch_Language_Study

-22.87

86.14

-.01

Dutch_Literature

-42.97

63.87

-.05

Dutch_Literature_History

138.77

58.20

.20*

Dutch_Literature_Education

84.92

63.58

.09

Economics

-37.20

76.65

-.03

Education

-65.78

71.92

-.05

History

-34.36

51.22

-.08

Information_Technology

-32.15

84.56

-.02

Japan_Culture_History

162.44

69.52

.15*

Law

-27.20

61.02

-.03

Law_History

-23.65

98.98

-.03

Literature

-47.42

78.38

-.03

Mathematics

-17.00

77.01

-.01

Medicine

-34.66

57.57

-.05

Motion_Pictures

-20.22

51.53

-.04

Music

-44.63

68.98

-.04

Philosophy

-48.44

65.71

-.05

Political_Science

-22.11

54.67

-.03

Political_Science_Law

-35.17

67.01

-.03

Political_Science_Public_Administration_Sociology

-79.10

53.45

-.14

Psychology

-47.48

57.69

-.06

Public_Administration

-23.66

65.54

-.02

Public_Administration_Economics_Law

-36.86

65.87

-.04

Public_Administration_Political_Science

-40.52

63.61

-.04

Public_Administration_Political_Science_Sociology

-78.09

57.88

-.11

Public_Administration_Sociology

77.42

61.27

.09

Religion

75.67

84.50

.05

Science

-3.68

59.28

.00

Sociology

-44.64

51.57

-.09

Theatre

41.69

77.49

.03

R2 = .11 for Step 1, ΔR2 = .18 for Step 2 (p < .05). * p < .05, ** p < .001.

 

Print run, publication language “English”, publication subject “Dutch Literature – History” and publication subject “Japan; Culture – History” were significant predictors of sales.