How long is the average dissertation?

Note: Please see the update to this blog!

The best part about writing a dissertation is finding clever ways to procrastinate. The motivation for this blog comes from one of the more creative ways I’ve found to keep myself from writing. I’ve posted about data mining in the past and this post follows up on those ideas using a topic that is relevant to anyone that has ever considered getting, or has successfully completed, their PhD.

I think a major deterrent that keeps people away from graduate school is the requirement to write a dissertation or thesis. One often hears horror stories of the excessive page lengths that are expected. However, most don’t realize that dissertations are filled with lots of white space, e.g., pages are one-sided, lines are double-spaced, and the author can put any material they want in appendices. The actual written portion may only account for less than 50% of the page length. A single chapter may be 30-40 pages in length, whereas the same chapter published in the primary literature may only be 10 or so pages long in a journal. Regardless, students (myself included) tend to fixate on the ‘appropriate’ page length for a dissertation, as if it’s some sort of measure of how much work you’ve done to get your degree. Any professor will tell you that page length is not a good indicator of the quality of your work. Regardless, I feel that some general page length goal should be established prior to writing. This length could be a minimum to ensure you put forth enough effort, or an upper limit to ensure you aren’t too excessive on extraneous details.

It’s debatable as to what, if anything, page length indicates about the quality of one’s work. One could argue that it indicates absolutely nothing. My advisor once told me about a student in Chemistry that produced a dissertation that was less than five pages, and included nothing more than a molecular equation that illustrated the primary findings of the research. I’ve heard of other advisors that strongly discourage students from creating lengthy dissertations. Like any indicator, page length provides information that may or may not be useful. However, I guarantee that almost every graduate student has thought about an appropriate page length on at least one occasion during their education.

The University of Minnesota library system has been maintaining electronic dissertations since 2007 in their Digital Conservancy website. These digital archives represent an excellent opportunity for data mining. I’ve developed a data scraper that gathers information on student dissertations, such as page length, year and month of graduation, major, and primary advisor. Unfortunately, the code will not work unless you are signed in to the University of Minnesota library system. I’ll try my best to explain what the code does so others can use it to gather data on their own. I’ll also provide some figures showing some relevant data about dissertations. Obviously, this sample is not representative of all institutions or time periods, so extrapolation may be unwise. I also won’t be providing any of the raw data, since it isn’t meant to be accessible for those outside of the University system.

I’ll first show the code to get the raw data for each author. The code returns a list with two elements for each author. The first element has the permanent and unique URL for each author’s data and the second element contains a character string with relevant data to be parsed.

#import package
require(XML)

#starting URL to search
url.in<-'http://conservancy.umn.edu/handle/45273/browse-author?starts_with=0'

#output object
dat<-list()

#stopping criteria for search loop
stp.txt<-'2536-2536 of 2536.'
str.chk<-'foo'

#initiate search loop
while(!grepl(stp.txt,str.chk)){

	html<-htmlTreeParse(url.in,useInternalNodes=T)
	str.chk<-xpathSApply(html,'//p',xmlValue)[3]

	names.tmp<-xpathSApply(html, "//table", xmlValue)[10]
	names.tmp<-gsub("^\\s+", "",strsplit(names.tmp,'\n')[[1]])
	names.tmp<-names.tmp[nchar(names.tmp)>0]

	url.txt<-strsplit(names.tmp,', ')
	url.txt<-lapply(
		url.txt,
		function(x){

			cat(x,'\n')
			flush.console()

			#get permanent handle
			url.tmp<-gsub(' ','+',x)
			url.tmp<-paste(
				'http://conservancy.umn.edu/handle/45273/items-by-author?author=',
				paste(url.tmp,collapse='%2C+'),
				sep=''
				)
			html.tmp<-readLines(url.tmp)
			str.tmp<-rev(html.tmp[grep('handle',html.tmp)])[1]
			str.tmp<-strsplit(str.tmp,'\"')[[1]]
			str.tmp<-str.tmp[grep('handle',str.tmp)] #permanent URL

			#parse permanent handle
			perm.tmp<-htmlTreeParse(
				paste('http://conservancy.umn.edu',str.tmp,sep=''),useInternalNodes=T
				)
			perm.tmp<-xpathSApply(perm.tmp, "//td", xmlValue)
			perm.tmp<-perm.tmp[grep('Major|pages',perm.tmp)]
			perm.tmp<-c(str.tmp,rev(perm.tmp)[1])

			}
		)

	#append data to list, will contain some duplicates
	dat<-c(dat,url.txt)

	#reinitiate url search for next iteration
	url.in<-strsplit(rev(names.tmp)[1],', ')[[1]]
	url.in<-gsub(' ','+',url.in)
	url.in<-paste(
		'http://conservancy.umn.edu/handle/45273/browse-author?top=',
		paste(url.in,collapse='%2C+'),
		sep=''
		)

	}

#remove duplicates
dat<-unique(dat)

The basic approach is to use functions in the XML package to import and parse raw HTML from the web pages on the Digital Conservancy. This raw HTML is then further parsed using some of the base functions in R, such as grep and strsplit. The tricky part is to find the permanent URL for each student that contains the relevant information. I used the ‘browse by author’ search page as a starting point. Each ‘browse by author’ page contains links to 21 individuals. The code first imports the HTML, finds the permanent URL for each author, reads the HTML for each permanent URL, finds the relevant data for each dissertation, then continues with the next page of 21 authors. The loop stops once all records are imported.

The important part is to identify the format of each URL so the code knows where to look and where to re-initiate each search. For example, each author has a permanent URL that has the basic form http://conservancy.umn.edu/ plus ‘handle/12345’, where the last five digits are unique to each author (although the number of digits varied). Once the raw HTML is read in for each page of 21 authors, the code has to find text where the word ‘handle’ appears and then save the following digits to the output object. The permanent URL for each student is then accessed and parsed. The important piece of information for each student takes the following form:

University of Minnesota Ph.D. dissertation. July 2012. Major: Business. Advisor: Jane Doe. 1 computer file (PDF); iv, 147 pages, appendices A-B.

This code is found by searching the HTML for words like ‘Major’ or ‘pages’ after parsing the permanent URL by table cells (using the <td></td> tags). This chunk of text is then saved to the output object for additional parsing.

After the online data were obtained, the following code was used to identify page length, major, month of completion, year of completion, and advisor for each character string for each student. It looks messy but it’s designed to identify the data while handling as many exceptions as I was willing to incorporate into the parsing mechanism. It’s really nothing more than repeated calls to grep using appropriate search terms to subset the character string.

#function for parsing text from website
get.txt<-function(str.in){

	#separate string by spaces
	str.in<-strsplit(gsub(',',' ',str.in,fixed=T),' ')[[1]]
	str.in<-gsub('.','',str.in,fixed=T)

	#get page number
	pages<-str.in[grep('page',str.in)[1]-1]
	if(grepl('appendices|appendix|:',pages)) pages<-NA

	#get major, exception for error
	if(class(try({
		major<-str.in[c(
			grep(':|;',str.in)[1]:(grep(':|;',str.in)[2]-1)
			)]
		major<-gsub('.','',gsub('Major|Mayor|;|:','',major),fixed=T)
		major<-paste(major[nchar(major)>0],collapse=' ')

		}))=='try-error') major<-NA

	#get year of graduation
	yrs<-seq(2006,2013)
	yr<-str.in[grep(paste(yrs,collapse='|'),str.in)[1]]
	yr<-gsub('Major|:','',yr)
	if(!length(yr)>0) yr<-NA

	#get month of graduation
	months<-c('January','February','March','April','May','June','July','August',
		'September','October','November','December')
	month<-str.in[grep(paste(months,collapse='|'),str.in)[1]]
	month<-gsub('dissertation|dissertatation|\r\n|:','',month)
	if(!length(month)>0) month<-NA

	#get advisor, exception for error
	if(class(try({
		advis<-str.in[(grep('Advis',str.in)+1):(grep('computer',str.in)-2)]
		advis<-paste(advis,collapse=' ')
		}))=='try-error') advis<-NA

	#output text
	c(pages,major,yr,month,advis)

	}

#get data using function, ran on 'dat'
check.pgs<-do.call('rbind',
	lapply(dat,function(x){
		cat(x[1],'\n')
		flush.console()
		c(x[1],get.txt(x[2]))})
		)

#convert to dataframe
check.pgs<-as.data.frame(check.pgs,sringsAsFactors=F)
names(check.pgs)<-c('handle','pages','major','yr','month','advis')

#reformat some vectors for analysis
check.pgs$pages<-as.numeric(as.character(check.pgs$pages))
check.pgs<-na.omit(check.pgs)
months<-c('January','February','March','April','May','June','July','August',
		'September','October','November','December')
check.pgs$month<-factor(check.pgs$month,months,months)
check.pgs$major<-tolower(check.pgs$major)

The section of the code that begins with #get data using function takes the online data (stored as dat on my machine) and applies the function to identify the relevant information. The resulting text is converted to a data frame and some minor reworkings are applied to convert some vectors to numeric or factor values. Now the data are analyzed using the check.pgs object.

The data contained 2,536 records for students that completed their dissertations since 2007. The range was incredibly variable (minimum of 21 pages, maximum of 2002), but most dissertations were around 100 to 200 pages.

Interestingly, a lot of students graduated in August just prior to the fall semester. As expected, spikes in defense dates were also observed in December and May at the ends of the fall and spring semesters.

The top four majors with the most dissertations on record were (in descending order) educational policy and administration, electrical engineering, educational psychology, and psychology.

I’ve selected the top fifty majors with the highest number of dissertations and created boxplots to show relative distributions. Not many differences are observed among the majors, although some exceptions are apparent. Economics, mathematics, and biostatistics had the lowest median page lengths, whereas anthropology, history, and political science had the highest median page lengths. This distinction makes sense given the nature of the disciplines.

I’ve also completed a count of number of students per advisor. The maximum number of students that completed their dissertations for a single advisor since 2007 was eight. Anyhow, I’ve satiated my curiosity on this topic so it’s probably best that I actually work on my own dissertation rather than continue blogging. For those interested, the below code was used to create the plots.

######
#plot summary of data
require(ggplot2)

mean.val<-round(mean(check.pgs$pages))
med.val<-median(check.pgs$pages)
sd.val<-round(sd(check.pgs$pages))
rang.val<-range(check.pgs$pages)
txt.val<-paste('mean = ',mean.val,'\nmed = ',med.val,'\nsd = ',sd.val,
	'\nmax = ',rang.val[2],'\nmin = ', rang.val[1],sep='')

#histogram for all
hist.dat<-ggplot(check.pgs,aes(x=pages))
pdf('C:/Users/Marcus/Desktop/hist_all.pdf',width=7,height=5)
hist.dat + geom_histogram(aes(fill=..count..),binwidth=10) +
  scale_fill_gradient("Count", low = "blue", high = "green") +
	xlim(0, 500) + geom_text(aes(x=400,y=100,label=txt.val))
dev.off()

#barplot by month
month.bar<-ggplot(check.pgs,aes(x=month,fill=..count..))

pdf('C:/Users/Marcus/Desktop/month_bar.pdf',width=10,height=5.5)
month.bar + geom_bar() + scale_fill_gradient("Count", low = "blue", high = "green")
dev.off()

######
#histogram by most popular majors
#sort by number of dissertations by major
get.grps<-list(c(1:4),c(5:8))#,c(9:12),c(13:16))

for(val in 1:length(get.grps)){

	pop.maj<-names(sort(table(check.pgs$major),decreasing=T)[get.grps[[val]]])
	pop.maj<-check.pgs[check.pgs$major %in% pop.maj,]
	pop.med<-aggregate(pop.maj$pages,list(pop.maj$major),function(x) round(median(x)))
	pop.n<-aggregate(pop.maj$pages,list(pop.maj$major),length)

	hist.maj<-ggplot(pop.maj, aes(x=pages))
	hist.maj<-hist.maj + geom_histogram(aes(fill = ..count..), binwidth=10)
	hist.maj<-hist.maj + facet_wrap(~major,nrow=2,ncol=2) + xlim(0, 500) +
		scale_fill_gradient("Count", low = "blue", high = "green")

	y.txt<-mean(ggplot_build(hist.maj)$panel$ranges[[1]]$y.range)
	txt.dat<-data.frame(
		x=rep(450,4),
	  y=rep(y.txt,4),
		major=pop.med$Group.1,
		lab=paste('med =',pop.med$x,'\nn =',pop.n$x,sep=' ')
	  )

	hist.maj<-hist.maj + geom_text(data=txt.dat, aes(x=x,y=y,label=lab))

	out.name<-paste('C:/Users/Marcus/Desktop/group_hist',val,'.pdf',sep='')
	pdf(out.name,width=9,height=7)

	print(hist.maj)

	dev.off()

	}

######
#boxplots of data for fifty most popular majors

pop.maj<-names(sort(table(check.pgs$major),decreasing=T)[1:50])
pop.maj<-check.pgs[check.pgs$major %in% pop.maj,]

pdf('C:/Users/Marcus/Desktop/pop_box.pdf',width=11,height=9)
box.maj<-ggplot(pop.maj, aes(factor(major), pages, fill=pop.maj$major))
box.maj<-box.maj + geom_boxplot(lwd=0.5) + ylim(0,500) + coord_flip()
box.maj + theme(legend.position = "none", axis.title.y=element_blank())
dev.off()

Update: By popular request, I’ve redone the boxplot summary with major sorted by median page length.

130 thoughts on “How long is the average dissertation?”

anspiess says:

April 15, 2013 at 11:59 pm

How about page length vs. season the diss was written? I could imagine that in dark and lengthy winter days average page length increases, when there is no need to play outside ;-).

Reply
- beckmw says:
  
  April 16, 2013 at 7:01 am
  
  I found this blog after I published mine… there’s a plot of page length vs year, pretty interesting!
  
  http://www.r-bloggers.com/ucla-statistics-analyzing-thesisdissertation-lengths/
  
  Reply
Carson Sievert says:

April 16, 2013 at 9:15 am

It’d be nice if the boxplots were ordered in a sensible fashion (as opposed to alphabetical).

Reply
- beckmw says:
  
  May 15, 2013 at 7:31 am
  
  Okay, I caved…
  
  Reply
Walter Reade says:

April 18, 2013 at 10:48 am

My masters thesis was 165 pages. My phd dissertation was 164 pages. I must be a conformist!

Reply
- beckmw says:
  
  April 18, 2013 at 11:25 am
  
  Haha, I was legitimately concerned at one point that my dissertation would have less pages than my masters thesis…
  
  Reply
- Antonio Lopez (@Antonio_LG) says:
  
  July 15, 2013 at 11:39 am
  
  #epicwin
  
  Reply
carlwitthoft says:

April 18, 2013 at 2:02 pm

You need to send this analysis over to phdcomics, for sure!

Reply
- beckmw says:
  
  April 18, 2013 at 2:43 pm
  
  I’m on it, thanks for the suggestion.
  
  Reply
UM PhD says:

April 21, 2013 at 11:15 pm

Check this one out (if I’ve found the right one after all these years, I think his goal was to have one the longest UM PhD theses ever):

Epidemiologic study of dietary factors in the etiology of pancreatic cancer
Olsen, Geary Wayne.
Thesis (Ph.D.)–University of Minnesota. 1985,c1986
Available at UM TC Bio-Medical Library (WI810 O52e 1985 )

It was literally a pile of several reams of paper, containing I suspect everything he wrote during his program. This was back in the days before personal computer wordprocessing; candidates hired typists to complete the drafts incorporating committee changes.

Reply
statsinthewild says:

May 2, 2013 at 12:06 pm

Reblogged this on Stats in the Wild.

Reply
Length of the average dissertation | Design Interaction
Average Dissertation Length by Topic | Design Interaction
VotD: Length of the Average Dissertation | A (Budding) Sociologist's Commonplace Book
How Dissertation Length Varies By Major - Floating Path
Alex says:

May 8, 2013 at 12:17 pm

how about looking at length of time to complete dissertation by major and/or page-length? That would be an interesting one

Reply
- beckmw says:
  
  May 9, 2013 at 7:18 am
  
  I agree Alex, that would be neat, but I don’t think the start date is available in the database I used.
  
  Reply
David Giles says:

May 8, 2013 at 4:21 pm

See my post, http://davegiles.blogspot.ca/2011/02/how-long-should-my-thesis-be.html

Reply
- beckmw says:
  
  May 9, 2013 at 7:26 am
  
  Point well taken David. This is a nice summary coming from an academic. I’ve heard more than one advisor express the mantra of quality over quantity regarding dissertation length, and that the two are often not correlated.
  
  Reply
How long to make your dissertation « Worlding
Daniel says:

May 9, 2013 at 4:57 am

I want to know who had the balls to submit a 21 page thesis.

Reply
- beckmw says:
  
  May 9, 2013 at 7:26 am
  
  Get access to the UMN library system and you can find out 🙂
  
  Reply
Tim says:

May 9, 2013 at 5:01 am

Science subjects seem to be unfairly sub-categorized compared to the lack of sub-categorisation of arts subjects. Several types of biology, one type of history. English is especially worrisome, is that literature or language? (both very popular).

Reply
Average Dissertation Length by Topic | infoGraphi
Dissertation lengths compared | Skulking in Holes and Corners
Somewhere else, part 52 | Freakonometrics
Vasily Volkov says:

May 15, 2013 at 4:09 am

It would help if the graph was sorted such as by mean values. So far it seems the shortest dissertations are in math and biostatistics (100 pages). The longest are in history (300 pages).

Reply
- beckmw says:
  
  May 15, 2013 at 7:31 am
  
  Check the update above.
  
  Reply
  - Vasily Volkov says:
    
    May 15, 2013 at 7:43 am
    
    Wow! This is really cool. Thanks!
CAA News | College Art Association » Blog Archive » News from the Art and Academic Worlds | CAA
How long is the average dissertation? | MU Information Graphics Spring 2013
WMC says:

May 16, 2013 at 2:33 pm

Very helpful- thanks for your blog. I am new to R– would you mind sharing the code you used in the updated box plot in which you sorted on mean? It’s unclear to me if I need to transform pop.maj or if there is an argument passed to ggplot. Thanks in advance.

Reply
- beckmw says:
  
  May 16, 2013 at 2:55 pm
  
  Hi,
  
  ggplot organizes boxplots based on the factor levels that define the boxes. To reorganize, you have to rename the levels of your factor before calling ggplot. In this case, I reordered based on median (using order and redefining the factor):
  
  #pop.maj is dataframe with all the records
  #major is the factor I’m redefining
  medians<-aggregate(pages~major,median,data=pop.maj)
  med.rank<-order(medians$pages)
  pop.maj.2<-pop.maj
  pop.maj.2$major<-factor(pop.maj$major,levels=levels(factor(pop.maj$major))[med.rank])
  
  Reply
  - WMC says:
    
    May 16, 2013 at 3:07 pm
    
    Ah. Merci.
Jessica Elizabeth Kaiser says:

May 17, 2013 at 8:23 am

Thanks! This is really cool (and you’ve been amazingly nice to the people demanding you do it differently, explain yourself, justify categorization, etc.). 🙂

Reply
- mcmains says:
  
  August 1, 2013 at 12:46 am
  
  of course he has. because doing it differently, explaining and justifying, for this project, is NOT working on the dissertation. 🙂
  we’ve all been there.
  
  Reply
论文的篇幅 | 视物 | 致知
jmgorman says:

May 23, 2013 at 10:42 am

My advisor often teased me (300pages) that a previous department chair used to keep a scale in his office with a 5kg weight on one side. Dissertations were not accepted for review until the scale tilted towards the tome. There were rumors of students arriving with slightly damp copies hoping to get a shot at defense.

Reply
- beckmw says:
  
  June 8, 2013 at 4:52 am
  
  I still haven’t submitted my final dissertation, perhaps I’ll try soaking it first.
  
  Reply
eschauster says:

June 7, 2013 at 1:30 pm

You’re smart.

Reply
- beckmw says:
  
  June 8, 2013 at 4:53 am
  
  Clever procrastination may often look smart 🙂
  
  Reply
Web Scraping with BeautifulSoup4 | Piquant Bites
dissertation length | mommacommaphd
nomansalehzada says:

July 3, 2013 at 1:30 am

It’s very informative article, thank you for sharing your words with us, Today many students find difficulty in writing dissertations,for them i would like to recommend a very useful site, they have expert writers and help out students best. Here is the addresshttp://writing-dissertation.org.

Reply
jack says:

July 11, 2013 at 2:16 pm

Do you have acess to a similar set of data for Master’s Theses?

Reply
- beckmw says:
  
  July 11, 2013 at 2:56 pm
  
  Hi Jack,
  
  The Digital Conservancy where I obtained the data does contain Master’s theses. Perhaps an idea for another blog entry… in the mean time you can check out the link below. A similar analysis was conducted, with the second figure separating theses from dissertations (but note this only applies to stats majors at UCLA).
  
  http://www.r-bloggers.com/ucla-statistics-analyzing-thesisdissertation-lengths/
  
  -Marcus
  
  Reply
Field Notes
Length of the average dissertation | Teachers Tech
Matt Whyndham says:

July 15, 2013 at 10:22 am

Are these authors free to play with fonts and spacing, or are these dictated by the institution and/or Department? Ideally, I would like to see a calibration across to Number of Words.

Reply
- beckmw says:
  
  July 15, 2013 at 10:40 am
  
  Formatting requirements are very strict and the final dissertation is accepted by the graduate school only after verification that the requirements are met. So it’s safe to say that these dissertations are all standardized for a given format. With that said, word count would be interesting as I suspect many dissertations have appendices filled with tabular data, rather than text.
  
  -Marcus
  
  Reply
PhD Dissertation Lengths | Brian Sandberg: Historical Perspectives
Hoeveel pagina’s telt een gemiddeld doctoraat of promotie-onderzoek? (grafiek) | X, Y of Einstein?
saadgulzar says:

July 23, 2013 at 1:38 am

Reblogged this on Saad Gulzar and commented:
The new political science – the version with more stats – is converging to economics.

Reply
Martin says:

October 18, 2013 at 9:45 am

Any possibility to access the complete dissertation?

Reply
- beckmw says:
  
  October 18, 2013 at 3:51 pm
  
  Not unless you’re a student/faculty at the University of Minnesota!
  
  Reply
The Average Length of a Dissertation | Everyday Theology
E.L. Wisty says:

November 10, 2013 at 6:58 am

Reblogged this on Pink Iguana.

Reply
How Long Should My Dissertation Be? | PROFpost+
Bryce Marquis says:

December 6, 2013 at 1:34 pm

LTTP- this is fun. I wonder if there’s any correlation between dissertation length and average salary after graduation. It seems like really high paying fields (engineering, etc.) have really short dissertations. I wonder if the job market demand is inversely proportional to the dissertation length.

Reply
James Harlan says:

December 9, 2013 at 4:43 am

@Bryce: That topic sounds interesting! Probably, beckmw can definitely enlighten us on that… Cheers!

Reply
- beckmw says:
  
  December 9, 2013 at 8:41 am
  
  Haha, perhaps an inspiration for another blog!
  
  Reply
Ana says:

January 23, 2014 at 12:21 pm

I am finishing a thesis in biophysics and I have currently written 270 pages…After seeing these stats, I feel I should erase half of them :S

Reply
- beckmw says:
  
  January 23, 2014 at 1:17 pm
  
  Don’t do that! That’s what appendices are for 🙂
  
  Reply
How long is (your PhD dissertation) long enough? | marketmealex
fdujiaoyang says:

February 26, 2014 at 8:58 pm

Hi, I appreciate your published data. I found the mean length of recent (2007-2013) Ph.D dissertations on cultural anthropology (including applied anthropology) from Univ. of Florida is 262.91 pages. (Sample size=37, Min=145, Max=417). I think the mean is very close to that of the Minnesota data.
Thanks,

Reply
- beckmw says:
  
  February 27, 2014 at 7:10 am
  
  Cool! I did my undergrad at UF. It would be neat to do a more comprehensive comparison.
  
  Reply
Great links: All the best advice on dissertation writing in one post! | Meagan Kittle Autry
datagraphsandplots says:

April 3, 2014 at 8:29 am

thanks for the detailed explanation and the code. i’d love to do this for my home uni, so I’ll wait for the next break and try to do it.

Reply
- beckmw says:
  
  April 3, 2014 at 8:06 pm
  
  I would love to see this for other Universities, although I’m willing to bet the University of Minnesota is a pretty good representation of at least US trends.
  
  Reply
How long is the average dissertation? | Belajar Parasit's Blog
RT @randal_olson: Oh, crap! –> Average length… » Personal blog of Peter "Sci" Turpin
nathanaelsmith says:

April 3, 2014 at 3:28 pm

This is interesting. I’m curious, is there a reason why dissertations in Philosophy are not included in the data?

Reply
- beckmw says:
  
  April 7, 2014 at 7:49 pm
  
  Hi there,
  
  Only a handful of records in the database had philosophy as the primary major, which is why they weren’t included in the ‘top 50’ plot. In case you were curious, the mean for the records was 176 pages, sd 43.
  
  -Marcus
  
  Reply
  - nathanaelsmith says:
    
    May 4, 2014 at 4:03 pm
    
    Thanks for the info!