3. Researching Articles

Research is important. If you don't cite references at all … well, we have to break it to you, others may delete parts of the article they don't find credible, and in some cases, the whole article may be deleted. A good article is well documented, with reputable sources cited for all facts. As an article author, you're responsible for referencing your work; at the very least when writing, include a selection of the sources you used to put the article together. If you can't find a source for a fact, it probably doesn't belong in the encyclopedia.

The research for any substantial article should take at least as long—and probably much longer—than the actual writing. This requirement to research puts limits on how prolific anyone can be as a Wikipedia author; after all, Wikipedia is not a touch-typing test. Thus, although wiki editing is quick and spontaneous, writing a good article depends on a great deal of preparation time spent gathering sources and searching for information, and the skilled Wikipedia editor must also be skilled at these research tasks. Although adding things you know "off the top of your head" is easy and tempting, think about how you actually know that fact. Is it something you learned in school or spent time researching? Have you seen it yourself, or is it something you heard? If you can remember how you learned it, you can probably cite a source. You'll still need to research even if you mainly modify existing articles, rather than beginning new ones. Edits and additions to articles are more easily accepted by other editors if they are accompanied by good supporting citations. And if you enjoy doing research, fact-checking existing articles and adding existing citations is also a major cleanup task, as described in Chapter 7, Cleanup, Projects, and Processes. Finally, good research skills are useful for evaluating information in an article; if you question a particular point on a talk page, for instance, first do a quick search to see if the information is supported elsewhere or not.

In this section, we'll talk about research techniques and the different citation styles for Wikipedia articles. Research, like writing, is a skill that takes some practice to do well, and this is just a brief introduction to the topic. Other resources include guides to doing research (The Oxford Guide to Library Research by Thomas Mann is one such guide), libraries and librarians (whose job is to help with research questions), and forums such as Wikipedia's own reference desk (at [[Wikipedia:Reference desk]]), where you can ask questions about any topic. Ideally research will be interesting and natural, rather than burdensome; collating facts is the primary job of the encyclopedist, and research is the process behind that.

3.1. Good Wikipedia Research

What is meant by good research? On Wikipedia, research makes the site a quality reference work. For an individual editor, doing your research before writing is also thoughtful preparation ahead of exposing your work to criticism and modification in an open forum. Good research can give you confidence in the content you're submitting to Wikipedia, and it fortifies an article against questions from other editors about content—which may be important, because Wikipedia, like online communities in general, can be an argumentative place.

The Verifiability and No Original Research policies are the background to referencing information on Wikipedia. All facts should be verifiable, and theories that are advanced must be based only on what is already published, without novel synthesis. In practice, this means that sources should be cited for facts in accordance with the Reliable Sources guideline. From Wikipedia's point of view, some sources are better than others. What is also true is that, whereas all facts should flow from reliable sources, some facts are more worthy of thorough research before including them in an article than others. Anything contentious, or possibly suspect, should be well documented first. In contentious areas, however, don't expect complete agreement on what a reliable source is. If an article presents two sides of an issue, its editors should find reputable authorities on both sides.

Occasionally information in Wikipedia has turned out to be urban myth, something that comes from a friend of a friend… . Make sure, in cases like this, that what's been said is actually something that's documented somewhere and not just gossip or lazy journalism. Cite a good, reliable, printed source for the information. If verifying a point is harder than you thought, that's all the more reason to record where you found it for the sake of Wikipedia readers. Remember that Wikipedia doesn't pass judgment on what is true, but the site is responsible for reporting on and gathering reliable information about all topics.

The easiest topics to research well have mostly been added to the encyclopedia. More obscure topics may be more difficult to research, but again, part of learning about a subject well enough to write about it for a lay encyclopedia audience is doing good background research.

3.2. Doing Research

Researching is hard work, but like hunting for treasure or solving a crossword puzzle, the act of discovering the unknown through careful steps can be quite satisfying. Regardless of the topic you're interested in, doing research to corroborate information in an article or to find out more about a topic consists of a few basic steps:

  1. Determine the question you're trying to answer. The question may regard specific facts in an article that you want to cite or may be more general (for instance, "What year did this scientist receive a Nobel Prize?" or "What basic topics did this scientist work on, and did he or she receive recognition for this work?"). With existing articles, if you need a place to begin asking questions, take a second and skeptical look at the factual content. Which salient facts are most likely to be in doubt?

  2. Figure out what kind of sources are likely to have the information you're looking for and where you can find those sources.

  3. Use the appropriate type of source. This is a critical (and often overlooked) part of good research: Not all sources are right for all topics. For information on a recent natural disaster, review newspaper or news agency accounts and governmental bulletins; these news stories can be found either through searching web indexes such as Google News, which is free online, or news indexes such as LexisNexis, which many libraries subscribe to. On the other hand, if you're researching a 19th-century writer, accounts of his or her life may be found in literary encyclopedias, biographical dictionaries, introductions that appear in his or her work, or (if they were quite famous) dedicated biographies, which may be found in library catalogs. Often specific questions are easiest to answer: For the scientist example, a list of Nobel Prize winners may be found easily on the Nobel Prize committee website. For more in-depth information, however, you will need to turn to biographies.

  4. Search for appropriate sources. This requires turning your search question into keywords that can be used in a web search engine, library catalog, article database, or similar system. If you can't find anything using one term, try another similar term or name variation. It is at this stage and the previous one that asking for help from people knowledgeable about the topic and librarians can be very helpful.

  5. Read and evaluate potential sources. This is the time-consuming part of doing research. Do the sources answer your questions? Where did they get their information? Do they seem trustworthy? What trustworthy means can vary depending on what you're looking for; although a scholarly biography published by an academic publisher is usually a reliable source for investigating a scientist's life, an essay about that scientist that was written by a student and posted online on a personal website probably isn't. The Reliable Sources guideline, discussed starting on Section 3.3, “Reliable Sources”, talks a good deal about what a reliable source is for most topics on Wikipedia. The point is, however, that statements of fact should be based on the best information available.

  6. Document your sources. Once you've found a good, reliable source that answers your question, cite it, either with the inline style described in "Referencing Styles" on Section 3.4, “Referencing Styles” (for specific facts) or in a separate References section (for more general sources).

Congratulations, you're done! (For that article, anyway.)

Note

File away URLs. If you find likely web pages by searching online, save the URLs, even if you remember exactly how you found a key reference. Searches are not always repeatable experiments because search engines and inbound links update themselves.

A few particular research resources are worth mentioning:

Other good sources for finding books and articles are general textbooks or overview sources about a topic, which almost always include a bibliography of core sources that you can then turn to. And as with any research project, local libraries and librarians can also help suggest and locate possible sources.

3.3. Reliable Sources

Some sources are better than others for verifying information. Always use the best possible sources you can find. In addition, Wikipedia has a few general preferences for sources:

  • Given Wikipedia's global availability as a research tool and the possibility that collaborating researchers will have access only to publicly available sources, try to use sources that are freely available to everyone online (usually in addition to printed or subscription-based sources). Beware, however, that many resources accessible through universities or libraries, such as online magazines, are paid for by subscription and not available to the general public.

  • Up-to-date sources are preferable, though deeper coverage of a topic may depend on older scholarly sources. New sources can always be added in as they are published.

  • Use a mixture of sources whenever possible: A combination of external links to websites, references to standard textbooks, and specific references to academic books and journals is ideal.

  • Don't rely on blogs or any other sources that are led more by opinion than factual reporting; editorials may be good sources for discussing viewpoints about a controversial subject, but they are generally not the best sources for factual information. When researching a very controversial subject, be extremely careful to double-check the origins of sources and present both sides of the story.

  • For any source, ask about the expertise of the author writing. The idea of who an expert is will clearly vary in context; an expert in current popular musical culture might be a music journalist, not an academic professor. Regardless, be clear on where information is originating from.

  • Don't randomly cut well-referenced information or reasonable sources. If you find a better source than the one listed, by all means add it, but in most cases, don't remove the old source. If the sources disagree, note this in the article or on the talk page.

Debates are ongoing on Wikipedia about what a reliable source is and what claims need to be sourced within an article. Scholarly sources are, for the most part, going to carry more conviction than (say) what a columnist once wrote. But you should also be reasonable about sources: Reliability is not the same as infallibility.

Note

Don't self reference by using other Wikipedia articles as sources. This kind of self referencing defeats the point of getting information from outside reliable sources. Links to other articles should go in the See also section.

For a detailed discussion, see [[Wikipedia:Reliable sources]] (shortcut WP:RS). This guideline on reliable sources and the concurrent debate around it (unfortunately but maybe inevitably) has been dominated by extreme cases, which are not helpful for most articles. The guidelines for sources tend to deal with areas where it makes the most sense to take a tough line on reliability, such as religion, politics, or biographies of living people.

A related guideline deals with sources for biographies. The following is from [[Wikipedia:Biographies of living persons]] (shortcut WP:BLP), which is official policy (accessed October 3, 2007):

Material from self-published books, zines, websites, and blogs should never be used as a source about a living person, unless written or published by the subject of the article.

Editors should avoid repeating gossip. Ask yourself whether the source is reliable; whether the material is being presented as true; and whether, even if true, it is relevant to an encyclopedia article about the subject. When less-than-reliable publications print material they suspect is untrue, they often include weasel phrases. Look out for these. If the original publication doesn't believe its own story, why should we?

3.4. Referencing Styles

Once you have the perfect source in hand, you then need to actually add a citation to it in the article. A variety of referencing styles have been adopted and ultimately rejected over the years on Wikipedia—not to mention the half-dozen styles commonly used in academia being added to the mix and technological solutions such as templates being introduced. What all these referencing styles have in common, however, is they attempt to make it clear where information in an article comes from and what sources were used. Getting the specific style of the reference right is much less important than making sure the reference is present in the first place; other editors can always come along and edit it stylistically.

That said, we recommend the inline referencing style, where specific references are noted in the main body of the text using numbered superscript links to footnotes. These footnotes appear at the end of the article (usually grouped in a section called References or Notes) and contain the specific references used for each piece of information that has a footnote. This style is gradually becoming the most common on Wikipedia today, and it has several advantages. In-text links to footnotes means a reader can easily click to the source for any particular piece of information and that you can make very specific citations—each sentence in an article could conceivably have a footnote to a different specific source. The citations for each fact can appear directly next to that fact. This style also means that an article can easily acquire more sources over time; if an editor finds a perfect source for documenting a single fact in an article, he or she can easily insert a footnote to that source, and the list of footnotes and their numbers will automatically update. Additionally, if a footnote is deleted through text being edited or moved, the footnote numbers will automatically update. This makes it a good style for collaborative referencing work.

Along with footnotes that only contain a reference, they can also comment on or include asides to the main text, though these are less common in articles. For a comprehensively referenced article, the footnote section (which in this case would be called Notes) can also be combined with a separate References or Further reading section, where works can be listed as general references for the whole article. Figure 6.3, “The References, Further reading, and External links sections of the article [[Phineas Gage]]” shows this style used for the end sections of the article [[Phineas Gage]] (about a 19th-century man famed for surviving a large iron rod puncturing his skull and brain).

Figure 6.3. The References, Further reading, and External links sections of the article [[Phineas Gage]]

The References, Further reading, and External links sections of the article [[Phineas Gage]]

Wikipedia has no default style for the citations themselves; several different variations on academic styles are common (see [[Wikipedia:Citations]] for more information). How a work is cited will depend on what it is; a typical Wikipedia article includes sources from web pages, books, and perhaps periodicals and learned journals. Here is some advice for styling citations on Wikipedia:

  • Remain consistent. If one referencing method or style is already used in the article, stick to that style.

  • Inverting names is not necessary. Although many academic styles call for putting the last name of an author first in a reference listing, this style is unhelpful for readers who may be searching the site for a specific author.

  • Wikilinks can be used judiciously; many famous authors and well-known sources have their own articles, and in these cases, names within the citation can be linked for the curious reader.

  • Include a URL if you can, but make sure the URL is stable and accessible to everyone (not just subscribers to a magazine, for instance). Include the date that you accessed the URL.

  • Spell it out. Cryptic abbreviations often used in scholarly journals are unhelpful for a reader who may be new to a field of study. Give as much information about the source as you can (the full name of the author, journal name, or ISBN for a book, for instance). Readers from all over the world may ultimately try to find the source.

3.4.1. Using Footnotes

The workhorse of the inline style is the <ref> tag. Notes and citations are placed in between a pair of <ref> tags in the text itself, where you want the footnote to appear. For example,

According to scientists, the Sun is pretty big<ref> Miller, E: "The Sun", page
23. Academic Press, 2005</ref>, however the moon is not so big.<ref>Smith, R:
"Size of the Moon", ''Scientific American'', 46(78):46</ref>.

This example cites two sources: a work by E. Miller published by Academic Press and a work by R. Smith published in Scientific American. The references will only display in the text as a numbered link to a footnote, which appears directly where you have placed the first <ref> tag. These numbers automatically update whenever a new footnote is added.

For a reader to actually see the references cited, you must add a second piece—the <references/> tag, which is inserted at the bottom of the article in a section called References or Notes. The text of all footnotes will appear wherever you place this tag (though placing <references/> before <ref> doesn't work). For instance, for the previous example, you would create a section that looks like this:

==Notes==
<references/>

Don't add anything to the <references/> tag; the linked references from the text will automatically display here:

==Notes==
# ^ Miller, E: "The Sun", page 23. Academic Press, 2005
# ^ Smith, R: "Size of the Moon", ''Scientific American'', 46(78):46

Each footnote starts with a caret (^), which is automatically linked back to the text where the reference was placed. To edit the reference, you change it directly in the article text rather than in the References section.

Though you don't need to add anything to the <references/> tag, you can list other, non-footnoted references in this section as well. Creating two sections, one called Notes (for footnotes) and one called References (for all other references) is better, however. This form also enables you to use short references in the footnotes (such as "Smith, p. 47"), as long as you spell out the full reference in the References section.

You can use the same footnote (that is, the same citation), and hence the same footnote number, more than once. To do this, insert the name variable in the <ref> tag, like this:

According to scientists, the Sun is pretty big<ref name="sun"> Miller, E: "The
Sun", page 23. Academic Press, 2005</ref>, however the moon is not so big.<ref
name="moon">Smith, R: "Size of the Moon", ''Scientific American'', 46(78):46</ref>.

To refer to each of these citations at a later point, you only need to refer to the abbreviated form of the reference with ref name:

The sun is also very hot <ref name="sun"/>. The moon, however, is cold <ref
name="moon"/>.

Be sure to add that last ending slash, or all the text after the reference won't display when you save the page!

These two references then use the same footnote numbers as the first reference using that name. In the footnotes section, links to each instance of the reference will appear next to the footnote that is used multiple times, labeled alphabetically: a, b, c, and so on.

Using ref name is the best way to cite a source multiple times within the text. The old academic convention of ibid. (short for ibidem, meaning this is the same book as mentioned in the previous note, but on a different page) is not robust enough. Someone could later add a footnote between your successive notes, and the second note wouldn't make sense. The inline system works because if text is edited or moved, the numbers follow automatically. Unfortunately, one limitation of the current footnoting system is that if you want to change any part of the cited reference (such as to cite a different page number) ref name doesn't work; you have to type out the entire reference again. If this becomes time consuming in a long article, consider using the short reference style in a Notes section with the full references in a separate References section.

3.4.2. Referencing Templates

Many templates have been developed for formatting citations. You can use templates to ensure that cited references for different types of sources (such as newspapers, books, etc.) are clearly displayed in a standardized way. You can also use citation templates both for formatting citations in footnotes or citations listed separately in a References section.

In a citation template, the editor fills in the template parameters with the source information (such as the author, title of the work, and so on) and the template automatically arranges the citation appropriately, according to the type of source. For instance, here is the wikitext of the citation for an online newspaper story using the {{news}} template:

{{cite news |last=Plunkett |first=John |url=http://media.guardian.co.uk/site/
story/0,14173,1601858,00.html |title=Sorrell accuses Murdoch of panic buying
|publisher=The Guardian |date=2005-10-27 |accessdate=2005-10-27}}

When you save the page, the reader sees the following citation:

Plunkett, John. "Sorrell accuses Murdoch of panic buying", The Guardian,
October 27, 2005. Retrieved on October 27, 2005.

with the title of the article linked to the given URL. Templates do the work of making sure citations are correctly formatted. They also help ensure that references include all the necessary information—not just a URL for a website but also a date—so articles can be accessed from an archive if the URL no longer works.

Some templates have been created for specific sources that are commonly cited. In this case, the main source information is already encoded in the template, so the editor only has to add variable information such as author name or article title. The full source citation will appear when the page is saved. For instance, the template {{Fishbase_species}} is for adding a link to FishBase, a scholarly online database of information about fish. In the [[Guppy]] article, the following citation appears:

{{FishBase_species|genus=Poecilia|species=reticulata|year=2004|month=April}}

The editor only fills in the genus and species information and the date that the site was accessed. The saved page renders this as the following citation:

"Poecilia reticulata". FishBase. Ed. Ranier Froese and Daniel Pauly. April 2004
version. N.p.: FishBase, 2004.

The title of the database, the editors, and the formatting of the citation are all encoded in the template. This is clearly useful if you're citing the same source multiple times. A list of specific source templates can be found at [[Category:Specific source templates]].

One problem with templates is that they are inflexible; for instance, if you want to wikilink the author name in the news example given, you'll have trouble because the author first and last names are in different fields. There are two solutions for this particular problem: Either don't use the template and write the citation out by hand, or add an extra field that has been developed just for this called authorlink, which can be filled out like |authorlink=[[John Plunket]] (for joint authors, you can use authorlink1= and authorlink2=).

As you can see, templates can get complicated. They can also make the wiki markup denser and harder to read, especially if they are used for in-text citations, and they may not add that much to the presentation of references. Templates are certainly optional. They can also be added gradually: Existing references can be converted to templates over time. More information about citation templates can be found at [[Wikipedia:Citation_templates]].