Unbiased estimation of the rates of synonymous and nonsynonymous substitution

J Mol Evol. 1993 Jan;36(1):96-9. doi: 10.1007/BF02407308.

Abstract

The current convention in estimating the number of substitutions per synonymous site (KS) and per nonsynonymous site (KA) between two protein-coding genes is to count each twofold degenerate site as one-third synonymous and two-thirds nonsynonymous because one of the three possible changes at such a site is synonymous and the other two are nonsynonymous. This counting rule can considerably overestimate the KS value because transitional mutations tend to occur more often than transversional mutations and because most transitional mutations at twofold degenerate sites are synonymous. A new method that gives unbiased estimates is proposed. An application of the new and the old method to 14 pairs of mouse and rat genes shows that the new method gives a KS value very close to the number of substitutions per four-fold degenerate site whereas the old method gives a value 30% higher. Both methods give a KA value close to the number of substitutions per nondegenerate site.

Publication types

  • Letter
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Genes*
  • Mathematics
  • Mice
  • Mutation
  • Proteins / genetics*
  • Rats

Substances

  • Proteins