Amino acid dipepetide frequency for Physeter macrocephalus (Sperm whale) (Physeter catodon)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.367AlaAla: 7.367 ± 0.035
1.385AlaCys: 1.385 ± 0.01
3.039AlaAsp: 3.039 ± 0.012
4.972AlaGlu: 4.972 ± 0.02
2.542AlaPhe: 2.542 ± 0.012
4.941AlaGly: 4.941 ± 0.022
1.583AlaHis: 1.583 ± 0.008
2.6AlaIle: 2.6 ± 0.01
3.492AlaLys: 3.492 ± 0.024
7.27AlaLeu: 7.27 ± 0.024
1.417AlaMet: 1.417 ± 0.008
1.979AlaAsn: 1.979 ± 0.01
4.588AlaPro: 4.588 ± 0.022
3.399AlaGln: 3.399 ± 0.015
3.943AlaArg: 3.943 ± 0.017
6.11AlaSer: 6.11 ± 0.02
3.558AlaThr: 3.558 ± 0.016
4.773AlaVal: 4.773 ± 0.016
0.792AlaTrp: 0.792 ± 0.006
1.487AlaTyr: 1.487 ± 0.01
0.007AlaXaa: 0.007 ± 0.001
Cys
1.25CysAla: 1.25 ± 0.01
0.595CysCys: 0.595 ± 0.006
0.99CysAsp: 0.99 ± 0.009
1.239CysGlu: 1.239 ± 0.013
0.799CysPhe: 0.799 ± 0.007
1.664CysGly: 1.664 ± 0.016
0.642CysHis: 0.642 ± 0.007
0.887CysIle: 0.887 ± 0.008
1.099CysLys: 1.099 ± 0.009
2.115CysLeu: 2.115 ± 0.013
0.386CysMet: 0.386 ± 0.003
0.754CysAsn: 0.754 ± 0.007
1.393CysPro: 1.393 ± 0.013
1.06CysGln: 1.06 ± 0.009
1.284CysArg: 1.284 ± 0.009
1.963CysSer: 1.963 ± 0.014
1.049CysThr: 1.049 ± 0.008
1.272CysVal: 1.272 ± 0.012
0.281CysTrp: 0.281 ± 0.003
0.546CysTyr: 0.546 ± 0.005
0.003CysXaa: 0.003 ± 0.0
Asp
2.951AspAla: 2.951 ± 0.014
1.025AspCys: 1.025 ± 0.009
2.691AspAsp: 2.691 ± 0.016
3.481AspGlu: 3.481 ± 0.014
2.101AspPhe: 2.101 ± 0.01
3.341AspGly: 3.341 ± 0.017
1.144AspHis: 1.144 ± 0.008
2.496AspIle: 2.496 ± 0.013
2.479AspLys: 2.479 ± 0.012
5.031AspLeu: 5.031 ± 0.015
1.081AspMet: 1.081 ± 0.007
1.595AspAsn: 1.595 ± 0.01
2.963AspPro: 2.963 ± 0.015
1.839AspGln: 1.839 ± 0.01
2.473AspArg: 2.473 ± 0.011
4.283AspSer: 4.283 ± 0.015
2.446AspThr: 2.446 ± 0.014
3.129AspVal: 3.129 ± 0.014
0.62AspTrp: 0.62 ± 0.005
1.449AspTyr: 1.449 ± 0.01
0.005AspXaa: 0.005 ± 0.0
Glu
5.416GluAla: 5.416 ± 0.022
1.412GluCys: 1.412 ± 0.018
4.454GluAsp: 4.454 ± 0.018
8.167GluGlu: 8.167 ± 0.046
2.0GluPhe: 2.0 ± 0.01
4.28GluGly: 4.28 ± 0.017
1.521GluHis: 1.521 ± 0.008
3.119GluIle: 3.119 ± 0.018
5.556GluLys: 5.556 ± 0.032
6.589GluLeu: 6.589 ± 0.03
1.682GluMet: 1.682 ± 0.009
3.144GluAsn: 3.144 ± 0.017
3.401GluPro: 3.401 ± 0.019
3.292GluGln: 3.292 ± 0.017
4.282GluArg: 4.282 ± 0.019
4.575GluSer: 4.575 ± 0.019
3.433GluThr: 3.433 ± 0.015
4.223GluVal: 4.223 ± 0.017
0.684GluTrp: 0.684 ± 0.006
1.536GluTyr: 1.536 ± 0.01
0.005GluXaa: 0.005 ± 0.0
Phe
1.925PheAla: 1.925 ± 0.01
0.882PheCys: 0.882 ± 0.007
1.6PheAsp: 1.6 ± 0.01
1.996PheGlu: 1.996 ± 0.011
1.455PhePhe: 1.455 ± 0.009
2.103PheGly: 2.103 ± 0.012
1.023PheHis: 1.023 ± 0.007
1.669PheIle: 1.669 ± 0.011
1.695PheLys: 1.695 ± 0.009
3.826PheLeu: 3.826 ± 0.018
0.736PheMet: 0.736 ± 0.006
1.241PheAsn: 1.241 ± 0.008
1.997PhePro: 1.997 ± 0.011
1.756PheGln: 1.756 ± 0.008
1.945PheArg: 1.945 ± 0.01
3.361PheSer: 3.361 ± 0.012
1.938PheThr: 1.938 ± 0.011
2.013PheVal: 2.013 ± 0.011
0.448PheTrp: 0.448 ± 0.005
1.085PheTyr: 1.085 ± 0.007
0.005PheXaa: 0.005 ± 0.0
Gly
4.758GlyAla: 4.758 ± 0.022
1.258GlyCys: 1.258 ± 0.01
3.121GlyAsp: 3.121 ± 0.015
4.216GlyGlu: 4.216 ± 0.02
2.302GlyPhe: 2.302 ± 0.013
5.255GlyGly: 5.255 ± 0.03
1.717GlyHis: 1.717 ± 0.01
2.556GlyIle: 2.556 ± 0.011
3.703GlyLys: 3.703 ± 0.015
5.791GlyLeu: 5.791 ± 0.022
1.245GlyMet: 1.245 ± 0.008
2.237GlyAsn: 2.237 ± 0.011
4.523GlyPro: 4.523 ± 0.034
2.829GlyGln: 2.829 ± 0.013
3.996GlyArg: 3.996 ± 0.017
5.924GlySer: 5.924 ± 0.022
3.544GlyThr: 3.544 ± 0.016
3.5GlyVal: 3.5 ± 0.015
0.804GlyTrp: 0.804 ± 0.009
1.642GlyTyr: 1.642 ± 0.011
0.008GlyXaa: 0.008 ± 0.001
His
1.391HisAla: 1.391 ± 0.007
0.702HisCys: 0.702 ± 0.006
0.901HisAsp: 0.901 ± 0.007
1.315HisGlu: 1.315 ± 0.009
1.041HisPhe: 1.041 ± 0.007
1.564HisGly: 1.564 ± 0.01
0.924HisHis: 0.924 ± 0.009
1.19HisIle: 1.19 ± 0.009
1.266HisLys: 1.266 ± 0.007
2.946HisLeu: 2.946 ± 0.012
0.559HisMet: 0.559 ± 0.005
0.815HisAsn: 0.815 ± 0.006
1.725HisPro: 1.725 ± 0.01
1.402HisGln: 1.402 ± 0.012
1.643HisArg: 1.643 ± 0.01
2.324HisSer: 2.324 ± 0.012
1.524HisThr: 1.524 ± 0.013
1.564HisVal: 1.564 ± 0.009
0.336HisTrp: 0.336 ± 0.004
0.79HisTyr: 0.79 ± 0.006
0.004HisXaa: 0.004 ± 0.0
Ile
2.436IleAla: 2.436 ± 0.011
0.986IleCys: 0.986 ± 0.008
1.951IleAsp: 1.951 ± 0.012
2.53IleGlu: 2.53 ± 0.013
1.722IlePhe: 1.722 ± 0.012
2.068IleGly: 2.068 ± 0.013
1.326IleHis: 1.326 ± 0.01
2.154IleIle: 2.154 ± 0.013
2.504IleLys: 2.504 ± 0.013
4.264IleLeu: 4.264 ± 0.016
0.91IleMet: 0.91 ± 0.006
1.714IleAsn: 1.714 ± 0.009
2.545IlePro: 2.545 ± 0.013
2.26IleGln: 2.26 ± 0.014
2.293IleArg: 2.293 ± 0.01
3.568IleSer: 3.568 ± 0.014
2.415IleThr: 2.415 ± 0.011
2.313IleVal: 2.313 ± 0.012
0.465IleTrp: 0.465 ± 0.004
1.268IleTyr: 1.268 ± 0.009
0.004IleXaa: 0.004 ± 0.0
Lys
3.993LysAla: 3.993 ± 0.021
1.11LysCys: 1.11 ± 0.008
3.169LysAsp: 3.169 ± 0.019
5.249LysGlu: 5.249 ± 0.03
1.677LysPhe: 1.677 ± 0.008
3.212LysGly: 3.212 ± 0.019
1.416LysHis: 1.416 ± 0.01
2.69LysIle: 2.69 ± 0.014
4.675LysLys: 4.675 ± 0.028
5.172LysLeu: 5.172 ± 0.022
1.423LysMet: 1.423 ± 0.009
2.35LysAsn: 2.35 ± 0.012
3.124LysPro: 3.124 ± 0.019
2.707LysGln: 2.707 ± 0.016
3.332LysArg: 3.332 ± 0.014
3.995LysSer: 3.995 ± 0.018
3.093LysThr: 3.093 ± 0.014
3.359LysVal: 3.359 ± 0.013
0.594LysTrp: 0.594 ± 0.005
1.522LysTyr: 1.522 ± 0.015
0.004LysXaa: 0.004 ± 0.0
Leu
6.792LeuAla: 6.792 ± 0.022
2.122LeuCys: 2.122 ± 0.014
4.729LeuAsp: 4.729 ± 0.016
7.35LeuGlu: 7.35 ± 0.036
3.222LeuPhe: 3.222 ± 0.015
5.825LeuGly: 5.825 ± 0.018
2.752LeuHis: 2.752 ± 0.013
3.677LeuIle: 3.677 ± 0.016
5.837LeuLys: 5.837 ± 0.026
10.559LeuLeu: 10.559 ± 0.04
1.932LeuMet: 1.932 ± 0.01
3.42LeuAsn: 3.42 ± 0.014
6.24LeuPro: 6.24 ± 0.021
6.013LeuGln: 6.013 ± 0.029
6.024LeuArg: 6.024 ± 0.021
7.995LeuSer: 7.995 ± 0.023
4.92LeuThr: 4.92 ± 0.017
5.318LeuVal: 5.318 ± 0.02
1.098LeuTrp: 1.098 ± 0.009
2.441LeuTyr: 2.441 ± 0.013
0.011LeuXaa: 0.011 ± 0.001
Met
1.839MetAla: 1.839 ± 0.007
0.39MetCys: 0.39 ± 0.004
1.208MetAsp: 1.208 ± 0.008
1.838MetGlu: 1.838 ± 0.009
0.694MetPhe: 0.694 ± 0.005
1.201MetGly: 1.201 ± 0.008
0.465MetHis: 0.465 ± 0.004
0.784MetIle: 0.784 ± 0.006
1.382MetLys: 1.382 ± 0.008
1.905MetLeu: 1.905 ± 0.008
0.548MetMet: 0.548 ± 0.005
0.861MetAsn: 0.861 ± 0.006
1.064MetPro: 1.064 ± 0.008
0.951MetGln: 0.951 ± 0.007
1.042MetArg: 1.042 ± 0.007
1.519MetSer: 1.519 ± 0.009
1.076MetThr: 1.076 ± 0.007
1.335MetVal: 1.335 ± 0.007
0.228MetTrp: 0.228 ± 0.003
0.531MetTyr: 0.531 ± 0.005
0.002MetXaa: 0.002 ± 0.0
Asn
1.996AsnAla: 1.996 ± 0.011
0.782AsnCys: 0.782 ± 0.007
1.472AsnAsp: 1.472 ± 0.01
2.178AsnGlu: 2.178 ± 0.013
1.382AsnPhe: 1.382 ± 0.007
2.324AsnGly: 2.324 ± 0.015
0.923AsnHis: 0.923 ± 0.007
1.991AsnIle: 1.991 ± 0.01
2.149AsnLys: 2.149 ± 0.012
3.584AsnLeu: 3.584 ± 0.015
0.861AsnMet: 0.861 ± 0.006
1.417AsnAsn: 1.417 ± 0.01
2.092AsnPro: 2.092 ± 0.01
1.662AsnGln: 1.662 ± 0.011
1.774AsnArg: 1.774 ± 0.009
3.085AsnSer: 3.085 ± 0.015
1.907AsnThr: 1.907 ± 0.01
2.115AsnVal: 2.115 ± 0.011
0.438AsnTrp: 0.438 ± 0.004
1.059AsnTyr: 1.059 ± 0.009
0.003AsnXaa: 0.003 ± 0.0
Pro
5.457ProAla: 5.457 ± 0.027
1.138ProCys: 1.138 ± 0.01
2.857ProAsp: 2.857 ± 0.016
4.625ProGlu: 4.625 ± 0.02
1.912ProPhe: 1.912 ± 0.011
5.588ProGly: 5.588 ± 0.041
1.491ProHis: 1.491 ± 0.009
1.814ProIle: 1.814 ± 0.011
2.797ProLys: 2.797 ± 0.016
5.451ProLeu: 5.451 ± 0.022
1.029ProMet: 1.029 ± 0.008
1.756ProAsn: 1.756 ± 0.01
6.767ProPro: 6.767 ± 0.053
2.998ProGln: 2.998 ± 0.016
3.661ProArg: 3.661 ± 0.017
6.138ProSer: 6.138 ± 0.026
3.123ProThr: 3.123 ± 0.015
4.008ProVal: 4.008 ± 0.017
0.701ProTrp: 0.701 ± 0.007
1.502ProTyr: 1.502 ± 0.01
0.008ProXaa: 0.008 ± 0.001
Gln
3.704GlnAla: 3.704 ± 0.018
0.929GlnCys: 0.929 ± 0.008
2.397GlnAsp: 2.397 ± 0.011
4.052GlnGlu: 4.052 ± 0.022
1.331GlnPhe: 1.331 ± 0.008
2.891GlnGly: 2.891 ± 0.014
1.372GlnHis: 1.372 ± 0.008
1.98GlnIle: 1.98 ± 0.011
2.995GlnLys: 2.995 ± 0.017
4.914GlnLeu: 4.914 ± 0.024
1.124GlnMet: 1.124 ± 0.007
1.837GlnAsn: 1.837 ± 0.01
3.006GlnPro: 3.006 ± 0.017
3.291GlnGln: 3.291 ± 0.028
3.156GlnArg: 3.156 ± 0.015
3.292GlnSer: 3.292 ± 0.015
2.353GlnThr: 2.353 ± 0.011
2.839GlnVal: 2.839 ± 0.012
0.533GlnTrp: 0.533 ± 0.005
1.128GlnTyr: 1.128 ± 0.007
0.004GlnXaa: 0.004 ± 0.0
Arg
4.144ArgAla: 4.144 ± 0.018
1.235ArgCys: 1.235 ± 0.011
2.803ArgAsp: 2.803 ± 0.011
4.244ArgGlu: 4.244 ± 0.02
1.802ArgPhe: 1.802 ± 0.011
3.839ArgGly: 3.839 ± 0.02
1.59ArgHis: 1.59 ± 0.008
2.368ArgIle: 2.368 ± 0.009
3.742ArgLys: 3.742 ± 0.017
5.589ArgLeu: 5.589 ± 0.02
1.171ArgMet: 1.171 ± 0.008
2.066ArgAsn: 2.066 ± 0.011
3.538ArgPro: 3.538 ± 0.017
2.798ArgGln: 2.798 ± 0.014
4.711ArgArg: 4.711 ± 0.022
4.56ArgSer: 4.56 ± 0.025
2.882ArgThr: 2.882 ± 0.013
3.208ArgVal: 3.208 ± 0.012
0.698ArgTrp: 0.698 ± 0.005
1.376ArgTyr: 1.376 ± 0.009
0.005ArgXaa: 0.005 ± 0.0
Ser
5.63SerAla: 5.63 ± 0.02
1.792SerCys: 1.792 ± 0.013
4.041SerAsp: 4.041 ± 0.024
5.438SerGlu: 5.438 ± 0.023
2.975SerPhe: 2.975 ± 0.013
5.734SerGly: 5.734 ± 0.021
2.128SerHis: 2.128 ± 0.01
3.09SerIle: 3.09 ± 0.012
4.162SerLys: 4.162 ± 0.017
8.347SerLeu: 8.347 ± 0.024
1.572SerMet: 1.572 ± 0.008
2.615SerAsn: 2.615 ± 0.013
6.527SerPro: 6.527 ± 0.032
4.027SerGln: 4.027 ± 0.016
4.842SerArg: 4.842 ± 0.024
9.9SerSer: 9.9 ± 0.046
4.496SerThr: 4.496 ± 0.02
4.933SerVal: 4.933 ± 0.017
1.03SerTrp: 1.03 ± 0.008
1.976SerTyr: 1.976 ± 0.011
0.008SerXaa: 0.008 ± 0.001
Thr
3.82ThrAla: 3.82 ± 0.015
1.213ThrCys: 1.213 ± 0.01
2.419ThrAsp: 2.419 ± 0.011
3.54ThrGlu: 3.54 ± 0.016
1.983ThrPhe: 1.983 ± 0.009
3.491ThrGly: 3.491 ± 0.015
1.296ThrHis: 1.296 ± 0.011
2.161ThrIle: 2.161 ± 0.01
2.605ThrLys: 2.605 ± 0.013
5.157ThrLeu: 5.157 ± 0.018
1.066ThrMet: 1.066 ± 0.006
1.648ThrAsn: 1.648 ± 0.009
3.746ThrPro: 3.746 ± 0.02
2.317ThrGln: 2.317 ± 0.014
2.522ThrArg: 2.522 ± 0.012
4.726ThrSer: 4.726 ± 0.018
2.881ThrThr: 2.881 ± 0.019
3.738ThrVal: 3.738 ± 0.016
0.653ThrTrp: 0.653 ± 0.006
1.333ThrTyr: 1.333 ± 0.009
0.006ThrXaa: 0.006 ± 0.001
Val
4.264ValAla: 4.264 ± 0.016
1.415ValCys: 1.415 ± 0.011
2.929ValAsp: 2.929 ± 0.014
3.864ValGlu: 3.864 ± 0.016
2.237ValPhe: 2.237 ± 0.011
3.33ValGly: 3.33 ± 0.014
1.584ValHis: 1.584 ± 0.008
2.707ValIle: 2.707 ± 0.012
3.409ValLys: 3.409 ± 0.015
6.072ValLeu: 6.072 ± 0.021
1.254ValMet: 1.254 ± 0.007
2.217ValAsn: 2.217 ± 0.011
3.78ValPro: 3.78 ± 0.016
2.79ValGln: 2.79 ± 0.011
3.091ValArg: 3.091 ± 0.012
4.935ValSer: 4.935 ± 0.016
3.651ValThr: 3.651 ± 0.019
3.842ValVal: 3.842 ± 0.017
0.673ValTrp: 0.673 ± 0.006
1.552ValTyr: 1.552 ± 0.01
0.006ValXaa: 0.006 ± 0.001
Trp
0.787TrpAla: 0.787 ± 0.006
0.244TrpCys: 0.244 ± 0.003
0.641TrpAsp: 0.641 ± 0.006
0.788TrpGlu: 0.788 ± 0.006
0.41TrpPhe: 0.41 ± 0.004
0.71TrpGly: 0.71 ± 0.006
0.296TrpHis: 0.296 ± 0.004
0.497TrpIle: 0.497 ± 0.005
0.779TrpLys: 0.779 ± 0.006
1.171TrpLeu: 1.171 ± 0.009
0.313TrpMet: 0.313 ± 0.003
0.514TrpAsn: 0.514 ± 0.005
0.541TrpPro: 0.541 ± 0.005
0.51TrpGln: 0.51 ± 0.005
0.741TrpArg: 0.741 ± 0.007
0.862TrpSer: 0.862 ± 0.007
0.644TrpThr: 0.644 ± 0.006
0.645TrpVal: 0.645 ± 0.006
0.183TrpTrp: 0.183 ± 0.003
0.311TrpTyr: 0.311 ± 0.005
0.001TrpXaa: 0.001 ± 0.0
Tyr
1.317TyrAla: 1.317 ± 0.008
0.653TyrCys: 0.653 ± 0.006
1.207TyrAsp: 1.207 ± 0.008
1.681TyrGlu: 1.681 ± 0.012
1.109TyrPhe: 1.109 ± 0.007
1.559TyrGly: 1.559 ± 0.01
0.728TyrHis: 0.728 ± 0.006
1.262TyrIle: 1.262 ± 0.007
1.533TyrLys: 1.533 ± 0.029
2.513TyrLeu: 2.513 ± 0.014
0.551TyrMet: 0.551 ± 0.006
1.013TyrAsn: 1.013 ± 0.007
1.252TyrPro: 1.252 ± 0.008
1.223TyrGln: 1.223 ± 0.007
1.61TyrArg: 1.61 ± 0.012
2.14TyrSer: 2.14 ± 0.01
1.382TyrThr: 1.382 ± 0.009
1.47TyrVal: 1.47 ± 0.009
0.329TyrTrp: 0.329 ± 0.005
0.891TyrTyr: 0.891 ± 0.006
0.003TyrXaa: 0.003 ± 0.0
Xaa
0.008XaaAla: 0.008 ± 0.001
0.003XaaCys: 0.003 ± 0.0
0.004XaaAsp: 0.004 ± 0.0
0.007XaaGlu: 0.007 ± 0.001
0.004XaaPhe: 0.004 ± 0.0
0.008XaaGly: 0.008 ± 0.001
0.003XaaHis: 0.003 ± 0.0
0.004XaaIle: 0.004 ± 0.0
0.006XaaLys: 0.006 ± 0.001
0.009XaaLeu: 0.009 ± 0.001
0.002XaaMet: 0.002 ± 0.0
0.003XaaAsn: 0.003 ± 0.0
0.007XaaPro: 0.007 ± 0.001
0.005XaaGln: 0.005 ± 0.0
0.006XaaArg: 0.006 ± 0.0
0.007XaaSer: 0.007 ± 0.001
0.004XaaThr: 0.004 ± 0.0
0.007XaaVal: 0.007 ± 0.001
0.001XaaTrp: 0.001 ± 0.0
0.002XaaTyr: 0.002 ± 0.0
0.022XaaXaa: 0.022 ± 0.005
Statistics based on 41868 proteins (27548336 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski