Amino acid dipeptide frequency for Phascolarctos cinereus (Koala)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
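As a rough illustration of how such a frequency could be computed, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of overlapping (sliding-window) pairs, the normalization by the total number of pairs, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs never span two proteins, and any letter outside the
    20 standard ones is counted as 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Slide a window of width 2 along the sequence.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not koala proteins).
freqs = dipeptide_permille(["MKLLA", "LLUA"])
```

With the toy input, the two sequences contribute 4 and 3 pairs respectively, so every frequency is a multiple of 1000/7 ‰.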

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
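The arithmetic behind the expected value and the per mille scale can be sketched as follows. The helper names `expected_permille` and `classify` are hypothetical, and whether the page's colouring uses the 400-dipeptide or the 441-dipeptide baseline is not stated here, so the 20-letter baseline is an assumption.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected):
    """Label a table value relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


expected = expected_permille(20)        # 1000 / 400 = 2.5 per mille = 0.25%
expected_with_xaa = expected_permille(21)  # 1000 / 441, about 2.27 per mille

# Two values taken from the table: LeuLeu and TrpTrp.
leu_leu = classify(10.358, expected)
trp_trp = classify(0.18, expected)

# Converting a per mille value to a plain fraction: multiply by 10^-3.
leu_leu_fraction = 10.358 * 1e-3
```

Under this baseline LeuLeu comes out as overrepresented and TrpTrp as underrepresented.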

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.275 ± 0.038
AlaCys: 1.239 ± 0.007
AlaAsp: 2.703 ± 0.013
AlaGlu: 4.486 ± 0.019
AlaPhe: 2.675 ± 0.016
AlaGly: 4.31 ± 0.029
AlaHis: 1.418 ± 0.009
AlaIle: 2.813 ± 0.013
AlaLys: 3.38 ± 0.017
AlaLeu: 6.648 ± 0.027
AlaMet: 1.459 ± 0.008
AlaAsn: 2.024 ± 0.011
AlaPro: 3.793 ± 0.024
AlaGln: 3.035 ± 0.015
AlaArg: 3.252 ± 0.014
AlaSer: 5.669 ± 0.022
AlaThr: 3.366 ± 0.012
AlaVal: 4.309 ± 0.014
AlaTrp: 0.713 ± 0.006
AlaTyr: 1.453 ± 0.008
AlaXaa: 0.005 ± 0.0
Cys
CysAla: 1.133 ± 0.009
CysCys: 0.589 ± 0.008
CysAsp: 0.986 ± 0.01
CysGlu: 1.203 ± 0.011
CysPhe: 0.819 ± 0.007
CysGly: 1.733 ± 0.019
CysHis: 0.639 ± 0.007
CysIle: 0.963 ± 0.008
CysLys: 1.114 ± 0.009
CysLeu: 2.068 ± 0.012
CysMet: 0.406 ± 0.004
CysAsn: 0.925 ± 0.012
CysPro: 1.29 ± 0.014
CysGln: 1.037 ± 0.01
CysArg: 1.156 ± 0.009
CysSer: 2.046 ± 0.018
CysThr: 1.042 ± 0.009
CysVal: 1.176 ± 0.009
CysTrp: 0.271 ± 0.004
CysTyr: 0.561 ± 0.005
CysXaa: 0.002 ± 0.0
Asp
AspAla: 2.618 ± 0.013
AspCys: 1.008 ± 0.009
AspAsp: 2.684 ± 0.015
AspGlu: 3.511 ± 0.015
AspPhe: 2.104 ± 0.01
AspGly: 3.19 ± 0.016
AspHis: 1.139 ± 0.008
AspIle: 2.661 ± 0.013
AspLys: 2.599 ± 0.013
AspLeu: 4.966 ± 0.018
AspMet: 1.109 ± 0.007
AspAsn: 1.753 ± 0.011
AspPro: 2.836 ± 0.015
AspGln: 1.895 ± 0.009
AspArg: 2.383 ± 0.013
AspSer: 4.285 ± 0.02
AspThr: 2.448 ± 0.013
AspVal: 2.981 ± 0.015
AspTrp: 0.6 ± 0.006
AspTyr: 1.492 ± 0.008
AspXaa: 0.004 ± 0.0
Glu
GluAla: 5.033 ± 0.025
GluCys: 1.6 ± 0.023
GluAsp: 4.427 ± 0.017
GluGlu: 8.304 ± 0.044
GluPhe: 2.013 ± 0.01
GluGly: 4.217 ± 0.018
GluHis: 1.516 ± 0.012
GluIle: 3.334 ± 0.017
GluLys: 5.876 ± 0.033
GluLeu: 6.398 ± 0.026
GluMet: 1.77 ± 0.01
GluAsn: 3.355 ± 0.015
GluPro: 3.175 ± 0.017
GluGln: 3.237 ± 0.02
GluArg: 4.175 ± 0.022
GluSer: 4.719 ± 0.019
GluThr: 3.685 ± 0.025
GluVal: 4.109 ± 0.017
GluTrp: 0.697 ± 0.006
GluTyr: 1.61 ± 0.011
GluXaa: 0.006 ± 0.001
Phe
PheAla: 1.867 ± 0.01
PheCys: 0.921 ± 0.007
PheAsp: 1.666 ± 0.01
PheGlu: 1.995 ± 0.01
PhePhe: 1.609 ± 0.011
PheGly: 2.138 ± 0.013
PheHis: 1.078 ± 0.008
PheIle: 1.87 ± 0.012
PheLys: 1.77 ± 0.01
PheLeu: 4.012 ± 0.019
PheMet: 0.772 ± 0.006
PheAsn: 1.369 ± 0.007
PhePro: 2.047 ± 0.013
PheGln: 1.779 ± 0.01
PheArg: 1.928 ± 0.013
PheSer: 3.593 ± 0.02
PheThr: 2.011 ± 0.011
PheVal: 2.048 ± 0.01
PheTrp: 0.487 ± 0.005
PheTyr: 1.182 ± 0.008
PheXaa: 0.005 ± 0.0
Gly
GlyAla: 4.061 ± 0.02
GlyCys: 1.186 ± 0.009
GlyAsp: 3.082 ± 0.016
GlyGlu: 4.226 ± 0.022
GlyPhe: 2.294 ± 0.013
GlyGly: 4.971 ± 0.028
GlyHis: 1.625 ± 0.011
GlyIle: 2.896 ± 0.017
GlyLys: 3.975 ± 0.021
GlyLeu: 5.563 ± 0.024
GlyMet: 1.432 ± 0.018
GlyAsn: 2.481 ± 0.013
GlyPro: 3.933 ± 0.033
GlyGln: 2.742 ± 0.016
GlyArg: 3.527 ± 0.017
GlySer: 5.837 ± 0.026
GlyThr: 3.534 ± 0.015
GlyVal: 3.393 ± 0.02
GlyTrp: 0.746 ± 0.007
GlyTyr: 1.674 ± 0.011
GlyXaa: 0.009 ± 0.001
His
HisAla: 1.229 ± 0.008
HisCys: 0.691 ± 0.007
HisAsp: 0.873 ± 0.007
HisGlu: 1.33 ± 0.009
HisPhe: 1.066 ± 0.007
HisGly: 1.463 ± 0.009
HisHis: 0.957 ± 0.009
HisIle: 1.304 ± 0.008
HisLys: 1.342 ± 0.009
HisLeu: 2.883 ± 0.013
HisMet: 0.578 ± 0.006
HisAsn: 0.877 ± 0.006
HisPro: 1.659 ± 0.012
HisGln: 1.575 ± 0.016
HisArg: 1.544 ± 0.009
HisSer: 2.397 ± 0.011
HisThr: 1.596 ± 0.016
HisVal: 1.418 ± 0.009
HisTrp: 0.343 ± 0.004
HisTyr: 0.804 ± 0.006
HisXaa: 0.003 ± 0.0
Ile
IleAla: 2.641 ± 0.014
IleCys: 1.084 ± 0.009
IleAsp: 2.075 ± 0.011
IleGlu: 2.703 ± 0.016
IlePhe: 1.95 ± 0.011
IleGly: 2.302 ± 0.012
IleHis: 1.535 ± 0.015
IleIle: 2.52 ± 0.014
IleLys: 2.72 ± 0.016
IleLeu: 4.758 ± 0.021
IleMet: 1.021 ± 0.006
IleAsn: 1.88 ± 0.01
IlePro: 2.839 ± 0.015
IleGln: 2.464 ± 0.012
IleArg: 2.411 ± 0.011
IleSer: 4.023 ± 0.015
IleThr: 2.7 ± 0.018
IleVal: 2.519 ± 0.015
IleTrp: 0.523 ± 0.005
IleTyr: 1.423 ± 0.009
IleXaa: 0.005 ± 0.0
Lys
LysAla: 4.063 ± 0.021
LysCys: 1.163 ± 0.011
LysAsp: 3.276 ± 0.017
LysGlu: 5.495 ± 0.028
LysPhe: 1.721 ± 0.01
LysGly: 3.316 ± 0.023
LysHis: 1.451 ± 0.01
LysIle: 2.913 ± 0.015
LysLys: 5.088 ± 0.029
LysLeu: 5.324 ± 0.021
LysMet: 1.525 ± 0.011
LysAsn: 2.509 ± 0.013
LysPro: 3.37 ± 0.019
LysGln: 2.733 ± 0.016
LysArg: 3.413 ± 0.017
LysSer: 4.174 ± 0.021
LysThr: 3.265 ± 0.016
LysVal: 3.556 ± 0.017
LysTrp: 0.613 ± 0.006
LysTyr: 1.573 ± 0.01
LysXaa: 0.006 ± 0.0
Leu
LeuAla: 6.302 ± 0.024
LeuCys: 2.071 ± 0.012
LeuAsp: 4.694 ± 0.018
LeuGlu: 7.291 ± 0.028
LeuPhe: 3.318 ± 0.017
LeuGly: 5.631 ± 0.021
LeuHis: 2.724 ± 0.015
LeuIle: 4.001 ± 0.019
LeuLys: 6.015 ± 0.025
LeuLeu: 10.358 ± 0.046
LeuMet: 2.006 ± 0.012
LeuAsn: 3.611 ± 0.013
LeuPro: 5.97 ± 0.022
LeuGln: 5.677 ± 0.024
LeuArg: 5.54 ± 0.024
LeuSer: 8.202 ± 0.031
LeuThr: 5.176 ± 0.018
LeuVal: 5.234 ± 0.02
LeuTrp: 1.101 ± 0.008
LeuTyr: 2.493 ± 0.013
LeuXaa: 0.01 ± 0.001
Met
MetAla: 1.862 ± 0.01
MetCys: 0.383 ± 0.004
MetAsp: 1.278 ± 0.008
MetGlu: 1.975 ± 0.01
MetPhe: 0.744 ± 0.006
MetGly: 1.386 ± 0.014
MetHis: 0.452 ± 0.004
MetIle: 0.912 ± 0.007
MetLys: 1.529 ± 0.009
MetLeu: 1.998 ± 0.009
MetMet: 0.592 ± 0.006
MetAsn: 0.936 ± 0.007
MetPro: 1.089 ± 0.01
MetGln: 0.922 ± 0.007
MetArg: 1.043 ± 0.007
MetSer: 1.621 ± 0.01
MetThr: 1.163 ± 0.008
MetVal: 1.367 ± 0.008
MetTrp: 0.233 ± 0.003
MetTyr: 0.594 ± 0.006
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.053 ± 0.011
AsnCys: 0.827 ± 0.007
AsnAsp: 1.607 ± 0.012
AsnGlu: 2.478 ± 0.013
AsnPhe: 1.482 ± 0.009
AsnGly: 2.537 ± 0.013
AsnHis: 0.987 ± 0.007
AsnIle: 2.192 ± 0.011
AsnLys: 2.349 ± 0.013
AsnLeu: 3.848 ± 0.015
AsnMet: 0.942 ± 0.006
AsnAsn: 1.589 ± 0.01
AsnPro: 2.251 ± 0.013
AsnGln: 1.813 ± 0.011
AsnArg: 1.897 ± 0.01
AsnSer: 3.383 ± 0.016
AsnThr: 2.036 ± 0.011
AsnVal: 2.234 ± 0.012
AsnTrp: 0.455 ± 0.005
AsnTyr: 1.13 ± 0.008
AsnXaa: 0.004 ± 0.0
Pro
ProAla: 4.452 ± 0.027
ProCys: 1.1 ± 0.012
ProAsp: 2.743 ± 0.017
ProGlu: 4.453 ± 0.023
ProPhe: 1.973 ± 0.011
ProGly: 4.918 ± 0.045
ProHis: 1.473 ± 0.01
ProIle: 2.081 ± 0.013
ProLys: 2.956 ± 0.024
ProLeu: 5.255 ± 0.02
ProMet: 1.068 ± 0.007
ProAsn: 1.929 ± 0.01
ProPro: 6.141 ± 0.045
ProGln: 2.848 ± 0.018
ProArg: 3.182 ± 0.017
ProSer: 5.986 ± 0.029
ProThr: 3.244 ± 0.026
ProVal: 3.756 ± 0.02
ProTrp: 0.669 ± 0.007
ProTyr: 1.586 ± 0.013
ProXaa: 0.01 ± 0.001
Gln
GlnAla: 3.27 ± 0.017
GlnCys: 0.941 ± 0.01
GlnAsp: 2.304 ± 0.011
GlnGlu: 3.982 ± 0.019
GlnPhe: 1.354 ± 0.008
GlnGly: 2.83 ± 0.016
GlnHis: 1.341 ± 0.01
GlnIle: 2.117 ± 0.01
GlnLys: 3.112 ± 0.016
GlnLeu: 4.795 ± 0.023
GlnMet: 1.17 ± 0.007
GlnAsn: 1.982 ± 0.011
GlnPro: 2.813 ± 0.019
GlnGln: 3.287 ± 0.031
GlnArg: 3.023 ± 0.016
GlnSer: 3.355 ± 0.014
GlnThr: 2.437 ± 0.013
GlnVal: 2.76 ± 0.013
GlnTrp: 0.53 ± 0.005
GlnTyr: 1.167 ± 0.008
GlnXaa: 0.004 ± 0.0
Arg
ArgAla: 3.417 ± 0.014
ArgCys: 1.106 ± 0.009
ArgAsp: 2.668 ± 0.014
ArgGlu: 3.971 ± 0.018
ArgPhe: 1.758 ± 0.009
ArgGly: 3.412 ± 0.018
ArgHis: 1.493 ± 0.009
ArgIle: 2.586 ± 0.015
ArgLys: 3.765 ± 0.015
ArgLeu: 5.18 ± 0.024
ArgMet: 1.16 ± 0.007
ArgAsn: 2.179 ± 0.01
ArgPro: 3.063 ± 0.016
ArgGln: 2.599 ± 0.016
ArgArg: 4.162 ± 0.024
ArgSer: 4.345 ± 0.025
ArgThr: 2.789 ± 0.013
ArgVal: 2.909 ± 0.015
ArgTrp: 0.634 ± 0.006
ArgTyr: 1.437 ± 0.009
ArgXaa: 0.006 ± 0.0
Ser
SerAla: 5.108 ± 0.019
SerCys: 1.812 ± 0.013
SerAsp: 4.012 ± 0.019
SerGlu: 5.66 ± 0.038
SerPhe: 3.129 ± 0.013
SerGly: 5.632 ± 0.023
SerHis: 2.236 ± 0.011
SerIle: 3.459 ± 0.016
SerLys: 4.356 ± 0.019
SerLeu: 8.446 ± 0.026
SerMet: 1.662 ± 0.009
SerAsn: 2.905 ± 0.014
SerPro: 6.578 ± 0.044
SerGln: 4.165 ± 0.021
SerArg: 4.516 ± 0.024
SerSer: 10.347 ± 0.049
SerThr: 4.79 ± 0.022
SerVal: 4.873 ± 0.017
SerTrp: 1.05 ± 0.008
SerTyr: 2.131 ± 0.011
SerXaa: 0.008 ± 0.001
Thr
ThrAla: 3.703 ± 0.024
ThrCys: 1.212 ± 0.011
ThrAsp: 2.513 ± 0.013
ThrGlu: 3.847 ± 0.031
ThrPhe: 2.151 ± 0.011
ThrGly: 3.644 ± 0.02
ThrHis: 1.284 ± 0.009
ThrIle: 2.512 ± 0.014
ThrLys: 2.79 ± 0.016
ThrLeu: 5.285 ± 0.016
ThrMet: 1.163 ± 0.008
ThrAsn: 1.815 ± 0.01
ThrPro: 3.595 ± 0.03
ThrGln: 2.317 ± 0.012
ThrArg: 2.437 ± 0.012
ThrSer: 4.909 ± 0.022
ThrThr: 3.17 ± 0.049
ThrVal: 3.836 ± 0.017
ThrTrp: 0.678 ± 0.006
ThrTyr: 1.422 ± 0.009
ThrXaa: 0.007 ± 0.001
Val
ValAla: 3.889 ± 0.013
ValCys: 1.328 ± 0.01
ValAsp: 2.79 ± 0.014
ValGlu: 3.771 ± 0.016
ValPhe: 2.365 ± 0.013
ValGly: 3.201 ± 0.015
ValHis: 1.522 ± 0.009
ValIle: 2.986 ± 0.015
ValLys: 3.474 ± 0.019
ValLeu: 5.866 ± 0.02
ValMet: 1.328 ± 0.009
ValAsn: 2.298 ± 0.011
ValPro: 3.549 ± 0.018
ValGln: 2.656 ± 0.011
ValArg: 2.803 ± 0.012
ValSer: 4.863 ± 0.017
ValThr: 3.722 ± 0.017
ValVal: 3.765 ± 0.018
ValTrp: 0.657 ± 0.006
ValTyr: 1.563 ± 0.009
ValXaa: 0.005 ± 0.0
Trp
TrpAla: 0.72 ± 0.005
TrpCys: 0.23 ± 0.003
TrpAsp: 0.62 ± 0.005
TrpGlu: 0.808 ± 0.008
TrpPhe: 0.43 ± 0.004
TrpGly: 0.694 ± 0.007
TrpHis: 0.293 ± 0.003
TrpIle: 0.563 ± 0.006
TrpLys: 0.821 ± 0.006
TrpLeu: 1.151 ± 0.008
TrpMet: 0.305 ± 0.004
TrpAsn: 0.561 ± 0.005
TrpPro: 0.512 ± 0.005
TrpGln: 0.503 ± 0.005
TrpArg: 0.688 ± 0.005
TrpSer: 0.854 ± 0.008
TrpThr: 0.643 ± 0.006
TrpVal: 0.629 ± 0.005
TrpTrp: 0.18 ± 0.003
TrpTyr: 0.323 ± 0.004
TrpXaa: 0.001 ± 0.0
Tyr
TyrAla: 1.323 ± 0.008
TyrCys: 0.668 ± 0.006
TyrAsp: 1.26 ± 0.01
TyrGlu: 1.759 ± 0.013
TyrPhe: 1.186 ± 0.008
TyrGly: 1.655 ± 0.009
TyrHis: 0.767 ± 0.006
TyrIle: 1.408 ± 0.009
TyrLys: 1.513 ± 0.012
TyrLeu: 2.612 ± 0.013
TyrMet: 0.61 ± 0.005
TyrAsn: 1.113 ± 0.008
TyrPro: 1.311 ± 0.008
TyrGln: 1.288 ± 0.009
TyrArg: 1.554 ± 0.009
TyrSer: 2.259 ± 0.011
TyrThr: 1.464 ± 0.009
TyrVal: 1.521 ± 0.009
TyrTrp: 0.349 ± 0.005
TyrTyr: 0.921 ± 0.007
TyrXaa: 0.003 ± 0.0
Xaa
XaaAla: 0.007 ± 0.001
XaaCys: 0.003 ± 0.0
XaaAsp: 0.005 ± 0.0
XaaGlu: 0.007 ± 0.001
XaaPhe: 0.005 ± 0.0
XaaGly: 0.009 ± 0.001
XaaHis: 0.003 ± 0.0
XaaIle: 0.004 ± 0.001
XaaLys: 0.006 ± 0.0
XaaLeu: 0.011 ± 0.001
XaaMet: 0.002 ± 0.0
XaaAsn: 0.003 ± 0.0
XaaPro: 0.01 ± 0.001
XaaGln: 0.005 ± 0.0
XaaArg: 0.005 ± 0.0
XaaSer: 0.007 ± 0.001
XaaThr: 0.004 ± 0.0
XaaVal: 0.005 ± 0.0
XaaTrp: 0.001 ± 0.0
XaaTyr: 0.003 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 37679 proteins (24402630 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
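A protein-level bootstrap of this kind might look like the sketch below. This is only an illustration under stated assumptions, not the procedure used by the database: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names `pair_permille` and `bootstrap_error`, the fixed seed, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics


def pair_permille(sequences, pair):
    """Frequency of one dipeptide (one-letter code, e.g. 'LL') in per mille."""
    hits = 0
    total = 0
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):  # overlapping adjacent residues
            total += 1
            if a + b == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not koala proteins).
toy = ["MKLLA", "LLSA", "GGLLK", "AAAA"]
err = bootstrap_error(toy, "LL")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects protein-to-protein variation.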

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski