Amino acid dipepetide frequency for Oryzias javanicus (Javanese ricefish) (Aplocheilus javanicus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.121AlaAla: 7.121 ± 0.05
1.304AlaCys: 1.304 ± 0.014
3.372AlaAsp: 3.372 ± 0.019
4.913AlaGlu: 4.913 ± 0.031
2.421AlaPhe: 2.421 ± 0.018
4.698AlaGly: 4.698 ± 0.032
1.506AlaHis: 1.506 ± 0.014
2.476AlaIle: 2.476 ± 0.018
3.309AlaLys: 3.309 ± 0.023
6.575AlaLeu: 6.575 ± 0.035
1.51AlaMet: 1.51 ± 0.013
2.081AlaAsn: 2.081 ± 0.015
3.881AlaPro: 3.881 ± 0.027
2.917AlaGln: 2.917 ± 0.023
3.228AlaArg: 3.228 ± 0.021
5.778AlaSer: 5.778 ± 0.036
3.273AlaThr: 3.273 ± 0.022
5.047AlaVal: 5.047 ± 0.023
0.667AlaTrp: 0.667 ± 0.009
1.389AlaTyr: 1.389 ± 0.012
0.0AlaXaa: 0.0 ± 0.0
Cys
1.216CysAla: 1.216 ± 0.013
0.695CysCys: 0.695 ± 0.011
1.14CysAsp: 1.14 ± 0.015
1.27CysGlu: 1.27 ± 0.017
0.877CysPhe: 0.877 ± 0.008
1.598CysGly: 1.598 ± 0.02
0.65CysHis: 0.65 ± 0.008
0.912CysIle: 0.912 ± 0.009
1.137CysLys: 1.137 ± 0.014
2.132CysLeu: 2.132 ± 0.017
0.455CysMet: 0.455 ± 0.006
0.799CysAsn: 0.799 ± 0.01
1.346CysPro: 1.346 ± 0.026
1.062CysGln: 1.062 ± 0.012
1.399CysArg: 1.399 ± 0.013
2.339CysSer: 2.339 ± 0.022
1.131CysThr: 1.131 ± 0.013
1.485CysVal: 1.485 ± 0.016
0.317CysTrp: 0.317 ± 0.005
0.582CysTyr: 0.582 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
3.17AspAla: 3.17 ± 0.024
1.168AspCys: 1.168 ± 0.013
2.99AspAsp: 2.99 ± 0.032
3.771AspGlu: 3.771 ± 0.021
2.181AspPhe: 2.181 ± 0.017
3.841AspGly: 3.841 ± 0.031
1.163AspHis: 1.163 ± 0.011
2.441AspIle: 2.441 ± 0.02
2.591AspLys: 2.591 ± 0.02
5.026AspLeu: 5.026 ± 0.025
1.24AspMet: 1.24 ± 0.01
1.759AspAsn: 1.759 ± 0.016
2.968AspPro: 2.968 ± 0.02
1.998AspGln: 1.998 ± 0.014
2.776AspArg: 2.776 ± 0.022
4.583AspSer: 4.583 ± 0.025
2.433AspThr: 2.433 ± 0.016
3.312AspVal: 3.312 ± 0.019
0.659AspTrp: 0.659 ± 0.009
1.441AspTyr: 1.441 ± 0.013
0.0AspXaa: 0.0 ± 0.0
Glu
4.892GluAla: 4.892 ± 0.032
1.213GluCys: 1.213 ± 0.018
4.609GluAsp: 4.609 ± 0.025
8.281GluGlu: 8.281 ± 0.065
1.947GluPhe: 1.947 ± 0.013
4.1GluGly: 4.1 ± 0.024
1.429GluHis: 1.429 ± 0.012
2.739GluIle: 2.739 ± 0.02
4.776GluLys: 4.776 ± 0.041
6.194GluLeu: 6.194 ± 0.039
1.738GluMet: 1.738 ± 0.015
2.811GluAsn: 2.811 ± 0.019
3.031GluPro: 3.031 ± 0.021
3.119GluGln: 3.119 ± 0.022
4.3GluArg: 4.3 ± 0.032
4.552GluSer: 4.552 ± 0.022
3.455GluThr: 3.455 ± 0.022
4.282GluVal: 4.282 ± 0.022
0.682GluTrp: 0.682 ± 0.009
1.482GluTyr: 1.482 ± 0.016
0.0GluXaa: 0.0 ± 0.0
Phe
1.874PheAla: 1.874 ± 0.017
0.992PheCys: 0.992 ± 0.009
1.729PheAsp: 1.729 ± 0.013
1.848PheGlu: 1.848 ± 0.015
1.581PhePhe: 1.581 ± 0.016
2.228PheGly: 2.228 ± 0.018
1.031PheHis: 1.031 ± 0.01
1.842PheIle: 1.842 ± 0.016
1.802PheLys: 1.802 ± 0.014
3.913PheLeu: 3.913 ± 0.027
0.818PheMet: 0.818 ± 0.01
1.418PheAsn: 1.418 ± 0.012
1.882PhePro: 1.882 ± 0.016
1.68PheGln: 1.68 ± 0.012
1.968PheArg: 1.968 ± 0.017
3.55PheSer: 3.55 ± 0.02
2.237PheThr: 2.237 ± 0.017
2.237PheVal: 2.237 ± 0.019
0.509PheTrp: 0.509 ± 0.011
1.188PheTyr: 1.188 ± 0.013
0.0PheXaa: 0.0 ± 0.0
Gly
4.377GlyAla: 4.377 ± 0.03
1.319GlyCys: 1.319 ± 0.013
3.181GlyAsp: 3.181 ± 0.023
4.116GlyGlu: 4.116 ± 0.027
2.497GlyPhe: 2.497 ± 0.018
5.646GlyGly: 5.646 ± 0.042
1.614GlyHis: 1.614 ± 0.013
2.467GlyIle: 2.467 ± 0.017
3.455GlyLys: 3.455 ± 0.023
5.517GlyLeu: 5.517 ± 0.032
1.389GlyMet: 1.389 ± 0.013
2.369GlyAsn: 2.369 ± 0.018
3.399GlyPro: 3.399 ± 0.045
2.71GlyGln: 2.71 ± 0.021
3.82GlyArg: 3.82 ± 0.023
6.03GlySer: 6.03 ± 0.031
3.267GlyThr: 3.267 ± 0.025
4.085GlyVal: 4.085 ± 0.026
0.782GlyTrp: 0.782 ± 0.01
1.717GlyTyr: 1.717 ± 0.015
0.0GlyXaa: 0.0 ± 0.0
His
1.417HisAla: 1.417 ± 0.013
0.724HisCys: 0.724 ± 0.01
0.948HisAsp: 0.948 ± 0.008
1.195HisGlu: 1.195 ± 0.011
1.06HisPhe: 1.06 ± 0.011
1.561HisGly: 1.561 ± 0.013
1.061HisHis: 1.061 ± 0.016
1.203HisIle: 1.203 ± 0.01
1.269HisLys: 1.269 ± 0.011
2.777HisLeu: 2.777 ± 0.02
0.657HisMet: 0.657 ± 0.009
0.951HisAsn: 0.951 ± 0.011
1.64HisPro: 1.64 ± 0.015
1.313HisGln: 1.313 ± 0.013
1.681HisArg: 1.681 ± 0.013
2.402HisSer: 2.402 ± 0.019
1.429HisThr: 1.429 ± 0.013
1.436HisVal: 1.436 ± 0.011
0.337HisTrp: 0.337 ± 0.005
0.786HisTyr: 0.786 ± 0.01
0.0HisXaa: 0.0 ± 0.0
Ile
2.292IleAla: 2.292 ± 0.015
1.038IleCys: 1.038 ± 0.01
1.902IleAsp: 1.902 ± 0.014
2.199IleGlu: 2.199 ± 0.018
1.737IlePhe: 1.737 ± 0.018
2.164IleGly: 2.164 ± 0.017
1.205IleHis: 1.205 ± 0.013
2.222IleIle: 2.222 ± 0.02
2.461IleLys: 2.461 ± 0.019
4.104IleLeu: 4.104 ± 0.022
0.979IleMet: 0.979 ± 0.011
1.813IleAsn: 1.813 ± 0.014
2.407IlePro: 2.407 ± 0.019
2.151IleGln: 2.151 ± 0.016
2.394IleArg: 2.394 ± 0.014
3.718IleSer: 3.718 ± 0.019
2.585IleThr: 2.585 ± 0.021
2.381IleVal: 2.381 ± 0.021
0.461IleTrp: 0.461 ± 0.007
1.29IleTyr: 1.29 ± 0.012
0.0IleXaa: 0.0 ± 0.0
Lys
3.794LysAla: 3.794 ± 0.025
0.996LysCys: 0.996 ± 0.012
3.219LysAsp: 3.219 ± 0.024
4.802LysGlu: 4.802 ± 0.042
1.573LysPhe: 1.573 ± 0.013
3.14LysGly: 3.14 ± 0.031
1.388LysHis: 1.388 ± 0.012
2.404LysIle: 2.404 ± 0.016
4.548LysLys: 4.548 ± 0.038
4.899LysLeu: 4.899 ± 0.027
1.501LysMet: 1.501 ± 0.013
2.278LysAsn: 2.278 ± 0.015
2.988LysPro: 2.988 ± 0.025
2.546LysGln: 2.546 ± 0.021
3.488LysArg: 3.488 ± 0.023
3.916LysSer: 3.916 ± 0.02
3.237LysThr: 3.237 ± 0.022
3.454LysVal: 3.454 ± 0.019
0.564LysTrp: 0.564 ± 0.008
1.385LysTyr: 1.385 ± 0.014
0.0LysXaa: 0.0 ± 0.0
Leu
5.828LeuAla: 5.828 ± 0.027
2.26LeuCys: 2.26 ± 0.018
4.803LeuAsp: 4.803 ± 0.022
6.451LeuGlu: 6.451 ± 0.042
3.362LeuPhe: 3.362 ± 0.024
5.068LeuGly: 5.068 ± 0.027
2.778LeuHis: 2.778 ± 0.019
3.63LeuIle: 3.63 ± 0.021
5.706LeuLys: 5.706 ± 0.034
10.384LeuLeu: 10.384 ± 0.057
2.147LeuMet: 2.147 ± 0.015
3.54LeuAsn: 3.54 ± 0.019
5.417LeuPro: 5.417 ± 0.028
5.768LeuGln: 5.768 ± 0.039
5.78LeuArg: 5.78 ± 0.026
8.495LeuSer: 8.495 ± 0.042
5.293LeuThr: 5.293 ± 0.024
5.353LeuVal: 5.353 ± 0.027
1.115LeuTrp: 1.115 ± 0.011
2.455LeuTyr: 2.455 ± 0.019
0.0LeuXaa: 0.0 ± 0.0
Met
1.829MetAla: 1.829 ± 0.015
0.459MetCys: 0.459 ± 0.006
1.406MetAsp: 1.406 ± 0.014
2.023MetGlu: 2.023 ± 0.017
0.841MetPhe: 0.841 ± 0.01
1.37MetGly: 1.37 ± 0.013
0.47MetHis: 0.47 ± 0.006
0.852MetIle: 0.852 ± 0.008
1.518MetLys: 1.518 ± 0.012
2.029MetLeu: 2.029 ± 0.017
0.713MetMet: 0.713 ± 0.01
0.882MetAsn: 0.882 ± 0.01
1.034MetPro: 1.034 ± 0.012
0.985MetGln: 0.985 ± 0.01
1.218MetArg: 1.218 ± 0.013
1.914MetSer: 1.914 ± 0.013
1.282MetThr: 1.282 ± 0.011
1.482MetVal: 1.482 ± 0.011
0.273MetTrp: 0.273 ± 0.005
0.575MetTyr: 0.575 ± 0.007
0.0MetXaa: 0.0 ± 0.0
Asn
2.117AsnAla: 2.117 ± 0.015
0.865AsnCys: 0.865 ± 0.01
1.599AsnAsp: 1.599 ± 0.015
2.028AsnGlu: 2.028 ± 0.015
1.419AsnPhe: 1.419 ± 0.014
2.645AsnGly: 2.645 ± 0.019
0.988AsnHis: 0.988 ± 0.01
2.003AsnIle: 2.003 ± 0.014
2.173AsnLys: 2.173 ± 0.015
3.586AsnLeu: 3.586 ± 0.023
0.991AsnMet: 0.991 ± 0.01
1.719AsnAsn: 1.719 ± 0.015
2.291AsnPro: 2.291 ± 0.015
1.84AsnGln: 1.84 ± 0.022
2.01AsnArg: 2.01 ± 0.014
3.233AsnSer: 3.233 ± 0.019
2.087AsnThr: 2.087 ± 0.016
2.214AsnVal: 2.214 ± 0.017
0.431AsnTrp: 0.431 ± 0.006
1.066AsnTyr: 1.066 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
4.637ProAla: 4.637 ± 0.031
1.111ProCys: 1.111 ± 0.018
3.009ProAsp: 3.009 ± 0.018
3.936ProGlu: 3.936 ± 0.022
1.826ProPhe: 1.826 ± 0.016
4.234ProGly: 4.234 ± 0.061
1.556ProHis: 1.556 ± 0.014
1.777ProIle: 1.777 ± 0.018
2.616ProLys: 2.616 ± 0.027
4.934ProLeu: 4.934 ± 0.024
0.985ProMet: 0.985 ± 0.012
1.929ProAsn: 1.929 ± 0.016
5.886ProPro: 5.886 ± 0.057
2.742ProGln: 2.742 ± 0.022
2.909ProArg: 2.909 ± 0.021
5.82ProSer: 5.82 ± 0.035
3.059ProThr: 3.059 ± 0.022
3.748ProVal: 3.748 ± 0.02
0.555ProTrp: 0.555 ± 0.008
1.319ProTyr: 1.319 ± 0.014
0.0ProXaa: 0.0 ± 0.0
Gln
3.108GlnAla: 3.108 ± 0.023
0.919GlnCys: 0.919 ± 0.013
2.32GlnAsp: 2.32 ± 0.018
3.677GlnGlu: 3.677 ± 0.028
1.35GlnPhe: 1.35 ± 0.012
2.605GlnGly: 2.605 ± 0.02
1.362GlnHis: 1.362 ± 0.012
1.922GlnIle: 1.922 ± 0.015
2.736GlnLys: 2.736 ± 0.022
4.545GlnLeu: 4.545 ± 0.03
1.167GlnMet: 1.167 ± 0.013
1.89GlnAsn: 1.89 ± 0.017
2.579GlnPro: 2.579 ± 0.023
3.341GlnGln: 3.341 ± 0.041
3.247GlnArg: 3.247 ± 0.022
3.624GlnSer: 3.624 ± 0.02
2.827GlnThr: 2.827 ± 0.028
2.733GlnVal: 2.733 ± 0.017
0.541GlnTrp: 0.541 ± 0.007
1.12GlnTyr: 1.12 ± 0.009
0.0GlnXaa: 0.0 ± 0.0
Arg
3.678ArgAla: 3.678 ± 0.022
1.332ArgCys: 1.332 ± 0.015
2.928ArgAsp: 2.928 ± 0.021
3.921ArgGlu: 3.921 ± 0.026
2.024ArgPhe: 2.024 ± 0.014
3.769ArgGly: 3.769 ± 0.027
1.569ArgHis: 1.569 ± 0.014
2.344ArgIle: 2.344 ± 0.016
3.568ArgLys: 3.568 ± 0.024
5.459ArgLeu: 5.459 ± 0.026
1.338ArgMet: 1.338 ± 0.013
2.117ArgAsn: 2.117 ± 0.014
3.075ArgPro: 3.075 ± 0.021
2.674ArgGln: 2.674 ± 0.02
5.007ArgArg: 5.007 ± 0.036
5.044ArgSer: 5.044 ± 0.027
3.06ArgThr: 3.06 ± 0.017
3.349ArgVal: 3.349 ± 0.019
0.731ArgTrp: 0.731 ± 0.01
1.439ArgTyr: 1.439 ± 0.013
0.0ArgXaa: 0.0 ± 0.0
Ser
6.073SerAla: 6.073 ± 0.032
2.176SerCys: 2.176 ± 0.02
4.507SerAsp: 4.507 ± 0.031
5.182SerGlu: 5.182 ± 0.027
3.267SerPhe: 3.267 ± 0.02
5.878SerGly: 5.878 ± 0.032
2.234SerHis: 2.234 ± 0.019
3.297SerIle: 3.297 ± 0.021
4.103SerLys: 4.103 ± 0.024
8.418SerLeu: 8.418 ± 0.038
1.856SerMet: 1.856 ± 0.012
3.039SerAsn: 3.039 ± 0.029
6.115SerPro: 6.115 ± 0.042
3.947SerGln: 3.947 ± 0.023
4.974SerArg: 4.974 ± 0.026
11.223SerSer: 11.223 ± 0.074
4.821SerThr: 4.821 ± 0.028
5.549SerVal: 5.549 ± 0.026
1.08SerTrp: 1.08 ± 0.012
2.051SerTyr: 2.051 ± 0.015
0.0SerXaa: 0.0 ± 0.0
Thr
4.067ThrAla: 4.067 ± 0.021
1.369ThrCys: 1.369 ± 0.021
2.821ThrAsp: 2.821 ± 0.018
3.727ThrGlu: 3.727 ± 0.02
2.112ThrPhe: 2.112 ± 0.017
3.545ThrGly: 3.545 ± 0.024
1.307ThrHis: 1.307 ± 0.014
2.244ThrIle: 2.244 ± 0.016
2.656ThrLys: 2.656 ± 0.018
5.234ThrLeu: 5.234 ± 0.026
1.162ThrMet: 1.162 ± 0.01
1.939ThrAsn: 1.939 ± 0.015
3.682ThrPro: 3.682 ± 0.03
2.311ThrGln: 2.311 ± 0.017
2.515ThrArg: 2.515 ± 0.015
4.884ThrSer: 4.884 ± 0.028
3.245ThrThr: 3.245 ± 0.04
4.057ThrVal: 4.057 ± 0.035
0.67ThrTrp: 0.67 ± 0.009
1.323ThrTyr: 1.323 ± 0.012
0.0ThrXaa: 0.0 ± 0.0
Val
3.987ValAla: 3.987 ± 0.022
1.692ValCys: 1.692 ± 0.017
3.109ValAsp: 3.109 ± 0.016
4.073ValGlu: 4.073 ± 0.022
2.653ValPhe: 2.653 ± 0.019
3.474ValGly: 3.474 ± 0.021
1.535ValHis: 1.535 ± 0.011
2.843ValIle: 2.843 ± 0.021
3.556ValLys: 3.556 ± 0.023
6.307ValLeu: 6.307 ± 0.029
1.518ValMet: 1.518 ± 0.011
2.328ValAsn: 2.328 ± 0.018
3.279ValPro: 3.279 ± 0.021
2.855ValGln: 2.855 ± 0.019
3.291ValArg: 3.291 ± 0.018
5.439ValSer: 5.439 ± 0.037
3.906ValThr: 3.906 ± 0.029
4.318ValVal: 4.318 ± 0.024
0.797ValTrp: 0.797 ± 0.01
1.709ValTyr: 1.709 ± 0.015
0.0ValXaa: 0.0 ± 0.0
Trp
0.659TrpAla: 0.659 ± 0.008
0.248TrpCys: 0.248 ± 0.005
0.637TrpAsp: 0.637 ± 0.008
0.75TrpGlu: 0.75 ± 0.009
0.478TrpPhe: 0.478 ± 0.007
0.635TrpGly: 0.635 ± 0.01
0.256TrpHis: 0.256 ± 0.005
0.582TrpIle: 0.582 ± 0.008
0.745TrpLys: 0.745 ± 0.008
1.153TrpLeu: 1.153 ± 0.012
0.365TrpMet: 0.365 ± 0.008
0.511TrpAsn: 0.511 ± 0.007
0.438TrpPro: 0.438 ± 0.007
0.465TrpGln: 0.465 ± 0.007
0.823TrpArg: 0.823 ± 0.009
1.018TrpSer: 1.018 ± 0.012
0.758TrpThr: 0.758 ± 0.01
0.681TrpVal: 0.681 ± 0.01
0.192TrpTrp: 0.192 ± 0.004
0.324TrpTyr: 0.324 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.331TyrAla: 1.331 ± 0.013
0.664TyrCys: 0.664 ± 0.01
1.283TyrAsp: 1.283 ± 0.012
1.471TyrGlu: 1.471 ± 0.013
1.121TyrPhe: 1.121 ± 0.011
1.559TyrGly: 1.559 ± 0.017
0.741TyrHis: 0.741 ± 0.009
1.305TyrIle: 1.305 ± 0.014
1.392TyrLys: 1.392 ± 0.014
2.448TyrLeu: 2.448 ± 0.019
0.625TyrMet: 0.625 ± 0.007
1.131TyrAsn: 1.131 ± 0.011
1.252TyrPro: 1.252 ± 0.013
1.186TyrGln: 1.186 ± 0.012
1.594TyrArg: 1.594 ± 0.013
2.231TyrSer: 2.231 ± 0.015
1.482TyrThr: 1.482 ± 0.013
1.464TyrVal: 1.464 ± 0.014
0.357TyrTrp: 0.357 ± 0.007
0.893TyrTyr: 0.893 ± 0.011
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21400 proteins (11474022 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski