Amino acid dipeptide frequency for Musa acuminata subsp. malaccensis (Wild banana) (Musa malaccensis)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
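As an illustration of how such a table could be computed, here is a minimal Python sketch. It assumes one-letter sequences, overlapping residue pairs that never span two proteins, and that any non-standard letter is mapped to X (Xaa); the page does not state how the database actually counts, and the names `dipeptide_per_mille` and `label` are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard residues
UNIFORM_PER_MILLE = 1000.0 / 400        # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def label(freq_per_mille):
    """Classify a frequency against the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


freqs = dipeptide_per_mille(["ALLA", "LLB"])
print(freqs["LL"], label(freqs["LL"]))
```

The uniform 2.5‰ baseline mirrors the 0.25% figure in the text; a baseline built from the single amino acid frequencies would be a different, stricter comparison.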

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.876 ± 0.043
AlaCys: 1.48 ± 0.012
AlaAsp: 3.61 ± 0.021
AlaGlu: 4.628 ± 0.022
AlaPhe: 2.905 ± 0.017
AlaGly: 4.98 ± 0.025
AlaHis: 1.487 ± 0.012
AlaIle: 3.801 ± 0.017
AlaLys: 3.72 ± 0.019
AlaLeu: 7.209 ± 0.028
AlaMet: 1.949 ± 0.013
AlaAsn: 2.469 ± 0.014
AlaPro: 3.394 ± 0.019
AlaGln: 2.16 ± 0.015
AlaArg: 4.176 ± 0.024
AlaSer: 6.876 ± 0.023
AlaThr: 4.069 ± 0.019
AlaVal: 5.581 ± 0.028
AlaTrp: 0.871 ± 0.01
AlaTyr: 1.923 ± 0.014
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.156 ± 0.01
CysCys: 0.643 ± 0.009
CysAsp: 0.916 ± 0.009
CysGlu: 0.853 ± 0.009
CysPhe: 0.951 ± 0.01
CysGly: 1.483 ± 0.012
CysHis: 0.566 ± 0.006
CysIle: 1.026 ± 0.009
CysLys: 1.041 ± 0.011
CysLeu: 2.011 ± 0.013
CysMet: 0.506 ± 0.007
CysAsn: 0.799 ± 0.009
CysPro: 1.023 ± 0.011
CysGln: 0.636 ± 0.008
CysArg: 1.302 ± 0.012
CysSer: 2.058 ± 0.016
CysThr: 0.924 ± 0.009
CysVal: 1.114 ± 0.011
CysTrp: 0.282 ± 0.004
CysTyr: 0.584 ± 0.007
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.053 ± 0.022
AspCys: 0.953 ± 0.01
AspAsp: 3.568 ± 0.024
AspGlu: 3.773 ± 0.022
AspPhe: 2.13 ± 0.016
AspGly: 4.078 ± 0.022
AspHis: 1.289 ± 0.009
AspIle: 2.752 ± 0.016
AspLys: 2.382 ± 0.017
AspLeu: 5.168 ± 0.023
AspMet: 1.303 ± 0.01
AspAsn: 1.783 ± 0.014
AspPro: 2.777 ± 0.017
AspGln: 1.623 ± 0.011
AspArg: 2.695 ± 0.017
AspSer: 3.897 ± 0.021
AspThr: 2.113 ± 0.014
AspVal: 3.596 ± 0.019
AspTrp: 0.725 ± 0.008
AspTyr: 1.432 ± 0.013
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.085 ± 0.026
GluCys: 0.87 ± 0.008
GluAsp: 3.643 ± 0.023
GluGlu: 5.971 ± 0.04
GluPhe: 2.149 ± 0.014
GluGly: 3.694 ± 0.019
GluHis: 1.335 ± 0.01
GluIle: 3.34 ± 0.021
GluLys: 4.151 ± 0.027
GluLeu: 5.785 ± 0.03
GluMet: 1.687 ± 0.014
GluAsn: 2.483 ± 0.019
GluPro: 2.137 ± 0.017
GluGln: 2.159 ± 0.016
GluArg: 3.687 ± 0.023
GluSer: 4.169 ± 0.021
GluThr: 2.818 ± 0.015
GluVal: 3.985 ± 0.022
GluTrp: 0.715 ± 0.008
GluTyr: 1.531 ± 0.012
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.704 ± 0.016
PheCys: 0.963 ± 0.01
PheAsp: 2.285 ± 0.014
PheGlu: 2.044 ± 0.013
PhePhe: 2.072 ± 0.015
PheGly: 2.99 ± 0.017
PheHis: 1.144 ± 0.01
PheIle: 1.892 ± 0.014
PheLys: 1.715 ± 0.012
PheLeu: 4.443 ± 0.024
PheMet: 0.969 ± 0.009
PheAsn: 1.43 ± 0.011
PhePro: 2.079 ± 0.014
PheGln: 1.43 ± 0.011
PheArg: 2.193 ± 0.013
PheSer: 3.89 ± 0.017
PheThr: 1.811 ± 0.013
PheVal: 2.741 ± 0.015
PheTrp: 0.566 ± 0.007
PheTyr: 1.221 ± 0.011
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.53 ± 0.024
GlyCys: 1.441 ± 0.012
GlyAsp: 3.546 ± 0.019
GlyGlu: 3.695 ± 0.018
GlyPhe: 3.049 ± 0.018
GlyGly: 6.051 ± 0.035
GlyHis: 1.673 ± 0.013
GlyIle: 3.4 ± 0.016
GlyLys: 3.706 ± 0.022
GlyLeu: 6.079 ± 0.029
GlyMet: 1.569 ± 0.013
GlyAsn: 2.816 ± 0.019
GlyPro: 2.616 ± 0.017
GlyGln: 2.063 ± 0.015
GlyArg: 4.44 ± 0.023
GlySer: 6.183 ± 0.028
GlyThr: 3.355 ± 0.016
GlyVal: 4.238 ± 0.019
GlyTrp: 0.966 ± 0.009
GlyTyr: 1.993 ± 0.014
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.663 ± 0.011
HisCys: 0.561 ± 0.007
HisAsp: 1.213 ± 0.011
HisGlu: 1.304 ± 0.011
HisPhe: 1.037 ± 0.011
HisGly: 1.96 ± 0.013
HisHis: 1.063 ± 0.012
HisIle: 1.188 ± 0.01
HisLys: 1.107 ± 0.011
HisLeu: 2.65 ± 0.019
HisMet: 0.599 ± 0.008
HisAsn: 0.883 ± 0.01
HisPro: 1.538 ± 0.011
HisGln: 1.052 ± 0.011
HisArg: 1.721 ± 0.013
HisSer: 1.961 ± 0.015
HisThr: 0.975 ± 0.008
HisVal: 1.608 ± 0.012
HisTrp: 0.329 ± 0.005
HisTyr: 0.697 ± 0.007
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.586 ± 0.022
IleCys: 1.138 ± 0.01
IleAsp: 2.662 ± 0.015
IleGlu: 2.774 ± 0.016
IlePhe: 2.096 ± 0.014
IleGly: 3.247 ± 0.021
IleHis: 1.295 ± 0.01
IleIle: 2.616 ± 0.018
IleLys: 2.573 ± 0.015
IleLeu: 5.015 ± 0.023
IleMet: 1.149 ± 0.008
IleAsn: 1.891 ± 0.014
IlePro: 2.723 ± 0.018
IleGln: 1.812 ± 0.014
IleArg: 2.692 ± 0.016
IleSer: 4.552 ± 0.021
IleThr: 2.456 ± 0.015
IleVal: 3.222 ± 0.018
IleTrp: 0.653 ± 0.007
IleTyr: 1.533 ± 0.014
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.901 ± 0.022
LysCys: 0.872 ± 0.009
LysAsp: 2.877 ± 0.018
LysGlu: 3.988 ± 0.026
LysPhe: 1.79 ± 0.013
LysGly: 3.246 ± 0.019
LysHis: 1.306 ± 0.011
LysIle: 2.785 ± 0.019
LysLys: 4.017 ± 0.026
LysLeu: 5.306 ± 0.025
LysMet: 1.342 ± 0.011
LysAsn: 2.17 ± 0.014
LysPro: 2.511 ± 0.017
LysGln: 2.061 ± 0.014
LysArg: 3.4 ± 0.019
LysSer: 3.905 ± 0.023
LysThr: 2.433 ± 0.014
LysVal: 3.381 ± 0.018
LysTrp: 0.703 ± 0.008
LysTyr: 1.452 ± 0.012
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.2 ± 0.027
LeuCys: 1.983 ± 0.013
LeuAsp: 5.093 ± 0.025
LeuGlu: 5.986 ± 0.03
LeuPhe: 3.977 ± 0.02
LeuGly: 5.979 ± 0.024
LeuHis: 2.842 ± 0.016
LeuIle: 4.381 ± 0.021
LeuLys: 5.349 ± 0.029
LeuLeu: 10.89 ± 0.043
LeuMet: 2.245 ± 0.015
LeuAsn: 3.436 ± 0.016
LeuPro: 5.601 ± 0.026
LeuGln: 4.353 ± 0.021
LeuArg: 6.259 ± 0.03
LeuSer: 8.697 ± 0.043
LeuThr: 4.303 ± 0.023
LeuVal: 6.481 ± 0.022
LeuTrp: 1.214 ± 0.011
LeuTyr: 2.495 ± 0.018
LeuXaa: 0.001 ± 0.0
Met
MetAla: 2.321 ± 0.017
MetCys: 0.354 ± 0.006
MetAsp: 1.446 ± 0.012
MetGlu: 1.951 ± 0.015
MetPhe: 0.819 ± 0.009
MetGly: 1.622 ± 0.012
MetHis: 0.595 ± 0.007
MetIle: 1.209 ± 0.011
MetLys: 1.436 ± 0.011
MetLeu: 2.29 ± 0.014
MetMet: 0.683 ± 0.009
MetAsn: 0.939 ± 0.009
MetPro: 1.135 ± 0.01
MetGln: 0.952 ± 0.009
MetArg: 1.317 ± 0.01
MetSer: 1.765 ± 0.013
MetThr: 1.125 ± 0.009
MetVal: 1.66 ± 0.01
MetTrp: 0.281 ± 0.005
MetTyr: 0.595 ± 0.006
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.493 ± 0.013
AsnCys: 0.783 ± 0.008
AsnAsp: 1.763 ± 0.012
AsnGlu: 2.045 ± 0.014
AsnPhe: 1.608 ± 0.012
AsnGly: 2.73 ± 0.018
AsnHis: 0.997 ± 0.009
AsnIle: 2.149 ± 0.015
AsnLys: 1.95 ± 0.014
AsnLeu: 4.126 ± 0.03
AsnMet: 1.013 ± 0.01
AsnAsn: 1.683 ± 0.016
AsnPro: 2.115 ± 0.015
AsnGln: 1.424 ± 0.011
AsnArg: 1.941 ± 0.013
AsnSer: 3.3 ± 0.019
AsnThr: 1.689 ± 0.013
AsnVal: 2.377 ± 0.014
AsnTrp: 0.528 ± 0.007
AsnTyr: 1.171 ± 0.011
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.822 ± 0.02
ProCys: 0.907 ± 0.01
ProAsp: 2.586 ± 0.015
ProGlu: 3.06 ± 0.019
ProPhe: 2.074 ± 0.014
ProGly: 2.987 ± 0.017
ProHis: 1.176 ± 0.009
ProIle: 2.159 ± 0.013
ProLys: 2.392 ± 0.013
ProLeu: 4.668 ± 0.023
ProMet: 1.017 ± 0.009
ProAsn: 1.939 ± 0.012
ProPro: 4.471 ± 0.041
ProGln: 1.767 ± 0.014
ProArg: 2.964 ± 0.016
ProSer: 5.604 ± 0.029
ProThr: 2.699 ± 0.015
ProVal: 3.223 ± 0.017
ProTrp: 0.691 ± 0.008
ProTyr: 1.285 ± 0.012
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.521 ± 0.016
GlnCys: 0.599 ± 0.008
GlnAsp: 1.537 ± 0.011
GlnGlu: 2.291 ± 0.015
GlnPhe: 1.274 ± 0.011
GlnGly: 2.051 ± 0.013
GlnHis: 0.976 ± 0.009
GlnIle: 1.868 ± 0.013
GlnLys: 2.076 ± 0.015
GlnLeu: 3.647 ± 0.019
GlnMet: 0.98 ± 0.009
GlnAsn: 1.495 ± 0.013
GlnPro: 1.757 ± 0.014
GlnGln: 2.109 ± 0.023
GlnArg: 2.252 ± 0.014
GlnSer: 2.694 ± 0.016
GlnThr: 1.603 ± 0.012
GlnVal: 2.289 ± 0.013
GlnTrp: 0.451 ± 0.006
GlnTyr: 0.87 ± 0.008
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.025 ± 0.021
ArgCys: 1.26 ± 0.012
ArgAsp: 2.774 ± 0.014
ArgGlu: 3.596 ± 0.023
ArgPhe: 2.356 ± 0.014
ArgGly: 3.775 ± 0.023
ArgHis: 1.56 ± 0.012
ArgIle: 2.936 ± 0.016
ArgLys: 3.688 ± 0.019
ArgLeu: 5.723 ± 0.025
ArgMet: 1.494 ± 0.012
ArgAsn: 2.312 ± 0.017
ArgPro: 2.958 ± 0.015
ArgGln: 2.027 ± 0.014
ArgArg: 5.332 ± 0.031
ArgSer: 5.279 ± 0.023
ArgThr: 2.728 ± 0.015
ArgVal: 3.486 ± 0.018
ArgTrp: 0.969 ± 0.009
ArgTyr: 1.564 ± 0.012
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.184 ± 0.026
SerCys: 1.944 ± 0.017
SerAsp: 4.358 ± 0.022
SerGlu: 4.442 ± 0.025
SerPhe: 3.914 ± 0.019
SerGly: 6.048 ± 0.028
SerHis: 2.108 ± 0.013
SerIle: 4.269 ± 0.018
SerLys: 4.241 ± 0.023
SerLeu: 8.84 ± 0.039
SerMet: 2.133 ± 0.012
SerAsn: 3.451 ± 0.02
SerPro: 4.948 ± 0.028
SerGln: 2.792 ± 0.017
SerArg: 4.908 ± 0.024
SerSer: 11.207 ± 0.053
SerThr: 4.565 ± 0.022
SerVal: 5.162 ± 0.022
SerTrp: 1.221 ± 0.012
SerTyr: 2.288 ± 0.017
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.854 ± 0.021
ThrCys: 0.98 ± 0.009
ThrAsp: 2.289 ± 0.015
ThrGlu: 2.662 ± 0.018
ThrPhe: 1.939 ± 0.014
ThrGly: 3.417 ± 0.018
ThrHis: 1.022 ± 0.009
ThrIle: 2.5 ± 0.016
ThrLys: 2.394 ± 0.016
ThrLeu: 4.4 ± 0.019
ThrMet: 1.192 ± 0.009
ThrAsn: 1.851 ± 0.013
ThrPro: 2.525 ± 0.015
ThrGln: 1.368 ± 0.011
ThrArg: 2.513 ± 0.015
ThrSer: 4.513 ± 0.024
ThrThr: 2.844 ± 0.016
ThrVal: 3.32 ± 0.018
ThrTrp: 0.639 ± 0.008
ThrTyr: 1.344 ± 0.013
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.504 ± 0.026
ValCys: 1.234 ± 0.012
ValAsp: 3.741 ± 0.016
ValGlu: 4.108 ± 0.022
ValPhe: 2.653 ± 0.018
ValGly: 4.28 ± 0.023
ValHis: 1.595 ± 0.012
ValIle: 3.249 ± 0.019
ValLys: 3.328 ± 0.019
ValLeu: 6.422 ± 0.025
ValMet: 1.555 ± 0.012
ValAsn: 2.272 ± 0.014
ValPro: 3.397 ± 0.016
ValGln: 2.161 ± 0.013
ValArg: 3.478 ± 0.018
ValSer: 5.255 ± 0.023
ValThr: 3.188 ± 0.018
ValVal: 5.057 ± 0.022
ValTrp: 0.782 ± 0.009
ValTyr: 1.835 ± 0.012
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.853 ± 0.009
TrpCys: 0.269 ± 0.005
TrpAsp: 0.685 ± 0.009
TrpGlu: 0.75 ± 0.008
TrpPhe: 0.56 ± 0.007
TrpGly: 0.752 ± 0.009
TrpHis: 0.33 ± 0.005
TrpIle: 0.703 ± 0.008
TrpLys: 0.859 ± 0.009
TrpLeu: 1.288 ± 0.011
TrpMet: 0.376 ± 0.005
TrpAsn: 0.657 ± 0.008
TrpPro: 0.568 ± 0.007
TrpGln: 0.475 ± 0.006
TrpArg: 0.976 ± 0.01
TrpSer: 1.087 ± 0.009
TrpThr: 0.664 ± 0.007
TrpVal: 0.792 ± 0.009
TrpTrp: 0.271 ± 0.005
TrpTyr: 0.35 ± 0.006
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.83 ± 0.013
TyrCys: 0.641 ± 0.008
TyrAsp: 1.501 ± 0.013
TyrGlu: 1.477 ± 0.013
TyrPhe: 1.229 ± 0.01
TyrGly: 2.039 ± 0.017
TyrHis: 0.752 ± 0.009
TyrIle: 1.46 ± 0.012
TyrLys: 1.318 ± 0.012
TyrLeu: 2.804 ± 0.018
TyrMet: 0.732 ± 0.008
TyrAsn: 1.128 ± 0.01
TyrPro: 1.22 ± 0.011
TyrGln: 0.918 ± 0.008
TyrArg: 1.595 ± 0.012
TyrSer: 2.129 ± 0.014
TyrThr: 1.204 ± 0.011
TyrVal: 1.779 ± 0.012
TyrTrp: 0.406 ± 0.006
TyrTyr: 0.953 ± 0.009
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 4.363 ± 1.356
Statistics based on 36474 proteins (12599352 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
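A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration: whole proteins are resampled with replacement 100 times, and the spread is reported as the standard deviation of the resampled frequencies. Using the standard deviation, as well as the function names and the fixed seed, are assumptions rather than details stated on this page.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins, not individual dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


proteins = ["MALLSLLK", "MSSSLLA", "MKKLLAV", "MAAGG"]
print(round(bootstrap_error(proteins, "LL"), 3))
```

Resampling proteins rather than single dipeptides keeps the pairs from one protein together, so a few repeat-rich proteins can widen the error noticeably.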

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski