Amino acid dipeptide frequency for Flavobacteriaceae bacterium 144Ye

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
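As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the one-letter sequence encoding, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (hypothetical, not taken from this proteome).
toy_proteins = ["MKLLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Because the pairs are ordered, `KL` and `LK` are counted separately, which matches the table below, where for example AlaCys and CysAla have different values.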

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
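The arithmetic behind the 0.25% baseline can be sketched as follows. The three observed values are copied from the table below; the function `classify` and the strict greater-than/less-than threshold are assumptions for illustration, since the page does not state exactly how the red and blue marking is decided.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation: 1 / (20 * 20) = 0.0025 = 0.25% = 2.5 per mille.
expected_permille = 1000.0 / STANDARD_AMINO_ACIDS ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Observed values in per mille, taken from the table.
examples = {"LeuLeu": 8.39, "AlaAla": 3.752, "CysTrp": 0.067}
labels = {name: classify(value) for name, value in examples.items()}

for name, value in examples.items():
    ratio = value / expected_permille
    print(f"{name}: {value} per mille, {ratio:.2f}x expected, {labels[name]}")
```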

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
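A minimal sketch of the unit conversion, using the AlaAla entry from the table below (3.752 ± 0.073) as input; the helper name `permille_to_fraction` is made up for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


# AlaAla entry from the table: value and bootstrap error, both in per mille.
ala_ala_permille = 3.752
ala_ala_error_permille = 0.073

fraction = permille_to_fraction(ala_ala_permille)
error_fraction = permille_to_fraction(ala_ala_error_permille)
percent = fraction * 100.0

print(f"fraction: {fraction:.6f} +/- {error_fraction:.6f}")
print(f"percent:  {percent:.4f}%")
```

The error scales by the same factor as the value, so the relative uncertainty is unchanged by the conversion.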

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.752 ± 0.073
AlaCys: 0.54 ± 0.028
AlaAsp: 3.161 ± 0.065
AlaGlu: 4.001 ± 0.08
AlaPhe: 3.476 ± 0.065
AlaGly: 3.68 ± 0.082
AlaHis: 1.085 ± 0.038
AlaIle: 5.435 ± 0.083
AlaLys: 4.667 ± 0.086
AlaLeu: 6.348 ± 0.098
AlaMet: 1.469 ± 0.045
AlaAsn: 3.691 ± 0.066
AlaPro: 1.781 ± 0.055
AlaGln: 2.413 ± 0.05
AlaArg: 1.76 ± 0.049
AlaSer: 4.23 ± 0.076
AlaThr: 3.855 ± 0.094
AlaVal: 3.993 ± 0.071
AlaTrp: 0.571 ± 0.026
AlaTyr: 2.54 ± 0.057
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.468 ± 0.023
CysCys: 0.115 ± 0.011
CysAsp: 0.499 ± 0.028
CysGlu: 0.516 ± 0.029
CysPhe: 0.428 ± 0.021
CysGly: 0.67 ± 0.037
CysHis: 0.171 ± 0.015
CysIle: 0.597 ± 0.028
CysLys: 0.479 ± 0.02
CysLeu: 0.673 ± 0.024
CysMet: 0.146 ± 0.013
CysAsn: 0.453 ± 0.025
CysPro: 0.339 ± 0.023
CysGln: 0.204 ± 0.015
CysArg: 0.178 ± 0.012
CysSer: 0.596 ± 0.03
CysThr: 0.397 ± 0.029
CysVal: 0.482 ± 0.022
CysTrp: 0.067 ± 0.009
CysTyr: 0.287 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.973 ± 0.068
AspCys: 0.493 ± 0.027
AspAsp: 3.545 ± 0.068
AspGlu: 4.018 ± 0.074
AspPhe: 3.607 ± 0.068
AspGly: 3.749 ± 0.08
AspHis: 0.819 ± 0.025
AspIle: 4.728 ± 0.08
AspLys: 4.22 ± 0.077
AspLeu: 5.11 ± 0.071
AspMet: 1.183 ± 0.037
AspAsn: 3.672 ± 0.062
AspPro: 1.542 ± 0.045
AspGln: 1.335 ± 0.037
AspArg: 1.707 ± 0.043
AspSer: 3.359 ± 0.072
AspThr: 3.204 ± 0.056
AspVal: 4.129 ± 0.079
AspTrp: 0.755 ± 0.026
AspTyr: 3.107 ± 0.053
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.692 ± 0.09
GluCys: 0.38 ± 0.024
GluAsp: 3.971 ± 0.065
GluGlu: 4.7 ± 0.104
GluPhe: 3.174 ± 0.062
GluGly: 3.587 ± 0.055
GluHis: 1.354 ± 0.039
GluIle: 5.282 ± 0.083
GluLys: 5.193 ± 0.098
GluLeu: 6.194 ± 0.091
GluMet: 1.456 ± 0.043
GluAsn: 4.496 ± 0.074
GluPro: 1.52 ± 0.041
GluGln: 2.391 ± 0.053
GluArg: 2.462 ± 0.059
GluSer: 3.436 ± 0.058
GluThr: 4.066 ± 0.062
GluVal: 4.339 ± 0.081
GluTrp: 0.604 ± 0.029
GluTyr: 2.431 ± 0.052
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.822 ± 0.06
PheCys: 0.458 ± 0.023
PheAsp: 3.385 ± 0.058
PheGlu: 3.533 ± 0.06
PhePhe: 2.575 ± 0.072
PheGly: 3.631 ± 0.071
PheHis: 0.829 ± 0.027
PheIle: 3.738 ± 0.074
PheLys: 4.007 ± 0.069
PheLeu: 4.491 ± 0.078
PheMet: 1.105 ± 0.037
PheAsn: 3.713 ± 0.069
PhePro: 1.63 ± 0.038
PheGln: 1.502 ± 0.039
PheArg: 1.423 ± 0.04
PheSer: 4.086 ± 0.077
PheThr: 3.208 ± 0.06
PheVal: 3.011 ± 0.067
PheTrp: 0.547 ± 0.026
PheTyr: 2.234 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.851 ± 0.079
GlyCys: 0.584 ± 0.035
GlyAsp: 3.444 ± 0.075
GlyGlu: 3.624 ± 0.067
GlyPhe: 3.606 ± 0.067
GlyGly: 4.152 ± 0.102
GlyHis: 1.097 ± 0.036
GlyIle: 4.976 ± 0.082
GlyLys: 4.573 ± 0.077
GlyLeu: 5.483 ± 0.088
GlyMet: 1.49 ± 0.043
GlyAsn: 3.688 ± 0.086
GlyPro: 1.216 ± 0.042
GlyGln: 1.849 ± 0.045
GlyArg: 1.893 ± 0.055
GlySer: 3.937 ± 0.083
GlyThr: 4.123 ± 0.109
GlyVal: 4.312 ± 0.08
GlyTrp: 0.699 ± 0.028
GlyTyr: 2.755 ± 0.059
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.888 ± 0.032
HisCys: 0.194 ± 0.014
HisAsp: 0.874 ± 0.032
HisGlu: 0.989 ± 0.034
HisPhe: 1.129 ± 0.037
HisGly: 1.02 ± 0.033
HisHis: 0.483 ± 0.025
HisIle: 1.506 ± 0.036
HisLys: 1.272 ± 0.039
HisLeu: 1.835 ± 0.042
HisMet: 0.332 ± 0.019
HisAsn: 1.149 ± 0.033
HisPro: 0.828 ± 0.035
HisGln: 0.733 ± 0.028
HisArg: 0.596 ± 0.03
HisSer: 1.026 ± 0.033
HisThr: 0.955 ± 0.031
HisVal: 0.965 ± 0.037
HisTrp: 0.217 ± 0.017
HisTyr: 0.868 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.485 ± 0.092
IleCys: 0.607 ± 0.024
IleAsp: 5.205 ± 0.082
IleGlu: 5.569 ± 0.097
IlePhe: 3.365 ± 0.067
IleGly: 4.916 ± 0.086
IleHis: 1.23 ± 0.036
IleIle: 5.959 ± 0.093
IleLys: 6.003 ± 0.1
IleLeu: 6.661 ± 0.104
IleMet: 1.326 ± 0.038
IleAsn: 5.06 ± 0.086
IlePro: 3.079 ± 0.056
IleGln: 2.422 ± 0.053
IleArg: 2.204 ± 0.048
IleSer: 5.629 ± 0.086
IleThr: 5.041 ± 0.088
IleVal: 5.003 ± 0.075
IleTrp: 0.632 ± 0.026
IleTyr: 2.869 ± 0.056
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.234 ± 0.09
LysCys: 0.338 ± 0.02
LysAsp: 4.754 ± 0.086
LysGlu: 5.581 ± 0.104
LysPhe: 2.834 ± 0.06
LysGly: 4.26 ± 0.073
LysHis: 1.66 ± 0.038
LysIle: 5.587 ± 0.104
LysLys: 6.231 ± 0.11
LysLeu: 6.801 ± 0.097
LysMet: 1.776 ± 0.047
LysAsn: 4.919 ± 0.087
LysPro: 2.521 ± 0.053
LysGln: 3.163 ± 0.066
LysArg: 3.098 ± 0.061
LysSer: 4.646 ± 0.083
LysThr: 5.097 ± 0.086
LysVal: 4.757 ± 0.074
LysTrp: 0.691 ± 0.026
LysTyr: 2.868 ± 0.061
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.399 ± 0.08
LeuCys: 0.695 ± 0.03
LeuAsp: 5.109 ± 0.069
LeuGlu: 5.993 ± 0.083
LeuPhe: 4.889 ± 0.092
LeuGly: 5.656 ± 0.086
LeuHis: 1.534 ± 0.042
LeuIle: 6.89 ± 0.105
LeuLys: 8.105 ± 0.112
LeuLeu: 8.39 ± 0.134
LeuMet: 2.024 ± 0.052
LeuAsn: 6.06 ± 0.09
LeuPro: 3.376 ± 0.058
LeuGln: 3.21 ± 0.062
LeuArg: 2.931 ± 0.065
LeuSer: 6.702 ± 0.088
LeuThr: 5.132 ± 0.067
LeuVal: 5.55 ± 0.077
LeuTrp: 0.753 ± 0.03
LeuTyr: 3.162 ± 0.06
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.546 ± 0.042
MetCys: 0.131 ± 0.012
MetAsp: 1.105 ± 0.033
MetGlu: 1.269 ± 0.036
MetPhe: 0.894 ± 0.031
MetGly: 1.138 ± 0.038
MetHis: 0.404 ± 0.02
MetIle: 1.467 ± 0.044
MetLys: 2.1 ± 0.054
MetLeu: 2.001 ± 0.052
MetMet: 0.547 ± 0.025
MetAsn: 1.279 ± 0.035
MetPro: 0.782 ± 0.03
MetGln: 0.839 ± 0.03
MetArg: 0.791 ± 0.026
MetSer: 1.489 ± 0.039
MetThr: 1.2 ± 0.031
MetVal: 1.346 ± 0.036
MetTrp: 0.139 ± 0.012
MetTyr: 0.732 ± 0.029
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.122 ± 0.069
AsnCys: 0.549 ± 0.028
AsnAsp: 3.74 ± 0.069
AsnGlu: 4.044 ± 0.078
AsnPhe: 3.069 ± 0.063
AsnGly: 4.158 ± 0.088
AsnHis: 1.099 ± 0.035
AsnIle: 4.987 ± 0.077
AsnLys: 4.44 ± 0.081
AsnLeu: 5.556 ± 0.088
AsnMet: 1.337 ± 0.039
AsnAsn: 4.547 ± 0.09
AsnPro: 2.81 ± 0.057
AsnGln: 2.219 ± 0.047
AsnArg: 2.042 ± 0.049
AsnSer: 4.182 ± 0.074
AsnThr: 4.37 ± 0.094
AsnVal: 3.851 ± 0.069
AsnTrp: 0.804 ± 0.032
AsnTyr: 3.035 ± 0.064
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.62 ± 0.043
ProCys: 0.213 ± 0.015
ProAsp: 1.93 ± 0.052
ProGlu: 2.815 ± 0.055
ProPhe: 1.805 ± 0.049
ProGly: 1.677 ± 0.05
ProHis: 0.602 ± 0.023
ProIle: 2.574 ± 0.053
ProLys: 2.561 ± 0.056
ProLeu: 2.818 ± 0.052
ProMet: 0.691 ± 0.028
ProAsn: 2.408 ± 0.056
ProPro: 0.765 ± 0.039
ProGln: 1.128 ± 0.033
ProArg: 0.863 ± 0.031
ProSer: 2.208 ± 0.055
ProThr: 2.048 ± 0.052
ProVal: 2.218 ± 0.056
ProTrp: 0.292 ± 0.018
ProTyr: 1.387 ± 0.037
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.963 ± 0.046
GlnCys: 0.184 ± 0.016
GlnAsp: 1.907 ± 0.046
GlnGlu: 2.222 ± 0.052
GlnPhe: 1.776 ± 0.043
GlnGly: 1.763 ± 0.044
GlnHis: 0.694 ± 0.028
GlnIle: 2.556 ± 0.052
GlnLys: 2.599 ± 0.057
GlnLeu: 3.611 ± 0.07
GlnMet: 0.807 ± 0.029
GlnAsn: 2.237 ± 0.05
GlnPro: 1.236 ± 0.034
GlnGln: 1.577 ± 0.047
GlnArg: 1.242 ± 0.034
GlnSer: 2.001 ± 0.041
GlnThr: 2.033 ± 0.055
GlnVal: 1.957 ± 0.047
GlnTrp: 0.331 ± 0.018
GlnTyr: 1.407 ± 0.043
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.968 ± 0.045
ArgCys: 0.187 ± 0.012
ArgAsp: 1.677 ± 0.046
ArgGlu: 2.025 ± 0.052
ArgPhe: 1.848 ± 0.047
ArgGly: 1.722 ± 0.046
ArgHis: 0.587 ± 0.024
ArgIle: 2.626 ± 0.054
ArgLys: 2.494 ± 0.056
ArgLeu: 3.101 ± 0.062
ArgMet: 0.767 ± 0.027
ArgAsn: 1.877 ± 0.042
ArgPro: 1.009 ± 0.031
ArgGln: 1.127 ± 0.036
ArgArg: 1.236 ± 0.039
ArgSer: 1.718 ± 0.041
ArgThr: 1.782 ± 0.045
ArgVal: 2.098 ± 0.053
ArgTrp: 0.346 ± 0.017
ArgTyr: 1.494 ± 0.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.767 ± 0.076
SerCys: 0.657 ± 0.033
SerAsp: 3.631 ± 0.055
SerGlu: 4.038 ± 0.07
SerPhe: 3.633 ± 0.07
SerGly: 4.711 ± 0.089
SerHis: 1.122 ± 0.031
SerIle: 5.418 ± 0.077
SerLys: 5.306 ± 0.086
SerLeu: 5.835 ± 0.091
SerMet: 1.28 ± 0.035
SerAsn: 4.377 ± 0.082
SerPro: 2.053 ± 0.045
SerGln: 2.269 ± 0.049
SerArg: 1.919 ± 0.045
SerSer: 4.351 ± 0.093
SerThr: 3.892 ± 0.075
SerVal: 4.242 ± 0.08
SerTrp: 0.719 ± 0.033
SerTyr: 2.793 ± 0.064
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.775 ± 0.085
ThrCys: 0.421 ± 0.026
ThrAsp: 3.339 ± 0.062
ThrGlu: 3.596 ± 0.06
ThrPhe: 3.285 ± 0.054
ThrGly: 3.845 ± 0.088
ThrHis: 1.109 ± 0.034
ThrIle: 5.48 ± 0.088
ThrLys: 4.017 ± 0.075
ThrLeu: 5.774 ± 0.08
ThrMet: 1.029 ± 0.033
ThrAsn: 3.789 ± 0.072
ThrPro: 2.443 ± 0.065
ThrGln: 2.081 ± 0.062
ThrArg: 1.609 ± 0.04
ThrSer: 4.268 ± 0.086
ThrThr: 4.021 ± 0.091
ThrVal: 4.3 ± 0.104
ThrTrp: 0.627 ± 0.03
ThrTyr: 2.742 ± 0.074
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.183 ± 0.068
ValCys: 0.569 ± 0.023
ValAsp: 3.829 ± 0.072
ValGlu: 4.187 ± 0.063
ValPhe: 3.56 ± 0.071
ValGly: 3.829 ± 0.081
ValHis: 0.963 ± 0.033
ValIle: 4.905 ± 0.075
ValLys: 4.545 ± 0.071
ValLeu: 6.192 ± 0.08
ValMet: 1.377 ± 0.038
ValAsn: 3.893 ± 0.064
ValPro: 2.093 ± 0.046
ValGln: 1.698 ± 0.043
ValArg: 1.848 ± 0.045
ValSer: 4.685 ± 0.075
ValThr: 3.956 ± 0.124
ValVal: 4.576 ± 0.081
ValTrp: 0.599 ± 0.025
ValTyr: 2.617 ± 0.054
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.539 ± 0.024
TrpCys: 0.096 ± 0.01
TrpAsp: 0.585 ± 0.024
TrpGlu: 0.632 ± 0.028
TrpPhe: 0.547 ± 0.024
TrpGly: 0.572 ± 0.031
TrpHis: 0.209 ± 0.015
TrpIle: 0.71 ± 0.029
TrpLys: 0.749 ± 0.034
TrpLeu: 0.933 ± 0.028
TrpMet: 0.286 ± 0.017
TrpAsn: 0.715 ± 0.029
TrpPro: 0.193 ± 0.016
TrpGln: 0.373 ± 0.02
TrpArg: 0.402 ± 0.022
TrpSer: 0.669 ± 0.028
TrpThr: 0.586 ± 0.031
TrpVal: 0.635 ± 0.026
TrpTrp: 0.148 ± 0.013
TrpTyr: 0.446 ± 0.022
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.398 ± 0.047
TyrCys: 0.356 ± 0.02
TyrAsp: 2.567 ± 0.057
TyrGlu: 2.374 ± 0.048
TyrPhe: 2.427 ± 0.053
TyrGly: 2.617 ± 0.057
TyrHis: 0.808 ± 0.029
TyrIle: 2.841 ± 0.059
TyrLys: 3.254 ± 0.058
TyrLeu: 3.755 ± 0.065
TyrMet: 0.77 ± 0.031
TyrAsn: 2.94 ± 0.063
TyrPro: 1.406 ± 0.039
TyrGln: 1.574 ± 0.045
TyrArg: 1.487 ± 0.038
TyrSer: 2.794 ± 0.062
TyrThr: 2.583 ± 0.065
TyrVal: 2.34 ± 0.051
TyrTrp: 0.485 ± 0.027
TyrTyr: 1.98 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2918 proteins (1003851 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
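Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed on each resample, and the spread of the 100 estimates is reported. This is only an illustration under stated assumptions: the function names, the one-letter toy sequences, the fixed seed, and the use of the standard deviation as the reported error are choices made for this example and are not confirmed by the source.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over all sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of the estimate when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(resample, pair))
    # Assumption: the error is taken as the sample standard deviation.
    return statistics.stdev(estimates)


# Toy proteome in one-letter code (hypothetical data).
toy_proteins = ["MKLLAG", "MKAGLL", "MLLKKA", "MAGKLA"]
point = dipeptide_permille(toy_proteins, "LL")
error = bootstrap_error(toy_proteins, "LL")
print(f"LL: {point:.1f} +/- {error:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps each protein's residues together, so correlations within a protein are preserved in every replicate.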

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski