Amino acid dipeptide frequency for Thermoflavimicrobium dichotomicum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
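The counting described above can be sketched in a few lines of Python. This is an illustration only, not the code used to build the table: the function names, the choice to count overlapping pairs within each protein, the normalisation by the total number of pairs, and the simple above/below comparison against the uniform expectation are all assumptions made here.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted within each protein
    (never across protein boundaries) and pooled over the whole proteome.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the expectation (hypothetical rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

With this rule, LeuLeu (10.139‰) would be labelled "over" and CysCys (0.118‰) "under".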

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
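As a small worked example of the unit conversion, the following sketch turns a per mille value into a plain fraction or a percentage. The helper names are hypothetical and only serve the illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


# LeuLeu is listed at 10.139 per mille in the table.
leu_leu_fraction = permille_to_fraction(10.139)
leu_leu_percent = permille_to_percent(10.139)
```

For LeuLeu this gives a fraction of about 0.0101, or roughly 1.01%, which is about four times the 0.25% expected if all 400 dipeptides were equally likely.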

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.5 ± 0.093
AlaCys: 0.782 ± 0.03
AlaAsp: 3.636 ± 0.29
AlaGlu: 4.948 ± 0.089
AlaPhe: 3.073 ± 0.059
AlaGly: 5.299 ± 0.084
AlaHis: 1.603 ± 0.045
AlaIle: 5.871 ± 0.071
AlaLys: 4.891 ± 0.069
AlaLeu: 7.346 ± 0.092
AlaMet: 2.032 ± 0.052
AlaAsn: 2.428 ± 0.044
AlaPro: 2.279 ± 0.049
AlaGln: 2.76 ± 0.061
AlaArg: 3.596 ± 0.066
AlaSer: 3.689 ± 0.051
AlaThr: 3.396 ± 0.064
AlaVal: 5.376 ± 0.087
AlaTrp: 0.838 ± 0.028
AlaTyr: 2.552 ± 0.058
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.542 ± 0.023
CysCys: 0.118 ± 0.01
CysAsp: 0.442 ± 0.023
CysGlu: 0.53 ± 0.022
CysPhe: 0.402 ± 0.021
CysGly: 0.798 ± 0.029
CysHis: 0.277 ± 0.017
CysIle: 0.599 ± 0.025
CysLys: 0.41 ± 0.019
CysLeu: 0.937 ± 0.031
CysMet: 0.223 ± 0.015
CysAsn: 0.298 ± 0.017
CysPro: 0.475 ± 0.026
CysGln: 0.445 ± 0.021
CysArg: 0.481 ± 0.022
CysSer: 0.572 ± 0.025
CysThr: 0.512 ± 0.022
CysVal: 0.475 ± 0.022
CysTrp: 0.115 ± 0.01
CysTyr: 0.338 ± 0.019
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.334 ± 0.277
AspCys: 0.441 ± 0.019
AspAsp: 2.034 ± 0.052
AspGlu: 3.908 ± 0.072
AspPhe: 2.057 ± 0.049
AspGly: 3.219 ± 0.06
AspHis: 1.366 ± 0.041
AspIle: 3.369 ± 0.061
AspLys: 2.299 ± 0.051
AspLeu: 5.353 ± 0.075
AspMet: 1.124 ± 0.031
AspAsn: 1.059 ± 0.036
AspPro: 2.726 ± 0.056
AspGln: 2.719 ± 0.06
AspArg: 2.739 ± 0.053
AspSer: 2.137 ± 0.053
AspThr: 1.998 ± 0.044
AspVal: 3.42 ± 0.065
AspTrp: 0.795 ± 0.027
AspTyr: 1.711 ± 0.04
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.514 ± 0.094
GluCys: 0.512 ± 0.024
GluAsp: 3.089 ± 0.06
GluGlu: 6.944 ± 0.115
GluPhe: 2.135 ± 0.041
GluGly: 4.108 ± 0.077
GluHis: 1.587 ± 0.043
GluIle: 5.456 ± 0.081
GluLys: 6.646 ± 0.104
GluLeu: 7.091 ± 0.098
GluMet: 2.35 ± 0.046
GluAsn: 2.695 ± 0.044
GluPro: 2.278 ± 0.041
GluGln: 4.07 ± 0.08
GluArg: 4.381 ± 0.074
GluSer: 3.181 ± 0.066
GluThr: 3.39 ± 0.059
GluVal: 5.19 ± 0.088
GluTrp: 1.131 ± 0.037
GluTyr: 2.124 ± 0.041
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.029 ± 0.055
PheCys: 0.415 ± 0.02
PheAsp: 2.185 ± 0.052
PheGlu: 2.387 ± 0.048
PhePhe: 2.17 ± 0.054
PheGly: 3.077 ± 0.072
PheHis: 1.18 ± 0.034
PheIle: 3.138 ± 0.058
PheLys: 1.81 ± 0.045
PheLeu: 4.381 ± 0.076
PheMet: 0.977 ± 0.028
PheAsn: 1.221 ± 0.035
PhePro: 1.823 ± 0.042
PheGln: 1.729 ± 0.045
PheArg: 2.014 ± 0.041
PheSer: 2.713 ± 0.058
PheThr: 2.263 ± 0.047
PheVal: 3.084 ± 0.062
PheTrp: 0.502 ± 0.026
PheTyr: 1.541 ± 0.039
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.719 ± 0.085
GlyCys: 0.769 ± 0.029
GlyAsp: 2.954 ± 0.052
GlyGlu: 4.552 ± 0.08
GlyPhe: 3.098 ± 0.062
GlyGly: 4.843 ± 0.078
GlyHis: 1.469 ± 0.039
GlyIle: 5.803 ± 0.078
GlyLys: 5.292 ± 0.087
GlyLeu: 6.691 ± 0.085
GlyMet: 2.135 ± 0.052
GlyAsn: 2.286 ± 0.05
GlyPro: 1.994 ± 0.05
GlyGln: 2.558 ± 0.06
GlyArg: 3.225 ± 0.057
GlySer: 3.684 ± 0.067
GlyThr: 3.802 ± 0.07
GlyVal: 5.234 ± 0.069
GlyTrp: 1.1 ± 0.038
GlyTyr: 2.682 ± 0.055
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.606 ± 0.041
HisCys: 0.269 ± 0.018
HisAsp: 0.978 ± 0.029
HisGlu: 1.428 ± 0.037
HisPhe: 1.193 ± 0.036
HisGly: 1.692 ± 0.036
HisHis: 0.805 ± 0.039
HisIle: 1.647 ± 0.053
HisLys: 0.978 ± 0.03
HisLeu: 2.851 ± 0.069
HisMet: 0.601 ± 0.023
HisAsn: 0.628 ± 0.025
HisPro: 1.684 ± 0.038
HisGln: 1.258 ± 0.037
HisArg: 1.292 ± 0.036
HisSer: 1.201 ± 0.035
HisThr: 1.053 ± 0.03
HisVal: 1.727 ± 0.041
HisTrp: 0.339 ± 0.019
HisTyr: 0.926 ± 0.033
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.991 ± 0.092
IleCys: 0.78 ± 0.027
IleAsp: 3.847 ± 0.059
IleGlu: 5.125 ± 0.073
IlePhe: 2.885 ± 0.059
IleGly: 5.518 ± 0.094
IleHis: 2.121 ± 0.041
IleIle: 4.554 ± 0.077
IleLys: 3.641 ± 0.067
IleLeu: 7.025 ± 0.082
IleMet: 1.553 ± 0.041
IleAsn: 2.361 ± 0.043
IlePro: 3.7 ± 0.054
IleGln: 3.553 ± 0.074
IleArg: 4.201 ± 0.065
IleSer: 4.521 ± 0.073
IleThr: 3.706 ± 0.068
IleVal: 4.971 ± 0.077
IleTrp: 0.867 ± 0.032
IleTyr: 2.368 ± 0.05
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.284 ± 0.072
LysCys: 0.378 ± 0.019
LysAsp: 3.004 ± 0.074
LysGlu: 6.633 ± 0.091
LysPhe: 1.506 ± 0.04
LysGly: 4.224 ± 0.066
LysHis: 1.379 ± 0.04
LysIle: 4.083 ± 0.069
LysLys: 5.891 ± 0.106
LysLeu: 5.41 ± 0.076
LysMet: 2.02 ± 0.042
LysAsn: 2.686 ± 0.053
LysPro: 2.505 ± 0.061
LysGln: 3.539 ± 0.067
LysArg: 3.781 ± 0.069
LysSer: 2.891 ± 0.064
LysThr: 3.034 ± 0.057
LysVal: 4.599 ± 0.072
LysTrp: 1.096 ± 0.038
LysTyr: 2.01 ± 0.044
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.202 ± 0.103
LeuCys: 0.912 ± 0.031
LeuAsp: 4.994 ± 0.064
LeuGlu: 6.987 ± 0.088
LeuPhe: 4.77 ± 0.075
LeuGly: 6.834 ± 0.092
LeuHis: 2.189 ± 0.048
LeuIle: 6.957 ± 0.087
LeuLys: 6.464 ± 0.095
LeuLeu: 10.139 ± 0.13
LeuMet: 2.376 ± 0.045
LeuAsn: 3.475 ± 0.064
LeuPro: 4.667 ± 0.077
LeuGln: 4.103 ± 0.072
LeuArg: 4.783 ± 0.082
LeuSer: 6.67 ± 0.078
LeuThr: 5.338 ± 0.072
LeuVal: 6.841 ± 0.085
LeuTrp: 1.04 ± 0.03
LeuTyr: 3.242 ± 0.065
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.104 ± 0.046
MetCys: 0.176 ± 0.014
MetAsp: 1.488 ± 0.04
MetGlu: 2.116 ± 0.047
MetPhe: 0.91 ± 0.031
MetGly: 1.968 ± 0.046
MetHis: 0.459 ± 0.02
MetIle: 2.235 ± 0.05
MetLys: 2.18 ± 0.05
MetLeu: 2.305 ± 0.048
MetMet: 0.827 ± 0.033
MetAsn: 1.208 ± 0.031
MetPro: 0.92 ± 0.032
MetGln: 0.991 ± 0.032
MetArg: 1.237 ± 0.038
MetSer: 1.496 ± 0.037
MetThr: 1.419 ± 0.036
MetVal: 2.008 ± 0.04
MetTrp: 0.232 ± 0.017
MetTyr: 0.674 ± 0.026
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.119 ± 0.046
AsnCys: 0.303 ± 0.016
AsnAsp: 1.555 ± 0.037
AsnGlu: 2.397 ± 0.052
AsnPhe: 1.115 ± 0.032
AsnGly: 2.47 ± 0.05
AsnHis: 1.022 ± 0.031
AsnIle: 2.432 ± 0.059
AsnLys: 2.063 ± 0.053
AsnLeu: 3.204 ± 0.052
AsnMet: 0.882 ± 0.03
AsnAsn: 1.179 ± 0.04
AsnPro: 2.167 ± 0.045
AsnGln: 2.137 ± 0.047
AsnArg: 2.064 ± 0.047
AsnSer: 1.535 ± 0.043
AsnThr: 1.582 ± 0.045
AsnVal: 2.194 ± 0.044
AsnTrp: 0.539 ± 0.023
AsnTyr: 1.143 ± 0.033
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.835 ± 0.06
ProCys: 0.29 ± 0.017
ProAsp: 2.577 ± 0.059
ProGlu: 3.643 ± 0.065
ProPhe: 2.096 ± 0.043
ProGly: 2.924 ± 0.057
ProHis: 1.164 ± 0.034
ProIle: 2.95 ± 0.054
ProLys: 2.547 ± 0.053
ProLeu: 4.125 ± 0.072
ProMet: 0.955 ± 0.031
ProAsn: 1.576 ± 0.041
ProPro: 1.568 ± 0.044
ProGln: 1.572 ± 0.047
ProArg: 1.674 ± 0.041
ProSer: 2.513 ± 0.05
ProThr: 2.195 ± 0.052
ProVal: 3.436 ± 0.06
ProTrp: 0.536 ± 0.023
ProTyr: 1.623 ± 0.04
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.884 ± 0.069
GlnCys: 0.311 ± 0.018
GlnAsp: 1.842 ± 0.044
GlnGlu: 3.466 ± 0.061
GlnPhe: 1.652 ± 0.042
GlnGly: 2.643 ± 0.057
GlnHis: 0.958 ± 0.034
GlnIle: 3.394 ± 0.065
GlnLys: 3.127 ± 0.059
GlnLeu: 4.832 ± 0.094
GlnMet: 1.357 ± 0.038
GlnAsn: 1.456 ± 0.039
GlnPro: 1.831 ± 0.055
GlnGln: 2.296 ± 0.062
GlnArg: 2.042 ± 0.049
GlnSer: 2.289 ± 0.054
GlnThr: 2.347 ± 0.057
GlnVal: 3.571 ± 0.064
GlnTrp: 0.521 ± 0.023
GlnTyr: 1.391 ± 0.035
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.007 ± 0.056
ArgCys: 0.426 ± 0.02
ArgAsp: 2.255 ± 0.049
ArgGlu: 4.069 ± 0.075
ArgPhe: 2.408 ± 0.048
ArgGly: 2.817 ± 0.051
ArgHis: 1.115 ± 0.032
ArgIle: 3.993 ± 0.057
ArgLys: 3.8 ± 0.061
ArgLeu: 5.637 ± 0.075
ArgMet: 1.616 ± 0.038
ArgAsn: 1.845 ± 0.043
ArgPro: 1.912 ± 0.046
ArgGln: 2.314 ± 0.054
ArgArg: 2.632 ± 0.054
ArgSer: 2.698 ± 0.052
ArgThr: 2.332 ± 0.05
ArgVal: 3.481 ± 0.054
ArgTrp: 0.722 ± 0.027
ArgTyr: 1.956 ± 0.046
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.416 ± 0.056
SerCys: 0.465 ± 0.022
SerAsp: 2.562 ± 0.054
SerGlu: 3.356 ± 0.065
SerPhe: 2.819 ± 0.052
SerGly: 4.054 ± 0.064
SerHis: 1.312 ± 0.035
SerIle: 4.135 ± 0.065
SerLys: 3.125 ± 0.062
SerLeu: 6.091 ± 0.085
SerMet: 1.636 ± 0.039
SerAsn: 1.823 ± 0.043
SerPro: 2.552 ± 0.051
SerGln: 2.182 ± 0.049
SerArg: 2.569 ± 0.055
SerSer: 3.613 ± 0.071
SerThr: 2.735 ± 0.056
SerVal: 3.822 ± 0.063
SerTrp: 0.687 ± 0.028
SerTyr: 1.993 ± 0.047
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.667 ± 0.056
ThrCys: 0.485 ± 0.021
ThrAsp: 2.38 ± 0.043
ThrGlu: 3.163 ± 0.056
ThrPhe: 2.237 ± 0.05
ThrGly: 4.185 ± 0.058
ThrHis: 1.231 ± 0.036
ThrIle: 3.783 ± 0.057
ThrLys: 2.728 ± 0.062
ThrLeu: 5.075 ± 0.08
ThrMet: 1.212 ± 0.027
ThrAsn: 1.691 ± 0.044
ThrPro: 2.539 ± 0.046
ThrGln: 1.665 ± 0.041
ThrArg: 2.166 ± 0.051
ThrSer: 2.886 ± 0.056
ThrThr: 2.388 ± 0.059
ThrVal: 3.804 ± 0.064
ThrTrp: 0.599 ± 0.022
ThrTyr: 1.742 ± 0.047
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.288 ± 0.079
ValCys: 0.697 ± 0.023
ValAsp: 3.703 ± 0.058
ValGlu: 5.195 ± 0.083
ValPhe: 2.914 ± 0.061
ValGly: 4.822 ± 0.075
ValHis: 1.672 ± 0.038
ValIle: 5.589 ± 0.085
ValLys: 4.48 ± 0.074
ValLeu: 6.949 ± 0.088
ValMet: 1.833 ± 0.043
ValAsn: 2.549 ± 0.051
ValPro: 3.111 ± 0.059
ValGln: 2.878 ± 0.05
ValArg: 3.428 ± 0.055
ValSer: 4.232 ± 0.062
ValThr: 3.877 ± 0.061
ValVal: 5.26 ± 0.08
ValTrp: 0.843 ± 0.034
ValTyr: 2.289 ± 0.051
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.729 ± 0.029
TrpCys: 0.107 ± 0.011
TrpAsp: 0.617 ± 0.022
TrpGlu: 0.893 ± 0.036
TrpPhe: 0.606 ± 0.025
TrpGly: 0.811 ± 0.026
TrpHis: 0.239 ± 0.015
TrpIle: 1.155 ± 0.04
TrpLys: 0.98 ± 0.033
TrpLeu: 1.642 ± 0.048
TrpMet: 0.452 ± 0.022
TrpAsn: 0.611 ± 0.025
TrpPro: 0.377 ± 0.017
TrpGln: 0.509 ± 0.024
TrpArg: 0.621 ± 0.027
TrpSer: 0.719 ± 0.025
TrpThr: 0.583 ± 0.023
TrpVal: 0.925 ± 0.032
TrpTrp: 0.185 ± 0.013
TrpTyr: 0.404 ± 0.021
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.127 ± 0.049
TyrCys: 0.352 ± 0.021
TyrAsp: 1.667 ± 0.04
TyrGlu: 2.131 ± 0.047
TyrPhe: 1.594 ± 0.038
TyrGly: 2.583 ± 0.051
TyrHis: 1.011 ± 0.035
TyrIle: 2.147 ± 0.048
TyrLys: 1.601 ± 0.045
TyrLeu: 3.898 ± 0.06
TyrMet: 0.79 ± 0.03
TyrAsn: 1.028 ± 0.036
TyrPro: 1.691 ± 0.044
TyrGln: 1.93 ± 0.044
TyrArg: 2.111 ± 0.045
TyrSer: 1.709 ± 0.042
TyrThr: 1.676 ± 0.043
TyrVal: 2.19 ± 0.054
TyrTrp: 0.475 ± 0.021
TyrTyr: 1.36 ± 0.042
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3602 proteins (1067728 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
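A minimal sketch of what a protein-level bootstrap could look like is given below. The page states only that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement, reporting the standard deviation across rounds as the error, and all function and parameter names are assumptions made for this illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round resamples whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling at the protein level keeps each sequence intact, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.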

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski