Amino acid dipeptide frequency for Meiothermus granaticius NBRC 107808

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
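The counting described above can be illustrated with a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein and pooled over the whole proteome; the page does not state this explicitly, so treat it as an assumption. The function names and the toy sequences are hypothetical and use one-letter codes, whereas the table uses three-letter codes.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille, pooled over all sequences.

    Assumption: overlapping pairs are counted, so a protein of length n
    contributes n - 1 dipeptides.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency as over- or underrepresented versus the expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not taken from the proteome).
toy_proteins = ["MKLLA", "ALLK"]
freqs = dipeptide_frequencies(toy_proteins)
for dp, value in sorted(freqs.items()):
    print(dp, round(value, 3), classify(value))
```

With real data the input would be all protein sequences of the proteome, and the labels correspond to the red (over) and blue (under) marking described above.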

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
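As a short worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the value 12.222 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 12.222  # AlaAla frequency from the table, in per mille
uniform = 2.5     # 1/400 expressed in per mille

print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
print(ala_ala / uniform)               # fold difference versus uniform expectation
```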

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
12.222AlaAla: 12.222 ± 0.142
0.598AlaCys: 0.598 ± 0.027
3.968AlaAsp: 3.968 ± 0.062
7.309AlaGlu: 7.309 ± 0.106
3.903AlaPhe: 3.903 ± 0.069
8.884AlaGly: 8.884 ± 0.1
2.241AlaHis: 2.241 ± 0.057
4.324AlaIle: 4.324 ± 0.079
3.909AlaLys: 3.909 ± 0.071
16.387AlaLeu: 16.387 ± 0.21
2.295AlaMet: 2.295 ± 0.05
2.417AlaAsn: 2.417 ± 0.058
4.709AlaPro: 4.709 ± 0.079
6.326AlaGln: 6.326 ± 0.1
7.822AlaArg: 7.822 ± 0.115
5.108AlaSer: 5.108 ± 0.074
4.602AlaThr: 4.602 ± 0.071
7.901AlaVal: 7.901 ± 0.088
1.852AlaTrp: 1.852 ± 0.049
3.282AlaTyr: 3.282 ± 0.06
0.0AlaXaa: 0.0 ± 0.0
Cys
0.494CysAla: 0.494 ± 0.025
0.062CysCys: 0.062 ± 0.009
0.264CysAsp: 0.264 ± 0.018
0.265CysGlu: 0.265 ± 0.017
0.194CysPhe: 0.194 ± 0.012
0.574CysGly: 0.574 ± 0.029
0.158CysHis: 0.158 ± 0.012
0.195CysIle: 0.195 ± 0.015
0.163CysLys: 0.163 ± 0.012
0.495CysLeu: 0.495 ± 0.022
0.084CysMet: 0.084 ± 0.008
0.153CysAsn: 0.153 ± 0.012
0.42CysPro: 0.42 ± 0.025
0.186CysGln: 0.186 ± 0.014
0.334CysArg: 0.334 ± 0.016
0.314CysSer: 0.314 ± 0.019
0.332CysThr: 0.332 ± 0.022
0.345CysVal: 0.345 ± 0.021
0.063CysTrp: 0.063 ± 0.008
0.15CysTyr: 0.15 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
3.718AspAla: 3.718 ± 0.057
0.228AspCys: 0.228 ± 0.014
1.349AspAsp: 1.349 ± 0.041
2.331AspGlu: 2.331 ± 0.051
1.576AspPhe: 1.576 ± 0.04
3.251AspGly: 3.251 ± 0.07
0.801AspHis: 0.801 ± 0.03
1.459AspIle: 1.459 ± 0.038
1.268AspLys: 1.268 ± 0.042
5.778AspLeu: 5.778 ± 0.076
0.587AspMet: 0.587 ± 0.023
0.849AspAsn: 0.849 ± 0.029
3.698AspPro: 3.698 ± 0.067
1.492AspGln: 1.492 ± 0.039
2.626AspArg: 2.626 ± 0.053
1.742AspSer: 1.742 ± 0.048
1.93AspThr: 1.93 ± 0.048
2.474AspVal: 2.474 ± 0.058
0.891AspTrp: 0.891 ± 0.031
1.256AspTyr: 1.256 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
9.893GluAla: 9.893 ± 0.147
0.241GluCys: 0.241 ± 0.016
2.275GluAsp: 2.275 ± 0.049
4.208GluGlu: 4.208 ± 0.091
2.095GluPhe: 2.095 ± 0.048
6.287GluGly: 6.287 ± 0.1
1.232GluHis: 1.232 ± 0.036
2.807GluIle: 2.807 ± 0.054
1.996GluLys: 1.996 ± 0.058
7.637GluLeu: 7.637 ± 0.127
1.133GluMet: 1.133 ± 0.033
1.44GluAsn: 1.44 ± 0.039
3.286GluPro: 3.286 ± 0.062
2.411GluGln: 2.411 ± 0.053
5.734GluArg: 5.734 ± 0.087
2.281GluSer: 2.281 ± 0.049
2.921GluThr: 2.921 ± 0.049
5.822GluVal: 5.822 ± 0.07
0.911GluTrp: 0.911 ± 0.03
1.54GluTyr: 1.54 ± 0.045
0.0GluXaa: 0.0 ± 0.0
Phe
3.732PheAla: 3.732 ± 0.069
0.233PheCys: 0.233 ± 0.014
1.662PheAsp: 1.662 ± 0.041
2.004PheGlu: 2.004 ± 0.048
1.439PhePhe: 1.439 ± 0.037
3.313PheGly: 3.313 ± 0.068
0.684PheHis: 0.684 ± 0.027
1.553PheIle: 1.553 ± 0.043
1.209PheLys: 1.209 ± 0.038
4.14PheLeu: 4.14 ± 0.084
0.591PheMet: 0.591 ± 0.025
1.038PheAsn: 1.038 ± 0.034
1.948PhePro: 1.948 ± 0.046
1.318PheGln: 1.318 ± 0.038
2.194PheArg: 2.194 ± 0.042
2.271PheSer: 2.271 ± 0.045
2.005PheThr: 2.005 ± 0.047
2.537PheVal: 2.537 ± 0.061
0.701PheTrp: 0.701 ± 0.028
1.186PheTyr: 1.186 ± 0.037
0.0PheXaa: 0.0 ± 0.0
Gly
8.217GlyAla: 8.217 ± 0.097
0.525GlyCys: 0.525 ± 0.029
3.26GlyAsp: 3.26 ± 0.067
5.245GlyGlu: 5.245 ± 0.074
3.889GlyPhe: 3.889 ± 0.066
7.656GlyGly: 7.656 ± 0.112
1.692GlyHis: 1.692 ± 0.05
4.145GlyIle: 4.145 ± 0.071
3.481GlyLys: 3.481 ± 0.054
12.404GlyLeu: 12.404 ± 0.13
2.06GlyMet: 2.06 ± 0.048
2.253GlyAsn: 2.253 ± 0.055
3.699GlyPro: 3.699 ± 0.067
4.053GlyGln: 4.053 ± 0.081
5.751GlyArg: 5.751 ± 0.082
4.901GlySer: 4.901 ± 0.081
4.0GlyThr: 4.0 ± 0.078
7.372GlyVal: 7.372 ± 0.092
1.713GlyTrp: 1.713 ± 0.045
2.906GlyTyr: 2.906 ± 0.052
0.0GlyXaa: 0.0 ± 0.0
His
1.673HisAla: 1.673 ± 0.04
0.154HisCys: 0.154 ± 0.013
0.81HisAsp: 0.81 ± 0.031
0.906HisGlu: 0.906 ± 0.029
0.732HisPhe: 0.732 ± 0.03
1.451HisGly: 1.451 ± 0.041
0.585HisHis: 0.585 ± 0.027
0.776HisIle: 0.776 ± 0.031
0.606HisLys: 0.606 ± 0.025
2.779HisLeu: 2.779 ± 0.059
0.309HisMet: 0.309 ± 0.018
0.491HisAsn: 0.491 ± 0.025
1.833HisPro: 1.833 ± 0.046
0.779HisGln: 0.779 ± 0.03
1.431HisArg: 1.431 ± 0.039
1.003HisSer: 1.003 ± 0.033
1.106HisThr: 1.106 ± 0.037
1.005HisVal: 1.005 ± 0.034
0.345HisTrp: 0.345 ± 0.017
0.678HisTyr: 0.678 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
4.878IleAla: 4.878 ± 0.076
0.239IleCys: 0.239 ± 0.016
1.55IleAsp: 1.55 ± 0.049
2.365IleGlu: 2.365 ± 0.053
1.4IlePhe: 1.4 ± 0.045
3.709IleGly: 3.709 ± 0.071
0.933IleHis: 0.933 ± 0.031
1.391IleIle: 1.391 ± 0.046
1.159IleLys: 1.159 ± 0.037
4.83IleLeu: 4.83 ± 0.088
0.531IleMet: 0.531 ± 0.024
1.184IleAsn: 1.184 ± 0.033
2.958IlePro: 2.958 ± 0.055
1.616IleGln: 1.616 ± 0.034
2.806IleArg: 2.806 ± 0.052
2.121IleSer: 2.121 ± 0.041
2.218IleThr: 2.218 ± 0.055
2.878IleVal: 2.878 ± 0.06
0.541IleTrp: 0.541 ± 0.025
1.175IleTyr: 1.175 ± 0.033
0.0IleXaa: 0.0 ± 0.0
Lys
4.354LysAla: 4.354 ± 0.076
0.101LysCys: 0.101 ± 0.011
1.519LysAsp: 1.519 ± 0.048
2.067LysGlu: 2.067 ± 0.049
0.787LysPhe: 0.787 ± 0.03
3.101LysGly: 3.101 ± 0.064
0.559LysHis: 0.559 ± 0.026
1.37LysIle: 1.37 ± 0.039
1.196LysLys: 1.196 ± 0.039
3.446LysLeu: 3.446 ± 0.071
0.598LysMet: 0.598 ± 0.026
0.947LysAsn: 0.947 ± 0.031
2.372LysPro: 2.372 ± 0.051
1.034LysGln: 1.034 ± 0.029
2.187LysArg: 2.187 ± 0.055
1.476LysSer: 1.476 ± 0.034
1.864LysThr: 1.864 ± 0.042
2.769LysVal: 2.769 ± 0.054
0.327LysTrp: 0.327 ± 0.018
0.798LysTyr: 0.798 ± 0.029
0.0LysXaa: 0.0 ± 0.0
Leu
14.46LeuAla: 14.46 ± 0.163
0.582LeuCys: 0.582 ± 0.023
4.978LeuAsp: 4.978 ± 0.065
12.587LeuGlu: 12.587 ± 0.16
3.852LeuPhe: 3.852 ± 0.079
12.064LeuGly: 12.064 ± 0.137
2.32LeuHis: 2.32 ± 0.049
4.982LeuIle: 4.982 ± 0.089
4.32LeuLys: 4.32 ± 0.083
14.766LeuLeu: 14.766 ± 0.201
2.21LeuMet: 2.21 ± 0.041
3.023LeuAsn: 3.023 ± 0.061
7.219LeuPro: 7.219 ± 0.093
4.652LeuGln: 4.652 ± 0.073
8.965LeuArg: 8.965 ± 0.112
8.072LeuSer: 8.072 ± 0.115
5.844LeuThr: 5.844 ± 0.084
8.455LeuVal: 8.455 ± 0.105
1.91LeuTrp: 1.91 ± 0.051
3.165LeuTyr: 3.165 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
2.15MetAla: 2.15 ± 0.055
0.085MetCys: 0.085 ± 0.008
0.771MetAsp: 0.771 ± 0.028
1.076MetGlu: 1.076 ± 0.032
0.489MetPhe: 0.489 ± 0.024
1.853MetGly: 1.853 ± 0.047
0.319MetHis: 0.319 ± 0.018
0.782MetIle: 0.782 ± 0.029
0.81MetLys: 0.81 ± 0.032
2.119MetLeu: 2.119 ± 0.048
0.347MetMet: 0.347 ± 0.022
0.658MetAsn: 0.658 ± 0.028
1.014MetPro: 1.014 ± 0.03
0.676MetGln: 0.676 ± 0.026
1.405MetArg: 1.405 ± 0.037
1.115MetSer: 1.115 ± 0.034
0.883MetThr: 0.883 ± 0.031
1.437MetVal: 1.437 ± 0.038
0.165MetTrp: 0.165 ± 0.013
0.371MetTyr: 0.371 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
2.527AsnAla: 2.527 ± 0.06
0.144AsnCys: 0.144 ± 0.013
0.941AsnAsp: 0.941 ± 0.031
0.994AsnGlu: 0.994 ± 0.031
0.985AsnPhe: 0.985 ± 0.039
2.001AsnGly: 2.001 ± 0.063
0.457AsnHis: 0.457 ± 0.024
0.979AsnIle: 0.979 ± 0.031
0.653AsnLys: 0.653 ± 0.029
3.833AsnLeu: 3.833 ± 0.072
0.317AsnMet: 0.317 ± 0.016
0.649AsnAsn: 0.649 ± 0.033
2.782AsnPro: 2.782 ± 0.063
0.982AsnGln: 0.982 ± 0.03
1.851AsnArg: 1.851 ± 0.046
1.098AsnSer: 1.098 ± 0.036
1.301AsnThr: 1.301 ± 0.036
1.513AsnVal: 1.513 ± 0.046
0.443AsnTrp: 0.443 ± 0.022
0.746AsnTyr: 0.746 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
5.507ProAla: 5.507 ± 0.091
0.297ProCys: 0.297 ± 0.018
2.857ProAsp: 2.857 ± 0.056
5.096ProGlu: 5.096 ± 0.081
2.094ProPhe: 2.094 ± 0.045
5.273ProGly: 5.273 ± 0.078
1.202ProHis: 1.202 ± 0.039
2.184ProIle: 2.184 ± 0.04
2.185ProLys: 2.185 ± 0.044
6.551ProLeu: 6.551 ± 0.095
1.079ProMet: 1.079 ± 0.035
1.653ProAsn: 1.653 ± 0.042
3.399ProPro: 3.399 ± 0.074
2.929ProGln: 2.929 ± 0.062
3.278ProArg: 3.278 ± 0.057
3.324ProSer: 3.324 ± 0.051
3.122ProThr: 3.122 ± 0.051
4.158ProVal: 4.158 ± 0.068
0.928ProTrp: 0.928 ± 0.032
1.716ProTyr: 1.716 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
6.246GlnAla: 6.246 ± 0.105
0.122GlnCys: 0.122 ± 0.011
1.618GlnAsp: 1.618 ± 0.038
2.659GlnGlu: 2.659 ± 0.048
1.072GlnPhe: 1.072 ± 0.032
4.15GlnGly: 4.15 ± 0.085
0.786GlnHis: 0.786 ± 0.03
1.953GlnIle: 1.953 ± 0.044
1.383GlnLys: 1.383 ± 0.043
3.769GlnLeu: 3.769 ± 0.065
0.907GlnMet: 0.907 ± 0.029
1.193GlnAsn: 1.193 ± 0.039
2.382GlnPro: 2.382 ± 0.058
1.81GlnGln: 1.81 ± 0.05
2.808GlnArg: 2.808 ± 0.059
1.94GlnSer: 1.94 ± 0.044
2.386GlnThr: 2.386 ± 0.051
3.403GlnVal: 3.403 ± 0.071
0.431GlnTrp: 0.431 ± 0.022
0.91GlnTyr: 0.91 ± 0.028
0.0GlnXaa: 0.0 ± 0.0
Arg
7.27ArgAla: 7.27 ± 0.11
0.361ArgCys: 0.361 ± 0.022
2.565ArgAsp: 2.565 ± 0.06
4.865ArgGlu: 4.865 ± 0.072
3.076ArgPhe: 3.076 ± 0.052
5.222ArgGly: 5.222 ± 0.078
1.254ArgHis: 1.254 ± 0.035
3.241ArgIle: 3.241 ± 0.061
2.2ArgLys: 2.2 ± 0.043
9.488ArgLeu: 9.488 ± 0.123
1.671ArgMet: 1.671 ± 0.043
1.531ArgAsn: 1.531 ± 0.043
3.46ArgPro: 3.46 ± 0.064
2.732ArgGln: 2.732 ± 0.046
4.976ArgArg: 4.976 ± 0.082
3.36ArgSer: 3.36 ± 0.058
2.953ArgThr: 2.953 ± 0.051
5.607ArgVal: 5.607 ± 0.08
1.287ArgTrp: 1.287 ± 0.031
2.51ArgTyr: 2.51 ± 0.051
0.0ArgXaa: 0.0 ± 0.0
Ser
5.108SerAla: 5.108 ± 0.076
0.267SerCys: 0.267 ± 0.015
2.059SerAsp: 2.059 ± 0.05
2.871SerGlu: 2.871 ± 0.056
2.053SerPhe: 2.053 ± 0.055
5.401SerGly: 5.401 ± 0.106
0.918SerHis: 0.918 ± 0.032
1.941SerIle: 1.941 ± 0.045
1.628SerLys: 1.628 ± 0.046
6.975SerLeu: 6.975 ± 0.113
0.889SerMet: 0.889 ± 0.032
1.338SerAsn: 1.338 ± 0.042
3.353SerPro: 3.353 ± 0.06
1.958SerGln: 1.958 ± 0.048
3.479SerArg: 3.479 ± 0.066
3.115SerSer: 3.115 ± 0.059
2.584SerThr: 2.584 ± 0.064
3.515SerVal: 3.515 ± 0.065
0.833SerTrp: 0.833 ± 0.035
1.382SerTyr: 1.382 ± 0.035
0.0SerXaa: 0.0 ± 0.0
Thr
4.921ThrAla: 4.921 ± 0.08
0.289ThrCys: 0.289 ± 0.019
1.997ThrAsp: 1.997 ± 0.052
2.316ThrGlu: 2.316 ± 0.047
1.833ThrPhe: 1.833 ± 0.046
4.346ThrGly: 4.346 ± 0.072
1.073ThrHis: 1.073 ± 0.032
1.617ThrIle: 1.617 ± 0.048
1.138ThrLys: 1.138 ± 0.038
7.218ThrLeu: 7.218 ± 0.078
0.612ThrMet: 0.612 ± 0.026
1.155ThrAsn: 1.155 ± 0.04
3.939ThrPro: 3.939 ± 0.069
1.998ThrGln: 1.998 ± 0.047
2.942ThrArg: 2.942 ± 0.061
2.172ThrSer: 2.172 ± 0.052
2.332ThrThr: 2.332 ± 0.064
3.873ThrVal: 3.873 ± 0.07
0.711ThrTrp: 0.711 ± 0.029
1.503ThrTyr: 1.503 ± 0.047
0.0ThrXaa: 0.0 ± 0.0
Val
7.855ValAla: 7.855 ± 0.102
0.471ValCys: 0.471 ± 0.025
2.888ValAsp: 2.888 ± 0.056
4.735ValGlu: 4.735 ± 0.079
2.746ValPhe: 2.746 ± 0.053
6.683ValGly: 6.683 ± 0.081
1.347ValHis: 1.347 ± 0.037
3.101ValIle: 3.101 ± 0.071
2.197ValLys: 2.197 ± 0.055
9.836ValLeu: 9.836 ± 0.107
1.514ValMet: 1.514 ± 0.039
1.949ValAsn: 1.949 ± 0.043
3.725ValPro: 3.725 ± 0.061
2.968ValGln: 2.968 ± 0.069
5.486ValArg: 5.486 ± 0.076
3.978ValSer: 3.978 ± 0.067
3.266ValThr: 3.266 ± 0.07
6.274ValVal: 6.274 ± 0.081
1.254ValTrp: 1.254 ± 0.035
2.177ValTyr: 2.177 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
1.707TrpAla: 1.707 ± 0.043
0.077TrpCys: 0.077 ± 0.01
0.632TrpAsp: 0.632 ± 0.029
0.916TrpGlu: 0.916 ± 0.031
0.543TrpPhe: 0.543 ± 0.026
1.383TrpGly: 1.383 ± 0.04
0.304TrpHis: 0.304 ± 0.018
0.686TrpIle: 0.686 ± 0.032
0.552TrpLys: 0.552 ± 0.023
2.196TrpLeu: 2.196 ± 0.05
0.417TrpMet: 0.417 ± 0.021
0.568TrpAsn: 0.568 ± 0.03
0.811TrpPro: 0.811 ± 0.029
0.725TrpGln: 0.725 ± 0.025
1.089TrpArg: 1.089 ± 0.039
0.824TrpSer: 0.824 ± 0.035
0.643TrpThr: 0.643 ± 0.032
1.376TrpVal: 1.376 ± 0.039
0.285TrpTrp: 0.285 ± 0.019
0.386TrpTyr: 0.386 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.126TyrAla: 3.126 ± 0.055
0.174TyrCys: 0.174 ± 0.013
1.341TyrAsp: 1.341 ± 0.04
1.334TyrGlu: 1.334 ± 0.04
1.001TyrPhe: 1.001 ± 0.032
2.619TyrGly: 2.619 ± 0.048
0.587TyrHis: 0.587 ± 0.025
0.995TyrIle: 0.995 ± 0.031
0.82TyrLys: 0.82 ± 0.032
3.782TyrLeu: 3.782 ± 0.064
0.373TyrMet: 0.373 ± 0.02
0.706TyrAsn: 0.706 ± 0.034
1.823TyrPro: 1.823 ± 0.044
1.28TyrGln: 1.28 ± 0.038
2.465TyrArg: 2.465 ± 0.052
1.451TyrSer: 1.451 ± 0.04
1.692TyrThr: 1.692 ± 0.041
1.734TyrVal: 1.734 ± 0.041
0.53TyrTrp: 0.53 ± 0.023
0.868TyrTyr: 0.868 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3220 proteins (987299 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
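Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicate values serves as the error. This is only an illustration under stated assumptions. The page does not say which spread measure is reported, so the sample standard deviation used here is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide in per mille, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Resample proteins with replacement and return (mean, standard deviation).

    Assumption: the reported error is the sample standard deviation
    across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resampling unit is the whole protein, not the single residue.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences in one-letter code).
toy_proteins = ["MKLLA", "ALLK", "GLLG", "MAAK"]
mean_ll, err_ll = bootstrap_error(toy_proteins, "LL")
print(f"LL: {mean_ll:.3f} ± {err_ll:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably the reason for working at the protein level.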

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski