Amino acid dipepetide frequency for Thermoplasmatales archaeon SCGC AB-540-F20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.168AlaAla: 3.168 ± 0.135
0.948AlaCys: 0.948 ± 0.07
3.087AlaAsp: 3.087 ± 0.116
3.381AlaGlu: 3.381 ± 0.109
2.603AlaPhe: 2.603 ± 0.107
4.112AlaGly: 4.112 ± 0.141
1.012AlaHis: 1.012 ± 0.06
5.087AlaIle: 5.087 ± 0.16
3.727AlaLys: 3.727 ± 0.149
4.671AlaLeu: 4.671 ± 0.157
1.239AlaMet: 1.239 ± 0.064
2.346AlaAsn: 2.346 ± 0.092
1.665AlaPro: 1.665 ± 0.078
1.225AlaGln: 1.225 ± 0.058
2.099AlaArg: 2.099 ± 0.093
3.561AlaSer: 3.561 ± 0.114
3.002AlaThr: 3.002 ± 0.11
3.774AlaVal: 3.774 ± 0.119
0.782AlaTrp: 0.782 ± 0.05
1.851AlaTyr: 1.851 ± 0.089
0.0AlaXaa: 0.0 ± 0.0
Cys
0.718CysAla: 0.718 ± 0.059
0.217CysCys: 0.217 ± 0.032
0.85CysAsp: 0.85 ± 0.057
0.758CysGlu: 0.758 ± 0.05
0.531CysPhe: 0.531 ± 0.039
1.306CysGly: 1.306 ± 0.072
0.366CysHis: 0.366 ± 0.049
1.002CysIle: 1.002 ± 0.071
0.89CysLys: 0.89 ± 0.066
0.88CysLeu: 0.88 ± 0.062
0.369CysMet: 0.369 ± 0.028
0.714CysAsn: 0.714 ± 0.056
0.795CysPro: 0.795 ± 0.064
0.278CysGln: 0.278 ± 0.029
0.602CysArg: 0.602 ± 0.049
1.093CysSer: 1.093 ± 0.085
0.697CysThr: 0.697 ± 0.056
0.724CysVal: 0.724 ± 0.05
0.22CysTrp: 0.22 ± 0.032
0.579CysTyr: 0.579 ± 0.044
0.0CysXaa: 0.0 ± 0.0
Asp
3.185AspAla: 3.185 ± 0.106
0.778AspCys: 0.778 ± 0.054
4.024AspAsp: 4.024 ± 0.16
4.468AspGlu: 4.468 ± 0.129
3.033AspPhe: 3.033 ± 0.12
4.972AspGly: 4.972 ± 0.219
1.029AspHis: 1.029 ± 0.057
5.842AspIle: 5.842 ± 0.131
4.197AspLys: 4.197 ± 0.148
4.671AspLeu: 4.671 ± 0.127
1.455AspMet: 1.455 ± 0.07
3.347AspAsn: 3.347 ± 0.149
2.779AspPro: 2.779 ± 0.131
1.029AspGln: 1.029 ± 0.058
1.923AspArg: 1.923 ± 0.08
3.537AspSer: 3.537 ± 0.145
3.192AspThr: 3.192 ± 0.117
4.519AspVal: 4.519 ± 0.135
1.205AspTrp: 1.205 ± 0.07
2.748AspTyr: 2.748 ± 0.1
0.0AspXaa: 0.0 ± 0.0
Glu
3.046GluAla: 3.046 ± 0.106
0.707GluCys: 0.707 ± 0.056
3.828GluAsp: 3.828 ± 0.105
5.378GluGlu: 5.378 ± 0.161
2.847GluPhe: 2.847 ± 0.112
3.815GluGly: 3.815 ± 0.115
1.286GluHis: 1.286 ± 0.062
6.587GluIle: 6.587 ± 0.147
6.695GluLys: 6.695 ± 0.197
5.92GluLeu: 5.92 ± 0.153
1.686GluMet: 1.686 ± 0.086
4.312GluAsn: 4.312 ± 0.145
2.027GluPro: 2.027 ± 0.096
1.828GluGln: 1.828 ± 0.08
2.647GluArg: 2.647 ± 0.115
3.388GluSer: 3.388 ± 0.102
3.622GluThr: 3.622 ± 0.106
3.977GluVal: 3.977 ± 0.134
0.958GluTrp: 0.958 ± 0.067
2.779GluTyr: 2.779 ± 0.097
0.0GluXaa: 0.0 ± 0.0
Phe
2.308PheAla: 2.308 ± 0.113
0.592PheCys: 0.592 ± 0.045
3.09PheAsp: 3.09 ± 0.107
2.874PheGlu: 2.874 ± 0.112
2.396PhePhe: 2.396 ± 0.1
3.205PheGly: 3.205 ± 0.111
0.802PheHis: 0.802 ± 0.049
4.133PheIle: 4.133 ± 0.145
2.633PheLys: 2.633 ± 0.096
4.295PheLeu: 4.295 ± 0.164
0.995PheMet: 0.995 ± 0.059
2.515PheAsn: 2.515 ± 0.115
1.899PhePro: 1.899 ± 0.067
1.093PheGln: 1.093 ± 0.061
1.642PheArg: 1.642 ± 0.073
3.161PheSer: 3.161 ± 0.11
2.63PheThr: 2.63 ± 0.102
2.826PheVal: 2.826 ± 0.108
0.646PheTrp: 0.646 ± 0.048
2.075PheTyr: 2.075 ± 0.09
0.0PheXaa: 0.0 ± 0.0
Gly
4.055GlyAla: 4.055 ± 0.14
1.168GlyCys: 1.168 ± 0.077
4.414GlyAsp: 4.414 ± 0.152
4.434GlyGlu: 4.434 ± 0.126
3.51GlyPhe: 3.51 ± 0.12
5.189GlyGly: 5.189 ± 0.153
1.242GlyHis: 1.242 ± 0.066
6.634GlyIle: 6.634 ± 0.161
5.212GlyLys: 5.212 ± 0.166
5.51GlyLeu: 5.51 ± 0.154
1.794GlyMet: 1.794 ± 0.08
3.777GlyAsn: 3.777 ± 0.136
2.078GlyPro: 2.078 ± 0.085
1.435GlyGln: 1.435 ± 0.07
2.606GlyArg: 2.606 ± 0.106
4.387GlySer: 4.387 ± 0.184
4.265GlyThr: 4.265 ± 0.161
4.346GlyVal: 4.346 ± 0.11
1.103GlyTrp: 1.103 ± 0.065
2.935GlyTyr: 2.935 ± 0.105
0.0GlyXaa: 0.0 ± 0.0
His
1.059HisAla: 1.059 ± 0.056
0.305HisCys: 0.305 ± 0.031
1.026HisAsp: 1.026 ± 0.061
1.144HisGlu: 1.144 ± 0.053
0.887HisPhe: 0.887 ± 0.06
1.293HisGly: 1.293 ± 0.07
0.423HisHis: 0.423 ± 0.037
1.763HisIle: 1.763 ± 0.071
1.134HisLys: 1.134 ± 0.058
1.631HisLeu: 1.631 ± 0.078
0.457HisMet: 0.457 ± 0.04
1.046HisAsn: 1.046 ± 0.071
0.958HisPro: 0.958 ± 0.06
0.464HisGln: 0.464 ± 0.04
0.677HisArg: 0.677 ± 0.048
1.13HisSer: 1.13 ± 0.073
0.975HisThr: 0.975 ± 0.062
1.053HisVal: 1.053 ± 0.061
0.223HisTrp: 0.223 ± 0.029
0.772HisTyr: 0.772 ± 0.05
0.0HisXaa: 0.0 ± 0.0
Ile
5.361IleAla: 5.361 ± 0.155
1.208IleCys: 1.208 ± 0.073
5.845IleAsp: 5.845 ± 0.129
5.798IleGlu: 5.798 ± 0.139
4.051IlePhe: 4.051 ± 0.159
6.346IleGly: 6.346 ± 0.166
1.52IleHis: 1.52 ± 0.073
8.035IleIle: 8.035 ± 0.229
6.455IleLys: 6.455 ± 0.18
7.985IleLeu: 7.985 ± 0.205
2.004IleMet: 2.004 ± 0.093
4.539IleAsn: 4.539 ± 0.129
4.011IlePro: 4.011 ± 0.134
2.383IleGln: 2.383 ± 0.093
3.256IleArg: 3.256 ± 0.092
6.522IleSer: 6.522 ± 0.176
5.236IleThr: 5.236 ± 0.146
5.964IleVal: 5.964 ± 0.168
1.127IleTrp: 1.127 ± 0.074
3.388IleTyr: 3.388 ± 0.114
0.0IleXaa: 0.0 ± 0.0
Lys
3.679LysAla: 3.679 ± 0.14
0.846LysCys: 0.846 ± 0.062
4.563LysAsp: 4.563 ± 0.151
6.224LysGlu: 6.224 ± 0.186
2.308LysPhe: 2.308 ± 0.104
4.085LysGly: 4.085 ± 0.13
1.432LysHis: 1.432 ± 0.083
7.209LysIle: 7.209 ± 0.185
8.543LysLys: 8.543 ± 0.243
5.693LysLeu: 5.693 ± 0.164
2.088LysMet: 2.088 ± 0.084
4.999LysAsn: 4.999 ± 0.143
2.745LysPro: 2.745 ± 0.125
2.237LysGln: 2.237 ± 0.085
3.138LysArg: 3.138 ± 0.116
4.028LysSer: 4.028 ± 0.133
4.231LysThr: 4.231 ± 0.114
3.879LysVal: 3.879 ± 0.128
0.789LysTrp: 0.789 ± 0.068
2.836LysTyr: 2.836 ± 0.108
0.0LysXaa: 0.0 ± 0.0
Leu
4.627LeuAla: 4.627 ± 0.138
1.022LeuCys: 1.022 ± 0.064
5.053LeuAsp: 5.053 ± 0.14
5.768LeuGlu: 5.768 ± 0.15
4.106LeuPhe: 4.106 ± 0.159
5.669LeuGly: 5.669 ± 0.186
1.499LeuHis: 1.499 ± 0.066
6.932LeuIle: 6.932 ± 0.195
6.451LeuLys: 6.451 ± 0.182
7.744LeuLeu: 7.744 ± 0.243
2.021LeuMet: 2.021 ± 0.102
4.207LeuAsn: 4.207 ± 0.121
3.344LeuPro: 3.344 ± 0.14
2.241LeuGln: 2.241 ± 0.086
3.28LeuArg: 3.28 ± 0.115
5.923LeuSer: 5.923 ± 0.14
4.654LeuThr: 4.654 ± 0.122
4.921LeuVal: 4.921 ± 0.137
1.083LeuTrp: 1.083 ± 0.063
2.951LeuTyr: 2.951 ± 0.101
0.0LeuXaa: 0.0 ± 0.0
Met
1.527MetAla: 1.527 ± 0.082
0.271MetCys: 0.271 ± 0.033
1.462MetAsp: 1.462 ± 0.074
1.557MetGlu: 1.557 ± 0.067
0.877MetPhe: 0.877 ± 0.047
1.662MetGly: 1.662 ± 0.087
0.359MetHis: 0.359 ± 0.032
2.105MetIle: 2.105 ± 0.084
2.227MetLys: 2.227 ± 0.076
1.831MetLeu: 1.831 ± 0.089
0.613MetMet: 0.613 ± 0.046
1.455MetAsn: 1.455 ± 0.066
1.015MetPro: 1.015 ± 0.053
0.653MetGln: 0.653 ± 0.048
0.927MetArg: 0.927 ± 0.062
1.445MetSer: 1.445 ± 0.063
1.357MetThr: 1.357 ± 0.064
1.567MetVal: 1.567 ± 0.079
0.254MetTrp: 0.254 ± 0.031
0.745MetTyr: 0.745 ± 0.052
0.0MetXaa: 0.0 ± 0.0
Asn
2.914AsnAla: 2.914 ± 0.104
0.829AsnCys: 0.829 ± 0.055
2.897AsnAsp: 2.897 ± 0.115
3.669AsnGlu: 3.669 ± 0.109
2.291AsnPhe: 2.291 ± 0.085
3.838AsnGly: 3.838 ± 0.138
0.978AsnHis: 0.978 ± 0.066
5.788AsnIle: 5.788 ± 0.19
3.679AsnLys: 3.679 ± 0.123
4.566AsnLeu: 4.566 ± 0.143
1.147AsnMet: 1.147 ± 0.068
3.747AsnAsn: 3.747 ± 0.226
2.623AsnPro: 2.623 ± 0.09
1.496AsnGln: 1.496 ± 0.074
1.906AsnArg: 1.906 ± 0.084
3.361AsnSer: 3.361 ± 0.148
3.364AsnThr: 3.364 ± 0.144
3.666AsnVal: 3.666 ± 0.15
0.927AsnTrp: 0.927 ± 0.065
2.281AsnTyr: 2.281 ± 0.105
0.0AsnXaa: 0.0 ± 0.0
Pro
1.98ProAla: 1.98 ± 0.092
0.437ProCys: 0.437 ± 0.043
2.833ProAsp: 2.833 ± 0.104
2.904ProGlu: 2.904 ± 0.109
1.882ProPhe: 1.882 ± 0.078
2.549ProGly: 2.549 ± 0.105
0.806ProHis: 0.806 ± 0.062
3.334ProIle: 3.334 ± 0.125
2.41ProLys: 2.41 ± 0.102
3.158ProLeu: 3.158 ± 0.107
0.978ProMet: 0.978 ± 0.066
1.902ProAsn: 1.902 ± 0.083
1.75ProPro: 1.75 ± 0.097
0.998ProGln: 0.998 ± 0.054
1.266ProArg: 1.266 ± 0.062
2.66ProSer: 2.66 ± 0.098
2.203ProThr: 2.203 ± 0.089
2.613ProVal: 2.613 ± 0.09
0.552ProTrp: 0.552 ± 0.049
1.645ProTyr: 1.645 ± 0.092
0.0ProXaa: 0.0 ± 0.0
Gln
1.323GlnAla: 1.323 ± 0.071
0.328GlnCys: 0.328 ± 0.042
1.31GlnAsp: 1.31 ± 0.069
1.686GlnGlu: 1.686 ± 0.084
1.026GlnPhe: 1.026 ± 0.058
1.445GlnGly: 1.445 ± 0.072
0.454GlnHis: 0.454 ± 0.037
2.19GlnIle: 2.19 ± 0.081
2.139GlnLys: 2.139 ± 0.09
2.099GlnLeu: 2.099 ± 0.092
0.66GlnMet: 0.66 ± 0.046
1.499GlnAsn: 1.499 ± 0.074
0.833GlnPro: 0.833 ± 0.051
0.853GlnGln: 0.853 ± 0.065
1.093GlnArg: 1.093 ± 0.057
1.52GlnSer: 1.52 ± 0.069
1.469GlnThr: 1.469 ± 0.08
1.262GlnVal: 1.262 ± 0.071
0.372GlnTrp: 0.372 ± 0.039
0.907GlnTyr: 0.907 ± 0.061
0.0GlnXaa: 0.0 ± 0.0
Arg
1.909ArgAla: 1.909 ± 0.088
0.562ArgCys: 0.562 ± 0.049
2.22ArgAsp: 2.22 ± 0.089
2.66ArgGlu: 2.66 ± 0.101
1.801ArgPhe: 1.801 ± 0.08
2.386ArgGly: 2.386 ± 0.103
0.653ArgHis: 0.653 ± 0.048
3.391ArgIle: 3.391 ± 0.119
3.388ArgLys: 3.388 ± 0.116
3.012ArgLeu: 3.012 ± 0.11
1.103ArgMet: 1.103 ± 0.062
1.912ArgAsn: 1.912 ± 0.09
1.076ArgPro: 1.076 ± 0.069
0.863ArgGln: 0.863 ± 0.054
1.818ArgArg: 1.818 ± 0.088
2.031ArgSer: 2.031 ± 0.086
1.774ArgThr: 1.774 ± 0.073
2.105ArgVal: 2.105 ± 0.093
0.487ArgTrp: 0.487 ± 0.045
1.767ArgTyr: 1.767 ± 0.08
0.0ArgXaa: 0.0 ± 0.0
Ser
3.151SerAla: 3.151 ± 0.114
0.846SerCys: 0.846 ± 0.054
4.112SerAsp: 4.112 ± 0.177
3.747SerGlu: 3.747 ± 0.127
3.016SerPhe: 3.016 ± 0.117
5.446SerGly: 5.446 ± 0.172
1.151SerHis: 1.151 ± 0.067
5.619SerIle: 5.619 ± 0.15
4.4SerLys: 4.4 ± 0.127
5.504SerLeu: 5.504 ± 0.13
1.479SerMet: 1.479 ± 0.076
3.909SerAsn: 3.909 ± 0.186
2.335SerPro: 2.335 ± 0.091
1.469SerGln: 1.469 ± 0.067
2.254SerArg: 2.254 ± 0.08
4.854SerSer: 4.854 ± 0.187
3.743SerThr: 3.743 ± 0.152
3.909SerVal: 3.909 ± 0.136
1.107SerTrp: 1.107 ± 0.076
2.474SerTyr: 2.474 ± 0.109
0.0SerXaa: 0.0 ± 0.0
Thr
2.979ThrAla: 2.979 ± 0.103
0.738ThrCys: 0.738 ± 0.048
3.266ThrAsp: 3.266 ± 0.125
3.165ThrGlu: 3.165 ± 0.116
2.796ThrPhe: 2.796 ± 0.092
4.427ThrGly: 4.427 ± 0.139
1.046ThrHis: 1.046 ± 0.062
5.778ThrIle: 5.778 ± 0.153
3.5ThrLys: 3.5 ± 0.137
4.603ThrLeu: 4.603 ± 0.12
1.334ThrMet: 1.334 ± 0.083
3.053ThrAsn: 3.053 ± 0.157
2.542ThrPro: 2.542 ± 0.099
1.178ThrGln: 1.178 ± 0.061
1.889ThrArg: 1.889 ± 0.09
3.679ThrSer: 3.679 ± 0.15
3.652ThrThr: 3.652 ± 0.158
3.916ThrVal: 3.916 ± 0.132
0.9ThrTrp: 0.9 ± 0.066
2.583ThrTyr: 2.583 ± 0.129
0.0ThrXaa: 0.0 ± 0.0
Val
3.598ValAla: 3.598 ± 0.128
0.992ValCys: 0.992 ± 0.072
4.014ValAsp: 4.014 ± 0.131
4.326ValGlu: 4.326 ± 0.118
3.155ValPhe: 3.155 ± 0.116
4.312ValGly: 4.312 ± 0.135
1.059ValHis: 1.059 ± 0.063
5.338ValIle: 5.338 ± 0.136
4.678ValLys: 4.678 ± 0.138
5.206ValLeu: 5.206 ± 0.14
1.364ValMet: 1.364 ± 0.067
3.171ValAsn: 3.171 ± 0.118
2.217ValPro: 2.217 ± 0.083
1.422ValGln: 1.422 ± 0.072
1.95ValArg: 1.95 ± 0.09
4.349ValSer: 4.349 ± 0.134
3.889ValThr: 3.889 ± 0.143
4.339ValVal: 4.339 ± 0.151
0.836ValTrp: 0.836 ± 0.059
2.386ValTyr: 2.386 ± 0.105
0.0ValXaa: 0.0 ± 0.0
Trp
0.721TrpAla: 0.721 ± 0.059
0.2TrpCys: 0.2 ± 0.027
1.114TrpAsp: 1.114 ± 0.069
0.985TrpGlu: 0.985 ± 0.066
0.775TrpPhe: 0.775 ± 0.064
1.114TrpGly: 1.114 ± 0.071
0.352TrpHis: 0.352 ± 0.035
1.225TrpIle: 1.225 ± 0.067
0.806TrpLys: 0.806 ± 0.056
1.117TrpLeu: 1.117 ± 0.07
0.369TrpMet: 0.369 ± 0.037
0.938TrpAsn: 0.938 ± 0.074
0.379TrpPro: 0.379 ± 0.038
0.376TrpGln: 0.376 ± 0.038
0.518TrpArg: 0.518 ± 0.046
1.032TrpSer: 1.032 ± 0.088
0.877TrpThr: 0.877 ± 0.078
0.87TrpVal: 0.87 ± 0.061
0.298TrpTrp: 0.298 ± 0.042
0.589TrpTyr: 0.589 ± 0.048
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.031TyrAla: 2.031 ± 0.087
0.596TyrCys: 0.596 ± 0.053
2.924TyrAsp: 2.924 ± 0.118
2.407TyrGlu: 2.407 ± 0.098
1.923TyrPhe: 1.923 ± 0.079
2.921TyrGly: 2.921 ± 0.1
0.951TyrHis: 0.951 ± 0.057
3.046TyrIle: 3.046 ± 0.108
2.39TyrLys: 2.39 ± 0.096
3.402TyrLeu: 3.402 ± 0.114
0.751TyrMet: 0.751 ± 0.051
2.579TyrAsn: 2.579 ± 0.106
1.787TyrPro: 1.787 ± 0.089
0.948TyrGln: 0.948 ± 0.051
1.432TyrArg: 1.432 ± 0.074
2.863TyrSer: 2.863 ± 0.115
2.19TyrThr: 2.19 ± 0.093
2.369TyrVal: 2.369 ± 0.085
0.782TyrTrp: 0.782 ± 0.065
2.176TyrTyr: 2.176 ± 0.116
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1255 proteins (295448 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski