Amino acid dipepetide frequency for archaeon HR05

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.742AlaAla: 4.742 ± 0.232
0.989AlaCys: 0.989 ± 0.075
3.804AlaAsp: 3.804 ± 0.143
4.579AlaGlu: 4.579 ± 0.174
2.382AlaPhe: 2.382 ± 0.132
5.506AlaGly: 5.506 ± 0.209
1.185AlaHis: 1.185 ± 0.089
6.478AlaIle: 6.478 ± 0.245
4.54AlaLys: 4.54 ± 0.156
8.568AlaLeu: 8.568 ± 0.236
3.068AlaMet: 3.068 ± 0.138
2.264AlaAsn: 2.264 ± 0.099
1.691AlaPro: 1.691 ± 0.107
1.281AlaGln: 1.281 ± 0.091
6.961AlaArg: 6.961 ± 0.213
6.281AlaSer: 6.281 ± 0.221
3.011AlaThr: 3.011 ± 0.128
5.989AlaVal: 5.989 ± 0.196
0.685AlaTrp: 0.685 ± 0.069
3.382AlaTyr: 3.382 ± 0.133
0.0AlaXaa: 0.0 ± 0.0
Cys
0.534CysAla: 0.534 ± 0.048
0.101CysCys: 0.101 ± 0.024
0.64CysAsp: 0.64 ± 0.068
0.427CysGlu: 0.427 ± 0.043
0.36CysPhe: 0.36 ± 0.043
0.77CysGly: 0.77 ± 0.073
0.18CysHis: 0.18 ± 0.032
1.354CysIle: 1.354 ± 0.083
0.832CysLys: 0.832 ± 0.072
0.607CysLeu: 0.607 ± 0.056
0.36CysMet: 0.36 ± 0.045
0.68CysAsn: 0.68 ± 0.059
0.478CysPro: 0.478 ± 0.057
0.14CysGln: 0.14 ± 0.027
0.753CysArg: 0.753 ± 0.069
0.978CysSer: 0.978 ± 0.089
0.551CysThr: 0.551 ± 0.051
0.579CysVal: 0.579 ± 0.047
0.118CysTrp: 0.118 ± 0.029
0.427CysTyr: 0.427 ± 0.05
0.0CysXaa: 0.0 ± 0.0
Asp
5.854AspAla: 5.854 ± 0.204
0.399AspCys: 0.399 ± 0.05
4.388AspAsp: 4.388 ± 0.179
5.467AspGlu: 5.467 ± 0.178
1.315AspPhe: 1.315 ± 0.093
5.708AspGly: 5.708 ± 0.227
0.832AspHis: 0.832 ± 0.067
4.635AspIle: 4.635 ± 0.144
3.023AspLys: 3.023 ± 0.123
4.27AspLeu: 4.27 ± 0.184
2.011AspMet: 2.011 ± 0.114
2.107AspAsn: 2.107 ± 0.127
2.214AspPro: 2.214 ± 0.106
0.556AspGln: 0.556 ± 0.053
3.422AspArg: 3.422 ± 0.145
3.337AspSer: 3.337 ± 0.15
2.635AspThr: 2.635 ± 0.165
5.562AspVal: 5.562 ± 0.205
0.399AspTrp: 0.399 ± 0.051
1.966AspTyr: 1.966 ± 0.111
0.0AspXaa: 0.0 ± 0.0
Glu
4.585GluAla: 4.585 ± 0.18
0.832GluCys: 0.832 ± 0.079
3.624GluAsp: 3.624 ± 0.138
5.641GluGlu: 5.641 ± 0.26
2.551GluPhe: 2.551 ± 0.132
4.528GluGly: 4.528 ± 0.17
2.584GluHis: 2.584 ± 0.138
4.034GluIle: 4.034 ± 0.169
3.489GluLys: 3.489 ± 0.159
5.798GluLeu: 5.798 ± 0.204
2.388GluMet: 2.388 ± 0.116
1.494GluAsn: 1.494 ± 0.084
2.371GluPro: 2.371 ± 0.096
2.084GluGln: 2.084 ± 0.134
4.972GluArg: 4.972 ± 0.179
3.641GluSer: 3.641 ± 0.148
1.287GluThr: 1.287 ± 0.088
5.332GluVal: 5.332 ± 0.166
0.629GluTrp: 0.629 ± 0.06
3.18GluTyr: 3.18 ± 0.151
0.0GluXaa: 0.0 ± 0.0
Phe
2.528PheAla: 2.528 ± 0.139
0.416PheCys: 0.416 ± 0.056
1.646PheAsp: 1.646 ± 0.096
1.59PheGlu: 1.59 ± 0.105
0.961PhePhe: 0.961 ± 0.086
2.236PheGly: 2.236 ± 0.107
0.534PheHis: 0.534 ± 0.053
3.18PheIle: 3.18 ± 0.151
2.259PheLys: 2.259 ± 0.097
2.556PheLeu: 2.556 ± 0.149
1.028PheMet: 1.028 ± 0.073
1.708PheAsn: 1.708 ± 0.105
1.112PhePro: 1.112 ± 0.077
0.393PheGln: 0.393 ± 0.043
1.579PheArg: 1.579 ± 0.108
2.118PheSer: 2.118 ± 0.127
1.764PheThr: 1.764 ± 0.11
1.983PheVal: 1.983 ± 0.12
0.247PheTrp: 0.247 ± 0.041
1.163PheTyr: 1.163 ± 0.087
0.0PheXaa: 0.0 ± 0.0
Gly
4.427GlyAla: 4.427 ± 0.188
0.792GlyCys: 0.792 ± 0.073
3.579GlyAsp: 3.579 ± 0.215
4.028GlyGlu: 4.028 ± 0.162
2.472GlyPhe: 2.472 ± 0.123
4.573GlyGly: 4.573 ± 0.274
0.983GlyHis: 0.983 ± 0.08
7.068GlyIle: 7.068 ± 0.211
4.978GlyLys: 4.978 ± 0.177
5.809GlyLeu: 5.809 ± 0.19
2.978GlyMet: 2.978 ± 0.114
2.815GlyAsn: 2.815 ± 0.211
1.652GlyPro: 1.652 ± 0.11
0.95GlyGln: 0.95 ± 0.082
5.006GlyArg: 5.006 ± 0.167
5.832GlySer: 5.832 ± 0.178
2.91GlyThr: 2.91 ± 0.135
5.09GlyVal: 5.09 ± 0.194
0.68GlyTrp: 0.68 ± 0.068
3.613GlyTyr: 3.613 ± 0.161
0.0GlyXaa: 0.0 ± 0.0
His
1.837HisAla: 1.837 ± 0.109
0.242HisCys: 0.242 ± 0.03
1.208HisAsp: 1.208 ± 0.081
1.169HisGlu: 1.169 ± 0.088
0.567HisPhe: 0.567 ± 0.053
1.882HisGly: 1.882 ± 0.104
0.405HisHis: 0.405 ± 0.051
1.573HisIle: 1.573 ± 0.09
0.781HisLys: 0.781 ± 0.066
1.528HisLeu: 1.528 ± 0.096
0.663HisMet: 0.663 ± 0.069
0.899HisAsn: 0.899 ± 0.075
0.882HisPro: 0.882 ± 0.077
0.219HisGln: 0.219 ± 0.035
1.0HisArg: 1.0 ± 0.076
1.023HisSer: 1.023 ± 0.074
1.107HisThr: 1.107 ± 0.077
1.697HisVal: 1.697 ± 0.1
0.27HisTrp: 0.27 ± 0.037
0.697HisTyr: 0.697 ± 0.065
0.0HisXaa: 0.0 ± 0.0
Ile
8.444IleAla: 8.444 ± 0.278
0.871IleCys: 0.871 ± 0.073
5.866IleAsp: 5.866 ± 0.191
5.455IleGlu: 5.455 ± 0.191
2.304IlePhe: 2.304 ± 0.125
5.72IleGly: 5.72 ± 0.199
1.59IleHis: 1.59 ± 0.094
5.764IleIle: 5.764 ± 0.239
4.315IleLys: 4.315 ± 0.17
7.006IleLeu: 7.006 ± 0.215
2.781IleMet: 2.781 ± 0.119
3.427IleAsn: 3.427 ± 0.134
3.394IlePro: 3.394 ± 0.142
1.41IleGln: 1.41 ± 0.11
4.916IleArg: 4.916 ± 0.173
5.534IleSer: 5.534 ± 0.195
4.276IleThr: 4.276 ± 0.188
7.394IleVal: 7.394 ± 0.207
0.494IleTrp: 0.494 ± 0.061
2.562IleTyr: 2.562 ± 0.134
0.0IleXaa: 0.0 ± 0.0
Lys
4.641LysAla: 4.641 ± 0.187
0.523LysCys: 0.523 ± 0.059
4.691LysAsp: 4.691 ± 0.175
4.641LysGlu: 4.641 ± 0.155
1.124LysPhe: 1.124 ± 0.081
5.512LysGly: 5.512 ± 0.198
1.691LysHis: 1.691 ± 0.1
3.809LysIle: 3.809 ± 0.16
2.596LysLys: 2.596 ± 0.131
3.208LysLeu: 3.208 ± 0.148
1.691LysMet: 1.691 ± 0.099
1.775LysAsn: 1.775 ± 0.095
2.41LysPro: 2.41 ± 0.14
1.612LysGln: 1.612 ± 0.095
4.36LysArg: 4.36 ± 0.161
3.629LysSer: 3.629 ± 0.183
1.556LysThr: 1.556 ± 0.092
5.72LysVal: 5.72 ± 0.174
0.371LysTrp: 0.371 ± 0.05
1.921LysTyr: 1.921 ± 0.098
0.0LysXaa: 0.0 ± 0.0
Leu
8.843LeuAla: 8.843 ± 0.25
0.905LeuCys: 0.905 ± 0.071
5.259LeuAsp: 5.259 ± 0.173
6.124LeuGlu: 6.124 ± 0.212
3.011LeuPhe: 3.011 ± 0.155
5.776LeuGly: 5.776 ± 0.181
1.73LeuHis: 1.73 ± 0.116
6.956LeuIle: 6.956 ± 0.217
5.922LeuLys: 5.922 ± 0.205
9.731LeuLeu: 9.731 ± 0.382
2.629LeuMet: 2.629 ± 0.143
3.95LeuAsn: 3.95 ± 0.159
3.231LeuPro: 3.231 ± 0.124
2.028LeuGln: 2.028 ± 0.099
5.557LeuArg: 5.557 ± 0.19
6.422LeuSer: 6.422 ± 0.217
4.107LeuThr: 4.107 ± 0.169
6.276LeuVal: 6.276 ± 0.202
0.652LeuTrp: 0.652 ± 0.071
3.467LeuTyr: 3.467 ± 0.166
0.0LeuXaa: 0.0 ± 0.0
Met
1.893MetAla: 1.893 ± 0.103
0.208MetCys: 0.208 ± 0.035
2.483MetAsp: 2.483 ± 0.13
1.697MetGlu: 1.697 ± 0.096
0.663MetPhe: 0.663 ± 0.066
2.152MetGly: 2.152 ± 0.116
1.174MetHis: 1.174 ± 0.074
2.618MetIle: 2.618 ± 0.117
2.011MetLys: 2.011 ± 0.105
5.911MetLeu: 5.911 ± 0.205
1.146MetMet: 1.146 ± 0.074
1.393MetAsn: 1.393 ± 0.102
1.427MetPro: 1.427 ± 0.097
1.011MetGln: 1.011 ± 0.076
2.045MetArg: 2.045 ± 0.111
2.096MetSer: 2.096 ± 0.109
0.758MetThr: 0.758 ± 0.064
3.118MetVal: 3.118 ± 0.129
0.129MetTrp: 0.129 ± 0.024
0.95MetTyr: 0.95 ± 0.073
0.0MetXaa: 0.0 ± 0.0
Asn
3.635AsnAla: 3.635 ± 0.157
0.36AsnCys: 0.36 ± 0.04
2.371AsnAsp: 2.371 ± 0.194
2.079AsnGlu: 2.079 ± 0.116
1.034AsnPhe: 1.034 ± 0.092
3.332AsnGly: 3.332 ± 0.149
0.539AsnHis: 0.539 ± 0.062
3.343AsnIle: 3.343 ± 0.15
1.978AsnLys: 1.978 ± 0.107
3.225AsnLeu: 3.225 ± 0.141
1.377AsnMet: 1.377 ± 0.1
2.422AsnAsn: 2.422 ± 0.194
1.798AsnPro: 1.798 ± 0.102
0.41AsnGln: 0.41 ± 0.05
2.326AsnArg: 2.326 ± 0.123
2.983AsnSer: 2.983 ± 0.141
2.084AsnThr: 2.084 ± 0.123
3.113AsnVal: 3.113 ± 0.134
0.197AsnTrp: 0.197 ± 0.035
1.388AsnTyr: 1.388 ± 0.107
0.0AsnXaa: 0.0 ± 0.0
Pro
2.135ProAla: 2.135 ± 0.126
0.27ProCys: 0.27 ± 0.039
2.073ProAsp: 2.073 ± 0.114
2.264ProGlu: 2.264 ± 0.14
1.5ProPhe: 1.5 ± 0.088
1.714ProGly: 1.714 ± 0.132
0.494ProHis: 0.494 ± 0.052
2.658ProIle: 2.658 ± 0.124
1.753ProLys: 1.753 ± 0.09
3.669ProLeu: 3.669 ± 0.152
1.062ProMet: 1.062 ± 0.067
1.236ProAsn: 1.236 ± 0.08
1.202ProPro: 1.202 ± 0.102
0.584ProGln: 0.584 ± 0.066
1.775ProArg: 1.775 ± 0.103
2.613ProSer: 2.613 ± 0.115
1.927ProThr: 1.927 ± 0.104
2.641ProVal: 2.641 ± 0.147
0.433ProTrp: 0.433 ± 0.052
1.809ProTyr: 1.809 ± 0.112
0.0ProXaa: 0.0 ± 0.0
Gln
1.219GlnAla: 1.219 ± 0.081
0.236GlnCys: 0.236 ± 0.037
1.011GlnAsp: 1.011 ± 0.081
1.635GlnGlu: 1.635 ± 0.132
0.73GlnPhe: 0.73 ± 0.077
1.478GlnGly: 1.478 ± 0.091
0.579GlnHis: 0.579 ± 0.054
1.36GlnIle: 1.36 ± 0.097
0.854GlnLys: 0.854 ± 0.078
1.669GlnLeu: 1.669 ± 0.096
0.714GlnMet: 0.714 ± 0.065
0.461GlnAsn: 0.461 ± 0.051
0.562GlnPro: 0.562 ± 0.064
0.781GlnGln: 0.781 ± 0.107
1.084GlnArg: 1.084 ± 0.081
1.067GlnSer: 1.067 ± 0.087
0.596GlnThr: 0.596 ± 0.063
1.719GlnVal: 1.719 ± 0.12
0.202GlnTrp: 0.202 ± 0.035
0.719GlnTyr: 0.719 ± 0.073
0.0GlnXaa: 0.0 ± 0.0
Arg
4.321ArgAla: 4.321 ± 0.17
1.067ArgCys: 1.067 ± 0.091
4.034ArgAsp: 4.034 ± 0.172
4.939ArgGlu: 4.939 ± 0.187
2.775ArgPhe: 2.775 ± 0.124
3.865ArgGly: 3.865 ± 0.161
1.264ArgHis: 1.264 ± 0.081
5.517ArgIle: 5.517 ± 0.194
2.944ArgLys: 2.944 ± 0.132
8.062ArgLeu: 8.062 ± 0.218
2.461ArgMet: 2.461 ± 0.117
1.978ArgAsn: 1.978 ± 0.119
1.511ArgPro: 1.511 ± 0.081
1.259ArgGln: 1.259 ± 0.076
4.5ArgArg: 4.5 ± 0.195
4.197ArgSer: 4.197 ± 0.162
1.556ArgThr: 1.556 ± 0.099
5.405ArgVal: 5.405 ± 0.194
0.792ArgTrp: 0.792 ± 0.078
3.085ArgTyr: 3.085 ± 0.143
0.0ArgXaa: 0.0 ± 0.0
Ser
4.197SerAla: 4.197 ± 0.157
0.73SerCys: 0.73 ± 0.066
3.298SerAsp: 3.298 ± 0.15
2.714SerGlu: 2.714 ± 0.132
1.5SerPhe: 1.5 ± 0.099
3.641SerGly: 3.641 ± 0.162
0.685SerHis: 0.685 ± 0.066
9.293SerIle: 9.293 ± 0.235
5.534SerLys: 5.534 ± 0.168
5.439SerLeu: 5.439 ± 0.191
3.231SerMet: 3.231 ± 0.144
4.382SerAsn: 4.382 ± 0.178
1.832SerPro: 1.832 ± 0.107
0.871SerGln: 0.871 ± 0.067
4.905SerArg: 4.905 ± 0.187
7.81SerSer: 7.81 ± 0.298
3.837SerThr: 3.837 ± 0.135
4.422SerVal: 4.422 ± 0.162
0.421SerTrp: 0.421 ± 0.049
2.163SerTyr: 2.163 ± 0.131
0.0SerXaa: 0.0 ± 0.0
Thr
3.422ThrAla: 3.422 ± 0.146
0.523ThrCys: 0.523 ± 0.055
1.865ThrAsp: 1.865 ± 0.11
1.629ThrGlu: 1.629 ± 0.086
1.775ThrPhe: 1.775 ± 0.101
3.41ThrGly: 3.41 ± 0.159
0.832ThrHis: 0.832 ± 0.067
3.854ThrIle: 3.854 ± 0.179
1.708ThrLys: 1.708 ± 0.098
4.882ThrLeu: 4.882 ± 0.165
1.242ThrMet: 1.242 ± 0.082
1.927ThrAsn: 1.927 ± 0.111
1.877ThrPro: 1.877 ± 0.108
0.865ThrGln: 0.865 ± 0.066
2.045ThrArg: 2.045 ± 0.112
3.113ThrSer: 3.113 ± 0.138
2.006ThrThr: 2.006 ± 0.107
3.264ThrVal: 3.264 ± 0.151
0.376ThrTrp: 0.376 ± 0.049
1.483ThrTyr: 1.483 ± 0.097
0.0ThrXaa: 0.0 ± 0.0
Val
6.163ValAla: 6.163 ± 0.235
0.86ValCys: 0.86 ± 0.074
5.382ValAsp: 5.382 ± 0.208
6.191ValGlu: 6.191 ± 0.209
2.371ValPhe: 2.371 ± 0.132
4.978ValGly: 4.978 ± 0.199
1.32ValHis: 1.32 ± 0.088
5.984ValIle: 5.984 ± 0.177
5.321ValLys: 5.321 ± 0.196
6.736ValLeu: 6.736 ± 0.2
2.568ValMet: 2.568 ± 0.122
3.006ValAsn: 3.006 ± 0.129
2.5ValPro: 2.5 ± 0.127
1.455ValGln: 1.455 ± 0.093
5.343ValArg: 5.343 ± 0.168
4.832ValSer: 4.832 ± 0.155
3.719ValThr: 3.719 ± 0.15
6.686ValVal: 6.686 ± 0.192
0.747ValTrp: 0.747 ± 0.069
2.91ValTyr: 2.91 ± 0.115
0.0ValXaa: 0.0 ± 0.0
Trp
0.399TrpAla: 0.399 ± 0.052
0.118TrpCys: 0.118 ± 0.027
0.5TrpAsp: 0.5 ± 0.056
0.36TrpGlu: 0.36 ± 0.044
0.494TrpPhe: 0.494 ± 0.059
0.506TrpGly: 0.506 ± 0.059
0.23TrpHis: 0.23 ± 0.037
0.803TrpIle: 0.803 ± 0.077
0.579TrpLys: 0.579 ± 0.065
0.792TrpLeu: 0.792 ± 0.075
0.281TrpMet: 0.281 ± 0.04
0.292TrpAsn: 0.292 ± 0.044
0.247TrpPro: 0.247 ± 0.042
0.258TrpGln: 0.258 ± 0.036
0.494TrpArg: 0.494 ± 0.058
0.596TrpSer: 0.596 ± 0.062
0.23TrpThr: 0.23 ± 0.04
0.618TrpVal: 0.618 ± 0.062
0.101TrpTrp: 0.101 ± 0.023
0.371TrpTyr: 0.371 ± 0.051
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.568TyrAla: 3.568 ± 0.151
0.427TyrCys: 0.427 ± 0.05
2.377TyrAsp: 2.377 ± 0.107
2.416TyrGlu: 2.416 ± 0.115
1.112TyrPhe: 1.112 ± 0.091
2.95TyrGly: 2.95 ± 0.148
0.815TyrHis: 0.815 ± 0.071
3.242TyrIle: 3.242 ± 0.152
2.068TyrLys: 2.068 ± 0.134
3.107TyrLeu: 3.107 ± 0.151
1.202TyrMet: 1.202 ± 0.081
1.775TyrAsn: 1.775 ± 0.11
1.421TyrPro: 1.421 ± 0.084
0.506TyrGln: 0.506 ± 0.054
2.489TyrArg: 2.489 ± 0.125
2.804TyrSer: 2.804 ± 0.121
2.18TyrThr: 2.18 ± 0.12
2.466TyrVal: 2.466 ± 0.121
0.326TyrTrp: 0.326 ± 0.049
1.641TyrTyr: 1.641 ± 0.1
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 721 proteins (177987 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski