Amino acid dipepetide frequency for Kurlavirus BKC-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.562AlaAla: 3.562 ± 0.263
1.095AlaCys: 1.095 ± 0.099
2.16AlaAsp: 2.16 ± 0.129
3.994AlaGlu: 3.994 ± 0.203
2.89AlaPhe: 2.89 ± 0.17
2.4AlaGly: 2.4 ± 0.196
0.874AlaHis: 0.874 ± 0.091
2.64AlaIle: 2.64 ± 0.19
4.848AlaLys: 4.848 ± 0.337
4.733AlaLeu: 4.733 ± 0.248
1.267AlaMet: 1.267 ± 0.106
2.093AlaAsn: 2.093 ± 0.17
1.968AlaPro: 1.968 ± 0.173
1.651AlaGln: 1.651 ± 0.133
2.525AlaArg: 2.525 ± 0.167
4.503AlaSer: 4.503 ± 0.241
3.303AlaThr: 3.303 ± 0.265
3.572AlaVal: 3.572 ± 0.201
0.461AlaTrp: 0.461 ± 0.064
1.469AlaTyr: 1.469 ± 0.113
0.0AlaXaa: 0.0 ± 0.0
Cys
1.315CysAla: 1.315 ± 0.11
0.643CysCys: 0.643 ± 0.081
1.296CysAsp: 1.296 ± 0.128
1.843CysGlu: 1.843 ± 0.182
1.795CysPhe: 1.795 ± 0.185
2.227CysGly: 2.227 ± 0.217
0.49CysHis: 0.49 ± 0.067
1.287CysIle: 1.287 ± 0.117
1.843CysLys: 1.843 ± 0.184
2.093CysLeu: 2.093 ± 0.153
0.403CysMet: 0.403 ± 0.058
0.816CysAsn: 0.816 ± 0.096
1.671CysPro: 1.671 ± 0.182
0.749CysGln: 0.749 ± 0.094
1.171CysArg: 1.171 ± 0.117
2.458CysSer: 2.458 ± 0.2
1.037CysThr: 1.037 ± 0.109
1.517CysVal: 1.517 ± 0.132
0.336CysTrp: 0.336 ± 0.059
0.682CysTyr: 0.682 ± 0.091
0.0CysXaa: 0.0 ± 0.0
Asp
2.525AspAla: 2.525 ± 0.164
1.229AspCys: 1.229 ± 0.138
2.515AspAsp: 2.515 ± 0.161
4.042AspGlu: 4.042 ± 0.25
2.967AspPhe: 2.967 ± 0.18
4.061AspGly: 4.061 ± 0.239
0.557AspHis: 0.557 ± 0.075
3.668AspIle: 3.668 ± 0.187
3.735AspLys: 3.735 ± 0.228
3.524AspLeu: 3.524 ± 0.219
1.056AspMet: 1.056 ± 0.101
1.738AspAsn: 1.738 ± 0.143
1.949AspPro: 1.949 ± 0.143
1.075AspGln: 1.075 ± 0.102
2.362AspArg: 2.362 ± 0.123
3.197AspSer: 3.197 ± 0.189
2.266AspThr: 2.266 ± 0.127
3.716AspVal: 3.716 ± 0.195
0.816AspTrp: 0.816 ± 0.101
1.546AspTyr: 1.546 ± 0.115
0.0AspXaa: 0.0 ± 0.0
Glu
4.032GluAla: 4.032 ± 0.193
1.843GluCys: 1.843 ± 0.177
4.215GluAsp: 4.215 ± 0.28
9.111GluGlu: 9.111 ± 0.571
4.589GluPhe: 4.589 ± 0.183
4.464GluGly: 4.464 ± 0.177
1.565GluHis: 1.565 ± 0.129
4.282GluIle: 4.282 ± 0.207
9.582GluLys: 9.582 ± 0.386
6.586GluLeu: 6.586 ± 0.27
2.045GluMet: 2.045 ± 0.138
4.196GluAsn: 4.196 ± 0.211
1.968GluPro: 1.968 ± 0.156
2.861GluGln: 2.861 ± 0.187
5.29GluArg: 5.29 ± 0.278
4.052GluSer: 4.052 ± 0.217
5.021GluThr: 5.021 ± 0.216
4.119GluVal: 4.119 ± 0.197
1.2GluTrp: 1.2 ± 0.106
2.755GluTyr: 2.755 ± 0.148
0.0GluXaa: 0.0 ± 0.0
Phe
3.61PheAla: 3.61 ± 0.2
2.247PheCys: 2.247 ± 0.27
3.072PheAsp: 3.072 ± 0.155
5.069PheGlu: 5.069 ± 0.28
3.024PhePhe: 3.024 ± 0.17
3.84PheGly: 3.84 ± 0.18
0.845PheHis: 0.845 ± 0.084
1.959PheIle: 1.959 ± 0.133
2.333PheLys: 2.333 ± 0.152
6.548PheLeu: 6.548 ± 0.272
1.095PheMet: 1.095 ± 0.111
1.171PheAsn: 1.171 ± 0.114
2.304PhePro: 2.304 ± 0.172
1.603PheGln: 1.603 ± 0.12
2.919PheArg: 2.919 ± 0.188
5.29PheSer: 5.29 ± 0.233
2.074PheThr: 2.074 ± 0.131
4.714PheVal: 4.714 ± 0.214
1.411PheTrp: 1.411 ± 0.187
1.719PheTyr: 1.719 ± 0.142
0.0PheXaa: 0.0 ± 0.0
Gly
3.274GlyAla: 3.274 ± 0.212
1.546GlyCys: 1.546 ± 0.161
2.602GlyAsp: 2.602 ± 0.206
4.82GlyGlu: 4.82 ± 0.291
2.679GlyPhe: 2.679 ± 0.19
3.187GlyGly: 3.187 ± 0.222
1.181GlyHis: 1.181 ± 0.111
3.744GlyIle: 3.744 ± 0.196
7.095GlyLys: 7.095 ± 0.325
4.311GlyLeu: 4.311 ± 0.203
1.181GlyMet: 1.181 ± 0.121
3.015GlyAsn: 3.015 ± 0.209
1.555GlyPro: 1.555 ± 0.119
2.122GlyGln: 2.122 ± 0.173
3.226GlyArg: 3.226 ± 0.179
4.359GlySer: 4.359 ± 0.245
4.551GlyThr: 4.551 ± 0.278
4.061GlyVal: 4.061 ± 0.238
0.883GlyTrp: 0.883 ± 0.095
2.554GlyTyr: 2.554 ± 0.175
0.0GlyXaa: 0.0 ± 0.0
His
0.662HisAla: 0.662 ± 0.104
0.547HisCys: 0.547 ± 0.077
0.509HisAsp: 0.509 ± 0.071
1.267HisGlu: 1.267 ± 0.141
0.96HisPhe: 0.96 ± 0.09
1.834HisGly: 1.834 ± 0.181
0.307HisHis: 0.307 ± 0.054
1.296HisIle: 1.296 ± 0.112
1.565HisLys: 1.565 ± 0.149
1.44HisLeu: 1.44 ± 0.123
0.326HisMet: 0.326 ± 0.058
0.73HisAsn: 0.73 ± 0.097
0.845HisPro: 0.845 ± 0.085
0.691HisGln: 0.691 ± 0.077
0.922HisArg: 0.922 ± 0.091
1.315HisSer: 1.315 ± 0.127
0.749HisThr: 0.749 ± 0.089
0.826HisVal: 0.826 ± 0.076
0.269HisTrp: 0.269 ± 0.066
0.624HisTyr: 0.624 ± 0.077
0.0HisXaa: 0.0 ± 0.0
Ile
2.899IleAla: 2.899 ± 0.199
1.354IleCys: 1.354 ± 0.117
2.429IleAsp: 2.429 ± 0.127
2.986IleGlu: 2.986 ± 0.169
3.207IlePhe: 3.207 ± 0.156
3.12IleGly: 3.12 ± 0.232
0.96IleHis: 0.96 ± 0.093
2.554IleIle: 2.554 ± 0.16
3.514IleLys: 3.514 ± 0.185
5.232IleLeu: 5.232 ± 0.226
0.893IleMet: 0.893 ± 0.094
1.517IleAsn: 1.517 ± 0.12
2.688IlePro: 2.688 ± 0.167
1.891IleGln: 1.891 ± 0.136
2.688IleArg: 2.688 ± 0.195
4.685IleSer: 4.685 ± 0.25
2.199IleThr: 2.199 ± 0.15
3.543IleVal: 3.543 ± 0.2
0.816IleTrp: 0.816 ± 0.088
1.2IleTyr: 1.2 ± 0.122
0.0IleXaa: 0.0 ± 0.0
Lys
4.512LysAla: 4.512 ± 0.269
1.613LysCys: 1.613 ± 0.184
4.685LysAsp: 4.685 ± 0.235
8.938LysGlu: 8.938 ± 0.383
3.946LysPhe: 3.946 ± 0.227
4.589LysGly: 4.589 ± 0.197
2.151LysHis: 2.151 ± 0.166
4.186LysIle: 4.186 ± 0.211
11.694LysLys: 11.694 ± 0.697
6.135LysLeu: 6.135 ± 0.211
1.661LysMet: 1.661 ± 0.127
4.916LysAsn: 4.916 ± 0.263
2.083LysPro: 2.083 ± 0.166
2.544LysGln: 2.544 ± 0.165
5.511LysArg: 5.511 ± 0.253
4.608LysSer: 4.608 ± 0.263
5.434LysThr: 5.434 ± 0.282
4.906LysVal: 4.906 ± 0.244
0.883LysTrp: 0.883 ± 0.079
3.668LysTyr: 3.668 ± 0.19
0.0LysXaa: 0.0 ± 0.0
Leu
4.752LeuAla: 4.752 ± 0.226
2.947LeuCys: 2.947 ± 0.221
4.378LeuAsp: 4.378 ± 0.22
7.873LeuGlu: 7.873 ± 0.373
5.117LeuPhe: 5.117 ± 0.265
5.165LeuGly: 5.165 ± 0.263
1.373LeuHis: 1.373 ± 0.103
2.727LeuIle: 2.727 ± 0.168
5.847LeuLys: 5.847 ± 0.3
8.622LeuLeu: 8.622 ± 0.34
1.555LeuMet: 1.555 ± 0.12
2.794LeuAsn: 2.794 ± 0.175
4.608LeuPro: 4.608 ± 0.218
3.639LeuGln: 3.639 ± 0.336
4.292LeuArg: 4.292 ± 0.24
7.681LeuSer: 7.681 ± 0.311
3.303LeuThr: 3.303 ± 0.196
5.732LeuVal: 5.732 ± 0.214
1.671LeuTrp: 1.671 ± 0.111
2.506LeuTyr: 2.506 ± 0.167
0.0LeuXaa: 0.0 ± 0.0
Met
1.2MetAla: 1.2 ± 0.099
0.499MetCys: 0.499 ± 0.071
0.902MetAsp: 0.902 ± 0.085
1.767MetGlu: 1.767 ± 0.127
0.922MetPhe: 0.922 ± 0.088
1.277MetGly: 1.277 ± 0.102
0.384MetHis: 0.384 ± 0.06
0.662MetIle: 0.662 ± 0.074
1.383MetLys: 1.383 ± 0.127
1.536MetLeu: 1.536 ± 0.112
0.48MetMet: 0.48 ± 0.062
0.979MetAsn: 0.979 ± 0.093
0.518MetPro: 0.518 ± 0.082
1.143MetGln: 1.143 ± 0.093
0.941MetArg: 0.941 ± 0.108
1.872MetSer: 1.872 ± 0.124
1.248MetThr: 1.248 ± 0.109
0.979MetVal: 0.979 ± 0.089
0.221MetTrp: 0.221 ± 0.052
0.614MetTyr: 0.614 ± 0.076
0.0MetXaa: 0.0 ± 0.0
Asn
2.535AsnAla: 2.535 ± 0.167
0.845AsnCys: 0.845 ± 0.088
1.469AsnAsp: 1.469 ± 0.124
2.323AsnGlu: 2.323 ± 0.173
2.813AsnPhe: 2.813 ± 0.154
3.466AsnGly: 3.466 ± 0.219
0.624AsnHis: 0.624 ± 0.086
3.956AsnIle: 3.956 ± 0.213
3.552AsnLys: 3.552 ± 0.198
3.139AsnLeu: 3.139 ± 0.181
0.806AsnMet: 0.806 ± 0.084
1.786AsnAsn: 1.786 ± 0.151
2.093AsnPro: 2.093 ± 0.19
0.816AsnGln: 0.816 ± 0.086
1.536AsnArg: 1.536 ± 0.099
2.967AsnSer: 2.967 ± 0.168
2.439AsnThr: 2.439 ± 0.255
2.938AsnVal: 2.938 ± 0.147
0.451AsnTrp: 0.451 ± 0.065
1.392AsnTyr: 1.392 ± 0.108
0.0AsnXaa: 0.0 ± 0.0
Pro
1.431ProAla: 1.431 ± 0.16
0.816ProCys: 0.816 ± 0.087
2.103ProAsp: 2.103 ± 0.15
4.071ProGlu: 4.071 ± 0.212
2.343ProPhe: 2.343 ± 0.161
2.256ProGly: 2.256 ± 0.169
0.71ProHis: 0.71 ± 0.088
1.882ProIle: 1.882 ± 0.133
3.485ProLys: 3.485 ± 0.23
3.12ProLeu: 3.12 ± 0.206
0.71ProMet: 0.71 ± 0.084
1.642ProAsn: 1.642 ± 0.144
1.613ProPro: 1.613 ± 0.179
1.517ProGln: 1.517 ± 0.124
1.93ProArg: 1.93 ± 0.159
3.139ProSer: 3.139 ± 0.477
2.247ProThr: 2.247 ± 0.209
2.458ProVal: 2.458 ± 0.169
0.538ProTrp: 0.538 ± 0.084
1.325ProTyr: 1.325 ± 0.106
0.0ProXaa: 0.0 ± 0.0
Gln
1.459GlnAla: 1.459 ± 0.135
0.528GlnCys: 0.528 ± 0.088
1.479GlnAsp: 1.479 ± 0.114
3.236GlnGlu: 3.236 ± 0.182
1.018GlnPhe: 1.018 ± 0.087
2.16GlnGly: 2.16 ± 0.226
0.701GlnHis: 0.701 ± 0.072
1.651GlnIle: 1.651 ± 0.137
4.32GlnLys: 4.32 ± 0.229
2.103GlnLeu: 2.103 ± 0.143
0.941GlnMet: 0.941 ± 0.099
1.872GlnAsn: 1.872 ± 0.137
0.902GlnPro: 0.902 ± 0.118
1.335GlnGln: 1.335 ± 0.138
2.583GlnArg: 2.583 ± 0.166
1.834GlnSer: 1.834 ± 0.145
2.285GlnThr: 2.285 ± 0.218
2.208GlnVal: 2.208 ± 0.224
0.432GlnTrp: 0.432 ± 0.069
0.874GlnTyr: 0.874 ± 0.091
0.0GlnXaa: 0.0 ± 0.0
Arg
2.304ArgAla: 2.304 ± 0.155
1.191ArgCys: 1.191 ± 0.111
2.688ArgAsp: 2.688 ± 0.191
5.376ArgGlu: 5.376 ± 0.254
2.333ArgPhe: 2.333 ± 0.14
2.842ArgGly: 2.842 ± 0.184
1.027ArgHis: 1.027 ± 0.111
3.043ArgIle: 3.043 ± 0.145
5.434ArgLys: 5.434 ± 0.271
4.81ArgLeu: 4.81 ± 0.269
1.287ArgMet: 1.287 ± 0.111
2.736ArgAsn: 2.736 ± 0.188
1.555ArgPro: 1.555 ± 0.14
1.872ArgGln: 1.872 ± 0.131
2.899ArgArg: 2.899 ± 0.201
2.928ArgSer: 2.928 ± 0.167
2.621ArgThr: 2.621 ± 0.206
3.706ArgVal: 3.706 ± 0.206
0.758ArgTrp: 0.758 ± 0.084
1.767ArgTyr: 1.767 ± 0.129
0.0ArgXaa: 0.0 ± 0.0
Ser
3.629SerAla: 3.629 ± 0.214
1.968SerCys: 1.968 ± 0.155
3.706SerAsp: 3.706 ± 0.194
5.626SerGlu: 5.626 ± 0.235
5.665SerPhe: 5.665 ± 0.327
5.261SerGly: 5.261 ± 0.231
1.306SerHis: 1.306 ± 0.134
2.842SerIle: 2.842 ± 0.169
5.645SerLys: 5.645 ± 0.272
7.767SerLeu: 7.767 ± 0.313
1.085SerMet: 1.085 ± 0.094
3.101SerAsn: 3.101 ± 0.189
3.389SerPro: 3.389 ± 0.35
2.803SerGln: 2.803 ± 0.171
3.908SerArg: 3.908 ± 0.194
6.692SerSer: 6.692 ± 0.388
3.293SerThr: 3.293 ± 0.256
4.82SerVal: 4.82 ± 0.218
1.219SerTrp: 1.219 ± 0.104
1.959SerTyr: 1.959 ± 0.167
0.0SerXaa: 0.0 ± 0.0
Thr
2.419ThrAla: 2.419 ± 0.191
1.306ThrCys: 1.306 ± 0.117
2.131ThrAsp: 2.131 ± 0.146
3.639ThrGlu: 3.639 ± 0.181
3.034ThrPhe: 3.034 ± 0.182
3.428ThrGly: 3.428 ± 0.271
0.662ThrHis: 0.662 ± 0.073
2.688ThrIle: 2.688 ± 0.17
5.031ThrLys: 5.031 ± 0.236
5.098ThrLeu: 5.098 ± 0.369
0.806ThrMet: 0.806 ± 0.086
2.64ThrAsn: 2.64 ± 0.279
2.899ThrPro: 2.899 ± 0.328
1.949ThrGln: 1.949 ± 0.15
3.101ThrArg: 3.101 ± 0.171
3.687ThrSer: 3.687 ± 0.217
3.456ThrThr: 3.456 ± 0.306
2.986ThrVal: 2.986 ± 0.185
0.691ThrTrp: 0.691 ± 0.081
1.45ThrTyr: 1.45 ± 0.121
0.0ThrXaa: 0.0 ± 0.0
Val
3.639ValAla: 3.639 ± 0.234
1.949ValCys: 1.949 ± 0.158
3.351ValAsp: 3.351 ± 0.187
4.349ValGlu: 4.349 ± 0.249
4.292ValPhe: 4.292 ± 0.218
3.36ValGly: 3.36 ± 0.231
1.143ValHis: 1.143 ± 0.097
2.957ValIle: 2.957 ± 0.183
3.879ValLys: 3.879 ± 0.221
6.337ValLeu: 6.337 ± 0.238
1.037ValMet: 1.037 ± 0.096
1.987ValAsn: 1.987 ± 0.144
3.312ValPro: 3.312 ± 0.177
2.131ValGln: 2.131 ± 0.145
3.063ValArg: 3.063 ± 0.183
6.577ValSer: 6.577 ± 0.336
2.986ValThr: 2.986 ± 0.165
4.551ValVal: 4.551 ± 0.225
1.104ValTrp: 1.104 ± 0.097
2.304ValTyr: 2.304 ± 0.132
0.0ValXaa: 0.0 ± 0.0
Trp
0.538TrpAla: 0.538 ± 0.073
0.576TrpCys: 0.576 ± 0.089
0.854TrpAsp: 0.854 ± 0.094
1.027TrpGlu: 1.027 ± 0.108
1.277TrpPhe: 1.277 ± 0.142
0.499TrpGly: 0.499 ± 0.09
0.211TrpHis: 0.211 ± 0.048
0.691TrpIle: 0.691 ± 0.074
1.671TrpLys: 1.671 ± 0.141
1.104TrpLeu: 1.104 ± 0.111
0.374TrpMet: 0.374 ± 0.065
0.902TrpAsn: 0.902 ± 0.111
0.278TrpPro: 0.278 ± 0.054
0.432TrpGln: 0.432 ± 0.063
0.816TrpArg: 0.816 ± 0.092
1.296TrpSer: 1.296 ± 0.122
0.826TrpThr: 0.826 ± 0.086
0.778TrpVal: 0.778 ± 0.091
0.202TrpTrp: 0.202 ± 0.04
0.595TrpTyr: 0.595 ± 0.077
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.709TyrAla: 1.709 ± 0.132
0.931TyrCys: 0.931 ± 0.109
1.997TyrAsp: 1.997 ± 0.158
2.419TyrGlu: 2.419 ± 0.162
1.978TyrPhe: 1.978 ± 0.153
2.506TyrGly: 2.506 ± 0.175
0.614TyrHis: 0.614 ± 0.087
1.661TyrIle: 1.661 ± 0.122
2.208TyrLys: 2.208 ± 0.16
2.343TyrLeu: 2.343 ± 0.149
0.509TyrMet: 0.509 ± 0.077
1.315TyrAsn: 1.315 ± 0.12
1.133TyrPro: 1.133 ± 0.12
1.095TyrGln: 1.095 ± 0.106
1.498TyrArg: 1.498 ± 0.115
2.554TyrSer: 2.554 ± 0.155
1.767TyrThr: 1.767 ± 0.128
2.179TyrVal: 2.179 ± 0.132
0.586TyrTrp: 0.586 ± 0.081
0.95TyrTyr: 0.95 ± 0.102
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 386 proteins (104158 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski