Amino acid dipepetide frequency for Brazilian cedratvirus IHUMI

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.649AlaAla: 3.649 ± 0.264
1.32AlaCys: 1.32 ± 0.119
2.391AlaAsp: 2.391 ± 0.204
3.307AlaGlu: 3.307 ± 0.16
2.484AlaPhe: 2.484 ± 0.135
2.927AlaGly: 2.927 ± 0.2
0.932AlaHis: 0.932 ± 0.082
2.756AlaIle: 2.756 ± 0.131
2.942AlaLys: 2.942 ± 0.152
6.855AlaLeu: 6.855 ± 0.232
0.862AlaMet: 0.862 ± 0.134
2.127AlaAsn: 2.127 ± 0.136
2.034AlaPro: 2.034 ± 0.159
2.438AlaGln: 2.438 ± 0.201
3.493AlaArg: 3.493 ± 0.194
4.363AlaSer: 4.363 ± 0.22
3.462AlaThr: 3.462 ± 0.411
3.408AlaVal: 3.408 ± 0.184
0.652AlaTrp: 0.652 ± 0.067
2.406AlaTyr: 2.406 ± 0.134
0.008AlaXaa: 0.008 ± 0.008
Cys
1.195CysAla: 1.195 ± 0.115
0.442CysCys: 0.442 ± 0.06
1.071CysAsp: 1.071 ± 0.128
0.963CysGlu: 0.963 ± 0.096
1.025CysPhe: 1.025 ± 0.1
0.862CysGly: 0.862 ± 0.1
0.435CysHis: 0.435 ± 0.083
1.064CysIle: 1.064 ± 0.108
1.537CysLys: 1.537 ± 0.12
2.321CysLeu: 2.321 ± 0.142
0.489CysMet: 0.489 ± 0.056
1.017CysAsn: 1.017 ± 0.107
2.003CysPro: 2.003 ± 0.239
0.505CysGln: 0.505 ± 0.074
1.149CysArg: 1.149 ± 0.103
2.701CysSer: 2.701 ± 0.185
1.312CysThr: 1.312 ± 0.127
1.304CysVal: 1.304 ± 0.113
0.186CysTrp: 0.186 ± 0.045
0.924CysTyr: 0.924 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
2.205AspAla: 2.205 ± 0.153
1.149AspCys: 1.149 ± 0.104
2.5AspAsp: 2.5 ± 0.162
3.944AspGlu: 3.944 ± 0.165
2.779AspPhe: 2.779 ± 0.153
2.818AspGly: 2.818 ± 0.228
0.761AspHis: 0.761 ± 0.087
3.392AspIle: 3.392 ± 0.185
3.167AspLys: 3.167 ± 0.164
6.234AspLeu: 6.234 ± 0.253
1.172AspMet: 1.172 ± 0.096
2.096AspAsn: 2.096 ± 0.125
1.428AspPro: 1.428 ± 0.088
1.227AspGln: 1.227 ± 0.092
2.647AspArg: 2.647 ± 0.125
2.414AspSer: 2.414 ± 0.143
2.174AspThr: 2.174 ± 0.133
3.338AspVal: 3.338 ± 0.187
0.784AspTrp: 0.784 ± 0.081
2.996AspTyr: 2.996 ± 0.18
0.0AspXaa: 0.0 ± 0.0
Glu
3.718GluAla: 3.718 ± 0.195
0.939GluCys: 0.939 ± 0.087
3.998GluAsp: 3.998 ± 0.193
9.533GluGlu: 9.533 ± 0.353
2.818GluPhe: 2.818 ± 0.135
4.619GluGly: 4.619 ± 0.199
1.467GluHis: 1.467 ± 0.126
4.774GluIle: 4.774 ± 0.198
6.381GluLys: 6.381 ± 0.316
5.799GluLeu: 5.799 ± 0.229
1.801GluMet: 1.801 ± 0.131
3.416GluAsn: 3.416 ± 0.171
1.731GluPro: 1.731 ± 0.166
3.028GluGln: 3.028 ± 0.203
4.953GluArg: 4.953 ± 0.236
3.509GluSer: 3.509 ± 0.162
3.478GluThr: 3.478 ± 0.248
5.023GluVal: 5.023 ± 0.188
0.916GluTrp: 0.916 ± 0.081
2.701GluTyr: 2.701 ± 0.146
0.0GluXaa: 0.0 ± 0.0
Phe
3.175PheAla: 3.175 ± 0.157
1.095PheCys: 1.095 ± 0.113
2.065PheAsp: 2.065 ± 0.137
1.607PheGlu: 1.607 ± 0.115
2.569PhePhe: 2.569 ± 0.134
2.197PheGly: 2.197 ± 0.128
0.629PheHis: 0.629 ± 0.064
2.624PheIle: 2.624 ± 0.144
1.133PheLys: 1.133 ± 0.097
4.828PheLeu: 4.828 ± 0.208
0.862PheMet: 0.862 ± 0.075
1.925PheAsn: 1.925 ± 0.13
2.127PhePro: 2.127 ± 0.139
1.102PheGln: 1.102 ± 0.083
2.197PheArg: 2.197 ± 0.123
4.572PheSer: 4.572 ± 0.196
3.641PheThr: 3.641 ± 0.153
3.579PheVal: 3.579 ± 0.172
0.978PheTrp: 0.978 ± 0.085
2.74PheTyr: 2.74 ± 0.152
0.0PheXaa: 0.0 ± 0.0
Gly
4.27GlyAla: 4.27 ± 0.718
2.042GlyCys: 2.042 ± 0.241
2.616GlyAsp: 2.616 ± 0.149
5.224GlyGlu: 5.224 ± 0.448
2.453GlyPhe: 2.453 ± 0.137
3.26GlyGly: 3.26 ± 0.164
1.335GlyHis: 1.335 ± 0.163
2.911GlyIle: 2.911 ± 0.157
3.982GlyLys: 3.982 ± 0.238
4.891GlyLeu: 4.891 ± 0.213
1.087GlyMet: 1.087 ± 0.091
3.912GlyAsn: 3.912 ± 0.406
2.197GlyPro: 2.197 ± 0.298
2.073GlyGln: 2.073 ± 0.136
2.911GlyArg: 2.911 ± 0.157
3.757GlySer: 3.757 ± 0.208
2.779GlyThr: 2.779 ± 0.201
4.968GlyVal: 4.968 ± 0.589
0.613GlyTrp: 0.613 ± 0.063
3.237GlyTyr: 3.237 ± 0.161
0.0GlyXaa: 0.0 ± 0.0
His
1.064HisAla: 1.064 ± 0.146
0.458HisCys: 0.458 ± 0.069
0.893HisAsp: 0.893 ± 0.085
1.304HisGlu: 1.304 ± 0.103
0.629HisPhe: 0.629 ± 0.076
0.939HisGly: 0.939 ± 0.087
0.536HisHis: 0.536 ± 0.094
1.149HisIle: 1.149 ± 0.108
1.025HisLys: 1.025 ± 0.103
2.577HisLeu: 2.577 ± 0.18
0.481HisMet: 0.481 ± 0.065
0.761HisAsn: 0.761 ± 0.084
1.001HisPro: 1.001 ± 0.094
0.644HisGln: 0.644 ± 0.071
1.172HisArg: 1.172 ± 0.087
1.149HisSer: 1.149 ± 0.092
0.753HisThr: 0.753 ± 0.082
1.157HisVal: 1.157 ± 0.098
0.14HisTrp: 0.14 ± 0.032
0.8HisTyr: 0.8 ± 0.075
0.0HisXaa: 0.0 ± 0.0
Ile
3.074IleAla: 3.074 ± 0.165
1.172IleCys: 1.172 ± 0.105
2.849IleAsp: 2.849 ± 0.162
3.299IleGlu: 3.299 ± 0.188
2.942IlePhe: 2.942 ± 0.155
3.097IleGly: 3.097 ± 0.215
1.009IleHis: 1.009 ± 0.098
3.617IleIle: 3.617 ± 0.195
3.26IleLys: 3.26 ± 0.189
6.505IleLeu: 6.505 ± 0.247
1.188IleMet: 1.188 ± 0.122
2.531IleAsn: 2.531 ± 0.144
2.484IlePro: 2.484 ± 0.139
1.591IleGln: 1.591 ± 0.109
2.818IleArg: 2.818 ± 0.155
4.231IleSer: 4.231 ± 0.175
2.95IleThr: 2.95 ± 0.186
3.423IleVal: 3.423 ± 0.191
0.52IleTrp: 0.52 ± 0.061
2.833IleTyr: 2.833 ± 0.159
0.0IleXaa: 0.0 ± 0.0
Lys
2.88LysAla: 2.88 ± 0.15
0.668LysCys: 0.668 ± 0.078
3.346LysAsp: 3.346 ± 0.175
5.659LysGlu: 5.659 ± 0.236
1.948LysPhe: 1.948 ± 0.128
3.99LysGly: 3.99 ± 0.264
1.343LysHis: 1.343 ± 0.097
4.2LysIle: 4.2 ± 0.187
5.123LysLys: 5.123 ± 0.284
5.023LysLeu: 5.023 ± 0.232
1.157LysMet: 1.157 ± 0.089
2.903LysAsn: 2.903 ± 0.164
2.298LysPro: 2.298 ± 0.143
2.282LysGln: 2.282 ± 0.143
3.579LysArg: 3.579 ± 0.192
3.02LysSer: 3.02 ± 0.171
2.95LysThr: 2.95 ± 0.149
4.355LysVal: 4.355 ± 0.221
0.737LysTrp: 0.737 ± 0.101
2.492LysTyr: 2.492 ± 0.145
0.0LysXaa: 0.0 ± 0.0
Leu
6.451LeuAla: 6.451 ± 0.251
2.911LeuCys: 2.911 ± 0.146
5.589LeuAsp: 5.589 ± 0.233
8.78LeuGlu: 8.78 ± 0.324
4.891LeuPhe: 4.891 ± 0.188
5.457LeuGly: 5.457 ± 0.243
2.399LeuHis: 2.399 ± 0.146
4.953LeuIle: 4.953 ± 0.202
4.875LeuLys: 4.875 ± 0.221
13.119LeuLeu: 13.119 ± 0.379
1.498LeuMet: 1.498 ± 0.101
4.06LeuAsn: 4.06 ± 0.169
6.195LeuPro: 6.195 ± 0.224
5.682LeuGln: 5.682 ± 0.27
4.852LeuArg: 4.852 ± 0.212
9.812LeuSer: 9.812 ± 0.325
5.286LeuThr: 5.286 ± 0.212
6.963LeuVal: 6.963 ± 0.255
1.149LeuTrp: 1.149 ± 0.1
5.333LeuTyr: 5.333 ± 0.227
0.0LeuXaa: 0.0 ± 0.0
Met
1.234MetAla: 1.234 ± 0.091
0.365MetCys: 0.365 ± 0.054
1.351MetAsp: 1.351 ± 0.109
1.778MetGlu: 1.778 ± 0.112
0.769MetPhe: 0.769 ± 0.078
0.955MetGly: 0.955 ± 0.076
0.497MetHis: 0.497 ± 0.059
0.745MetIle: 0.745 ± 0.072
0.885MetLys: 0.885 ± 0.074
1.964MetLeu: 1.964 ± 0.136
0.303MetMet: 0.303 ± 0.047
0.815MetAsn: 0.815 ± 0.122
0.598MetPro: 0.598 ± 0.107
1.343MetGln: 1.343 ± 0.116
0.877MetArg: 0.877 ± 0.087
1.638MetSer: 1.638 ± 0.117
0.706MetThr: 0.706 ± 0.086
1.289MetVal: 1.289 ± 0.1
0.217MetTrp: 0.217 ± 0.04
0.699MetTyr: 0.699 ± 0.073
0.0MetXaa: 0.0 ± 0.0
Asn
1.917AsnAla: 1.917 ± 0.113
0.536AsnCys: 0.536 ± 0.064
1.475AsnAsp: 1.475 ± 0.098
1.801AsnGlu: 1.801 ± 0.123
2.368AsnPhe: 2.368 ± 0.138
2.701AsnGly: 2.701 ± 0.25
0.792AsnHis: 0.792 ± 0.079
3.346AsnIle: 3.346 ± 0.157
3.253AsnLys: 3.253 ± 0.186
6.195AsnLeu: 6.195 ± 0.209
0.963AsnMet: 0.963 ± 0.085
2.111AsnAsn: 2.111 ± 0.166
2.166AsnPro: 2.166 ± 0.132
1.467AsnGln: 1.467 ± 0.108
2.158AsnArg: 2.158 ± 0.136
2.779AsnSer: 2.779 ± 0.154
3.26AsnThr: 3.26 ± 0.416
2.748AsnVal: 2.748 ± 0.183
0.435AsnTrp: 0.435 ± 0.048
2.135AsnTyr: 2.135 ± 0.132
0.0AsnXaa: 0.0 ± 0.0
Pro
1.785ProAla: 1.785 ± 0.136
1.133ProCys: 1.133 ± 0.124
1.972ProAsp: 1.972 ± 0.131
3.975ProGlu: 3.975 ± 0.167
2.174ProPhe: 2.174 ± 0.119
2.36ProGly: 2.36 ± 0.182
0.769ProHis: 0.769 ± 0.09
1.731ProIle: 1.731 ± 0.131
2.127ProLys: 2.127 ± 0.139
5.232ProLeu: 5.232 ± 0.199
0.574ProMet: 0.574 ± 0.074
1.793ProAsn: 1.793 ± 0.115
2.243ProPro: 2.243 ± 0.187
2.414ProGln: 2.414 ± 0.267
1.91ProArg: 1.91 ± 0.156
3.78ProSer: 3.78 ± 0.195
1.754ProThr: 1.754 ± 0.126
2.795ProVal: 2.795 ± 0.172
1.149ProTrp: 1.149 ± 0.195
1.754ProTyr: 1.754 ± 0.132
0.0ProXaa: 0.0 ± 0.0
Gln
2.484GlnAla: 2.484 ± 0.152
0.497GlnCys: 0.497 ± 0.087
2.111GlnAsp: 2.111 ± 0.119
3.361GlnGlu: 3.361 ± 0.162
0.753GlnPhe: 0.753 ± 0.079
4.549GlnGly: 4.549 ± 0.4
0.567GlnHis: 0.567 ± 0.067
1.77GlnIle: 1.77 ± 0.108
1.863GlnLys: 1.863 ± 0.125
2.795GlnLeu: 2.795 ± 0.145
0.606GlnMet: 0.606 ± 0.064
1.351GlnAsn: 1.351 ± 0.124
1.289GlnPro: 1.289 ± 0.102
1.009GlnGln: 1.009 ± 0.129
2.166GlnArg: 2.166 ± 0.126
2.228GlnSer: 2.228 ± 0.133
2.166GlnThr: 2.166 ± 0.132
2.989GlnVal: 2.989 ± 0.171
1.017GlnTrp: 1.017 ± 0.178
0.838GlnTyr: 0.838 ± 0.084
0.0GlnXaa: 0.0 ± 0.0
Arg
2.632ArgAla: 2.632 ± 0.163
1.149ArgCys: 1.149 ± 0.11
2.958ArgAsp: 2.958 ± 0.156
5.69ArgGlu: 5.69 ± 0.255
2.049ArgPhe: 2.049 ± 0.124
3.509ArgGly: 3.509 ± 0.191
0.862ArgHis: 0.862 ± 0.081
3.09ArgIle: 3.09 ± 0.189
3.726ArgLys: 3.726 ± 0.201
4.596ArgLeu: 4.596 ± 0.177
1.056ArgMet: 1.056 ± 0.083
2.546ArgAsn: 2.546 ± 0.143
1.739ArgPro: 1.739 ± 0.126
1.638ArgGln: 1.638 ± 0.126
3.113ArgArg: 3.113 ± 0.182
3.338ArgSer: 3.338 ± 0.196
2.135ArgThr: 2.135 ± 0.147
3.796ArgVal: 3.796 ± 0.193
0.675ArgTrp: 0.675 ± 0.076
2.158ArgTyr: 2.158 ± 0.135
0.0ArgXaa: 0.0 ± 0.0
Ser
3.742SerAla: 3.742 ± 0.165
1.669SerCys: 1.669 ± 0.141
3.09SerAsp: 3.09 ± 0.151
3.664SerGlu: 3.664 ± 0.153
4.355SerPhe: 4.355 ± 0.194
4.394SerGly: 4.394 ± 0.184
1.118SerHis: 1.118 ± 0.096
3.742SerIle: 3.742 ± 0.183
4.308SerLys: 4.308 ± 0.196
9.735SerLeu: 9.735 ± 0.315
1.374SerMet: 1.374 ± 0.12
2.857SerAsn: 2.857 ± 0.155
3.695SerPro: 3.695 ± 0.249
2.174SerGln: 2.174 ± 0.146
3.548SerArg: 3.548 ± 0.176
7.716SerSer: 7.716 ± 0.384
4.091SerThr: 4.091 ± 0.206
4.192SerVal: 4.192 ± 0.181
0.994SerTrp: 0.994 ± 0.09
3.881SerTyr: 3.881 ± 0.207
0.0SerXaa: 0.0 ± 0.0
Thr
1.987ThrAla: 1.987 ± 0.125
1.739ThrCys: 1.739 ± 0.175
2.205ThrAsp: 2.205 ± 0.152
3.074ThrGlu: 3.074 ± 0.164
2.888ThrPhe: 2.888 ± 0.145
6.777ThrGly: 6.777 ± 1.526
0.823ThrHis: 0.823 ± 0.085
2.958ThrIle: 2.958 ± 0.155
2.709ThrLys: 2.709 ± 0.169
6.373ThrLeu: 6.373 ± 0.233
0.652ThrMet: 0.652 ± 0.076
2.096ThrAsn: 2.096 ± 0.161
2.391ThrPro: 2.391 ± 0.147
1.475ThrGln: 1.475 ± 0.1
2.686ThrArg: 2.686 ± 0.147
4.285ThrSer: 4.285 ± 0.214
2.733ThrThr: 2.733 ± 0.222
2.756ThrVal: 2.756 ± 0.193
0.699ThrTrp: 0.699 ± 0.07
2.026ThrTyr: 2.026 ± 0.126
0.0ThrXaa: 0.0 ± 0.0
Val
3.431ValAla: 3.431 ± 0.166
2.189ValCys: 2.189 ± 0.17
3.641ValAsp: 3.641 ± 0.156
4.301ValGlu: 4.301 ± 0.197
2.585ValPhe: 2.585 ± 0.127
2.639ValGly: 2.639 ± 0.183
1.188ValHis: 1.188 ± 0.093
3.198ValIle: 3.198 ± 0.198
3.796ValLys: 3.796 ± 0.198
7.67ValLeu: 7.67 ± 0.281
1.351ValMet: 1.351 ± 0.116
3.043ValAsn: 3.043 ± 0.215
3.237ValPro: 3.237 ± 0.165
2.368ValGln: 2.368 ± 0.16
3.229ValArg: 3.229 ± 0.16
4.433ValSer: 4.433 ± 0.216
4.906ValThr: 4.906 ± 0.595
3.912ValVal: 3.912 ± 0.219
0.776ValTrp: 0.776 ± 0.069
3.33ValTyr: 3.33 ± 0.151
0.0ValXaa: 0.0 ± 0.0
Trp
1.327TrpAla: 1.327 ± 0.18
0.326TrpCys: 0.326 ± 0.05
0.869TrpAsp: 0.869 ± 0.137
0.613TrpGlu: 0.613 ± 0.059
0.606TrpPhe: 0.606 ± 0.07
0.442TrpGly: 0.442 ± 0.062
0.225TrpHis: 0.225 ± 0.04
0.854TrpIle: 0.854 ± 0.074
0.924TrpLys: 0.924 ± 0.073
1.747TrpLeu: 1.747 ± 0.148
0.396TrpMet: 0.396 ± 0.062
0.877TrpAsn: 0.877 ± 0.111
0.21TrpPro: 0.21 ± 0.05
0.357TrpGln: 0.357 ± 0.048
0.637TrpArg: 0.637 ± 0.07
0.831TrpSer: 0.831 ± 0.087
0.613TrpThr: 0.613 ± 0.074
0.582TrpVal: 0.582 ± 0.072
0.07TrpTrp: 0.07 ± 0.024
0.551TrpTyr: 0.551 ± 0.07
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.593TyrAla: 2.593 ± 0.129
0.706TyrCys: 0.706 ± 0.084
2.243TyrAsp: 2.243 ± 0.13
2.717TyrGlu: 2.717 ± 0.175
2.562TyrPhe: 2.562 ± 0.162
2.026TyrGly: 2.026 ± 0.129
0.947TyrHis: 0.947 ± 0.089
2.523TyrIle: 2.523 ± 0.148
2.934TyrLys: 2.934 ± 0.17
5.799TyrLeu: 5.799 ± 0.25
1.149TyrMet: 1.149 ± 0.09
2.368TyrAsn: 2.368 ± 0.133
2.36TyrPro: 2.36 ± 0.132
1.591TyrGln: 1.591 ± 0.122
2.36TyrArg: 2.36 ± 0.117
3.78TyrSer: 3.78 ± 0.171
2.181TyrThr: 2.181 ± 0.12
2.647TyrVal: 2.647 ± 0.132
0.349TyrTrp: 0.349 ± 0.053
2.647TyrTyr: 2.647 ± 0.144
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.008XaaGlu: 0.008 ± 0.008
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.101XaaXaa: 0.101 ± 0.11
Statistics based on 533 proteins (128820 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski