Amino acid dipepetide frequency for Sea otter poxvirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.836AlaAla: 1.836 ± 0.243
0.956AlaCys: 0.956 ± 0.172
2.213AlaAsp: 2.213 ± 0.25
1.484AlaGlu: 1.484 ± 0.225
1.861AlaPhe: 1.861 ± 0.233
1.484AlaGly: 1.484 ± 0.252
0.503AlaHis: 0.503 ± 0.126
4.577AlaIle: 4.577 ± 0.405
2.389AlaLys: 2.389 ± 0.242
3.596AlaLeu: 3.596 ± 0.284
1.031AlaMet: 1.031 ± 0.205
2.565AlaAsn: 2.565 ± 0.25
1.257AlaPro: 1.257 ± 0.162
1.006AlaGln: 1.006 ± 0.168
1.685AlaArg: 1.685 ± 0.227
3.194AlaSer: 3.194 ± 0.372
2.59AlaThr: 2.59 ± 0.241
1.936AlaVal: 1.936 ± 0.239
0.277AlaTrp: 0.277 ± 0.083
1.71AlaTyr: 1.71 ± 0.241
0.0AlaXaa: 0.0 ± 0.0
Cys
0.855CysAla: 0.855 ± 0.159
0.629CysCys: 0.629 ± 0.127
1.383CysAsp: 1.383 ± 0.206
1.031CysGlu: 1.031 ± 0.163
0.855CysPhe: 0.855 ± 0.139
1.257CysGly: 1.257 ± 0.145
0.402CysHis: 0.402 ± 0.086
2.691CysIle: 2.691 ± 0.336
1.458CysLys: 1.458 ± 0.216
1.76CysLeu: 1.76 ± 0.26
0.88CysMet: 0.88 ± 0.154
1.458CysAsn: 1.458 ± 0.164
0.855CysPro: 0.855 ± 0.302
0.427CysGln: 0.427 ± 0.109
0.78CysArg: 0.78 ± 0.127
2.313CysSer: 2.313 ± 0.311
1.106CysThr: 1.106 ± 0.188
1.609CysVal: 1.609 ± 0.18
0.251CysTrp: 0.251 ± 0.077
0.93CysTyr: 0.93 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
2.49AspAla: 2.49 ± 0.248
1.182AspCys: 1.182 ± 0.187
4.853AspAsp: 4.853 ± 0.494
3.571AspGlu: 3.571 ± 0.303
2.59AspPhe: 2.59 ± 0.245
2.64AspGly: 2.64 ± 0.237
0.956AspHis: 0.956 ± 0.163
8.952AspIle: 8.952 ± 0.533
4.275AspLys: 4.275 ± 0.316
4.023AspLeu: 4.023 ± 0.265
1.811AspMet: 1.811 ± 0.238
4.577AspAsn: 4.577 ± 0.36
1.685AspPro: 1.685 ± 0.211
1.031AspGln: 1.031 ± 0.17
1.861AspArg: 1.861 ± 0.231
3.998AspSer: 3.998 ± 0.321
4.702AspThr: 4.702 ± 0.395
4.476AspVal: 4.476 ± 0.418
0.352AspTrp: 0.352 ± 0.083
2.263AspTyr: 2.263 ± 0.236
0.0AspXaa: 0.0 ± 0.0
Glu
1.333GluAla: 1.333 ± 0.194
0.981GluCys: 0.981 ± 0.139
2.439GluAsp: 2.439 ± 0.267
2.64GluGlu: 2.64 ± 0.269
2.238GluPhe: 2.238 ± 0.244
1.106GluGly: 1.106 ± 0.167
1.886GluHis: 1.886 ± 0.233
4.049GluIle: 4.049 ± 0.435
2.917GluLys: 2.917 ± 0.251
5.004GluLeu: 5.004 ± 0.422
0.93GluMet: 0.93 ± 0.174
3.093GluAsn: 3.093 ± 0.28
1.609GluPro: 1.609 ± 0.18
1.861GluGln: 1.861 ± 0.225
1.911GluArg: 1.911 ± 0.246
3.445GluSer: 3.445 ± 0.291
3.168GluThr: 3.168 ± 0.311
1.836GluVal: 1.836 ± 0.187
0.327GluTrp: 0.327 ± 0.082
3.093GluTyr: 3.093 ± 0.228
0.0GluXaa: 0.0 ± 0.0
Phe
1.66PheAla: 1.66 ± 0.232
0.981PheCys: 0.981 ± 0.155
2.842PheAsp: 2.842 ± 0.289
2.087PheGlu: 2.087 ± 0.217
2.037PhePhe: 2.037 ± 0.221
2.464PheGly: 2.464 ± 0.219
0.981PheHis: 0.981 ± 0.132
4.652PheIle: 4.652 ± 0.372
2.716PheLys: 2.716 ± 0.24
4.878PheLeu: 4.878 ± 0.448
1.282PheMet: 1.282 ± 0.18
3.395PheAsn: 3.395 ± 0.275
1.609PhePro: 1.609 ± 0.173
0.805PheGln: 0.805 ± 0.142
1.534PheArg: 1.534 ± 0.187
4.174PheSer: 4.174 ± 0.341
2.892PheThr: 2.892 ± 0.249
3.143PheVal: 3.143 ± 0.34
0.402PheTrp: 0.402 ± 0.106
2.037PheTyr: 2.037 ± 0.24
0.0PheXaa: 0.0 ± 0.0
Gly
1.257GlyAla: 1.257 ± 0.192
0.88GlyCys: 0.88 ± 0.147
2.263GlyAsp: 2.263 ± 0.247
1.735GlyGlu: 1.735 ± 0.175
1.509GlyPhe: 1.509 ± 0.167
1.836GlyGly: 1.836 ± 0.234
0.78GlyHis: 0.78 ± 0.143
4.25GlyIle: 4.25 ± 0.382
3.194GlyLys: 3.194 ± 0.245
2.942GlyLeu: 2.942 ± 0.237
0.855GlyMet: 0.855 ± 0.137
2.691GlyAsn: 2.691 ± 0.241
0.578GlyPro: 0.578 ± 0.113
0.855GlyGln: 0.855 ± 0.161
1.811GlyArg: 1.811 ± 0.214
2.49GlySer: 2.49 ± 0.247
2.716GlyThr: 2.716 ± 0.364
2.54GlyVal: 2.54 ± 0.222
0.201GlyTrp: 0.201 ± 0.074
2.263GlyTyr: 2.263 ± 0.245
0.0GlyXaa: 0.0 ± 0.0
His
1.031HisAla: 1.031 ± 0.143
0.729HisCys: 0.729 ± 0.136
1.484HisAsp: 1.484 ± 0.279
1.207HisGlu: 1.207 ± 0.157
1.132HisPhe: 1.132 ± 0.16
1.207HisGly: 1.207 ± 0.165
0.654HisHis: 0.654 ± 0.152
2.741HisIle: 2.741 ± 0.227
1.735HisLys: 1.735 ± 0.224
1.886HisLeu: 1.886 ± 0.179
0.654HisMet: 0.654 ± 0.127
1.735HisAsn: 1.735 ± 0.234
0.679HisPro: 0.679 ± 0.141
0.578HisGln: 0.578 ± 0.114
0.93HisArg: 0.93 ± 0.157
1.635HisSer: 1.635 ± 0.232
1.735HisThr: 1.735 ± 0.2
2.062HisVal: 2.062 ± 0.264
0.226HisTrp: 0.226 ± 0.086
0.805HisTyr: 0.805 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
4.023IleAla: 4.023 ± 0.322
2.137IleCys: 2.137 ± 0.215
6.714IleAsp: 6.714 ± 0.401
5.029IleGlu: 5.029 ± 0.467
4.174IlePhe: 4.174 ± 0.361
2.867IleGly: 2.867 ± 0.278
2.842IleHis: 2.842 ± 0.302
8.852IleIle: 8.852 ± 0.501
6.865IleLys: 6.865 ± 0.429
9.354IleLeu: 9.354 ± 0.485
2.666IleMet: 2.666 ± 0.287
7.393IleAsn: 7.393 ± 0.379
3.998IlePro: 3.998 ± 0.366
3.168IleGln: 3.168 ± 0.262
4.778IleArg: 4.778 ± 0.351
8.5IleSer: 8.5 ± 0.432
6.689IleThr: 6.689 ± 0.43
5.909IleVal: 5.909 ± 0.372
0.377IleTrp: 0.377 ± 0.092
4.552IleTyr: 4.552 ± 0.405
0.0IleXaa: 0.0 ± 0.0
Lys
2.263LysAla: 2.263 ± 0.197
1.458LysCys: 1.458 ± 0.219
3.998LysAsp: 3.998 ± 0.276
3.068LysGlu: 3.068 ± 0.285
3.244LysPhe: 3.244 ± 0.335
2.339LysGly: 2.339 ± 0.205
2.313LysHis: 2.313 ± 0.234
6.915LysIle: 6.915 ± 0.399
5.658LysLys: 5.658 ± 0.361
6.991LysLeu: 6.991 ± 0.495
1.559LysMet: 1.559 ± 0.21
4.375LysAsn: 4.375 ± 0.341
1.886LysPro: 1.886 ± 0.196
2.54LysGln: 2.54 ± 0.259
2.992LysArg: 2.992 ± 0.262
4.929LysSer: 4.929 ± 0.399
4.728LysThr: 4.728 ± 0.313
3.269LysVal: 3.269 ± 0.316
0.453LysTrp: 0.453 ± 0.128
3.596LysTyr: 3.596 ± 0.329
0.0LysXaa: 0.0 ± 0.0
Leu
3.797LeuAla: 3.797 ± 0.352
2.087LeuCys: 2.087 ± 0.259
5.708LeuAsp: 5.708 ± 0.295
4.702LeuGlu: 4.702 ± 0.294
5.029LeuPhe: 5.029 ± 0.378
3.018LeuGly: 3.018 ± 0.289
2.515LeuHis: 2.515 ± 0.255
6.865LeuIle: 6.865 ± 0.453
5.658LeuLys: 5.658 ± 0.362
9.983LeuLeu: 9.983 ± 0.511
2.54LeuMet: 2.54 ± 0.263
5.381LeuAsn: 5.381 ± 0.369
3.219LeuPro: 3.219 ± 0.323
2.942LeuGln: 2.942 ± 0.289
4.526LeuArg: 4.526 ± 0.37
8.625LeuSer: 8.625 ± 0.483
6.085LeuThr: 6.085 ± 0.394
5.029LeuVal: 5.029 ± 0.374
0.729LeuTrp: 0.729 ± 0.159
4.904LeuTyr: 4.904 ± 0.292
0.0LeuXaa: 0.0 ± 0.0
Met
1.358MetAla: 1.358 ± 0.159
0.503MetCys: 0.503 ± 0.111
1.836MetAsp: 1.836 ± 0.204
1.031MetGlu: 1.031 ± 0.159
1.534MetPhe: 1.534 ± 0.215
1.257MetGly: 1.257 ± 0.158
0.629MetHis: 0.629 ± 0.134
2.339MetIle: 2.339 ± 0.222
1.635MetLys: 1.635 ± 0.201
3.093MetLeu: 3.093 ± 0.288
0.503MetMet: 0.503 ± 0.096
1.76MetAsn: 1.76 ± 0.207
0.78MetPro: 0.78 ± 0.154
0.754MetGln: 0.754 ± 0.137
1.282MetArg: 1.282 ± 0.192
2.062MetSer: 2.062 ± 0.221
1.358MetThr: 1.358 ± 0.171
1.207MetVal: 1.207 ± 0.189
0.151MetTrp: 0.151 ± 0.058
1.584MetTyr: 1.584 ± 0.212
0.0MetXaa: 0.0 ± 0.0
Asn
2.389AsnAla: 2.389 ± 0.219
1.207AsnCys: 1.207 ± 0.193
4.476AsnAsp: 4.476 ± 0.302
3.37AsnGlu: 3.37 ± 0.392
2.339AsnPhe: 2.339 ± 0.243
2.691AsnGly: 2.691 ± 0.26
1.509AsnHis: 1.509 ± 0.168
8.424AsnIle: 8.424 ± 0.402
5.457AsnLys: 5.457 ± 0.357
4.828AsnLeu: 4.828 ± 0.395
2.339AsnMet: 2.339 ± 0.212
5.759AsnAsn: 5.759 ± 0.551
2.137AsnPro: 2.137 ± 0.205
1.333AsnGln: 1.333 ± 0.161
2.339AsnArg: 2.339 ± 0.267
3.973AsnSer: 3.973 ± 0.295
4.904AsnThr: 4.904 ± 0.328
4.375AsnVal: 4.375 ± 0.262
0.327AsnTrp: 0.327 ± 0.082
2.967AsnTyr: 2.967 ± 0.266
0.0AsnXaa: 0.0 ± 0.0
Pro
1.182ProAla: 1.182 ± 0.165
1.006ProCys: 1.006 ± 0.187
2.012ProAsp: 2.012 ± 0.213
1.458ProGlu: 1.458 ± 0.212
1.484ProPhe: 1.484 ± 0.206
1.358ProGly: 1.358 ± 0.269
0.93ProHis: 0.93 ± 0.164
3.521ProIle: 3.521 ± 0.333
2.188ProLys: 2.188 ± 0.222
2.992ProLeu: 2.992 ± 0.283
0.604ProMet: 0.604 ± 0.106
1.76ProAsn: 1.76 ± 0.231
1.433ProPro: 1.433 ± 0.23
1.056ProGln: 1.056 ± 0.167
1.106ProArg: 1.106 ± 0.171
2.464ProSer: 2.464 ± 0.279
2.238ProThr: 2.238 ± 0.224
2.288ProVal: 2.288 ± 0.19
0.302ProTrp: 0.302 ± 0.087
1.609ProTyr: 1.609 ± 0.191
0.0ProXaa: 0.0 ± 0.0
Gln
1.006GlnAla: 1.006 ± 0.155
0.654GlnCys: 0.654 ± 0.138
1.433GlnAsp: 1.433 ± 0.154
1.232GlnGlu: 1.232 ± 0.171
1.182GlnPhe: 1.182 ± 0.169
0.478GlnGly: 0.478 ± 0.105
0.905GlnHis: 0.905 ± 0.148
1.735GlnIle: 1.735 ± 0.211
1.685GlnLys: 1.685 ± 0.212
3.118GlnLeu: 3.118 ± 0.266
0.503GlnMet: 0.503 ± 0.109
1.785GlnAsn: 1.785 ± 0.219
0.729GlnPro: 0.729 ± 0.148
1.308GlnGln: 1.308 ± 0.201
1.458GlnArg: 1.458 ± 0.205
2.54GlnSer: 2.54 ± 0.252
1.961GlnThr: 1.961 ± 0.258
1.76GlnVal: 1.76 ± 0.179
0.302GlnTrp: 0.302 ± 0.096
1.861GlnTyr: 1.861 ± 0.235
0.0GlnXaa: 0.0 ± 0.0
Arg
1.458ArgAla: 1.458 ± 0.184
0.905ArgCys: 0.905 ± 0.168
2.339ArgAsp: 2.339 ± 0.23
1.76ArgGlu: 1.76 ± 0.215
2.213ArgPhe: 2.213 ± 0.255
1.408ArgGly: 1.408 ± 0.187
1.056ArgHis: 1.056 ± 0.167
4.225ArgIle: 4.225 ± 0.321
3.043ArgLys: 3.043 ± 0.358
4.049ArgLeu: 4.049 ± 0.291
1.232ArgMet: 1.232 ± 0.175
2.54ArgAsn: 2.54 ± 0.245
1.157ArgPro: 1.157 ± 0.201
1.433ArgGln: 1.433 ± 0.224
2.364ArgArg: 2.364 ± 0.272
3.697ArgSer: 3.697 ± 0.309
2.515ArgThr: 2.515 ± 0.258
2.666ArgVal: 2.666 ± 0.303
0.151ArgTrp: 0.151 ± 0.074
2.54ArgTyr: 2.54 ± 0.251
0.0ArgXaa: 0.0 ± 0.0
Ser
2.741SerAla: 2.741 ± 0.293
1.836SerCys: 1.836 ± 0.185
4.979SerAsp: 4.979 ± 0.395
3.143SerGlu: 3.143 ± 0.288
3.47SerPhe: 3.47 ± 0.278
2.766SerGly: 2.766 ± 0.267
1.76SerHis: 1.76 ± 0.191
7.368SerIle: 7.368 ± 0.348
5.784SerLys: 5.784 ± 0.388
7.896SerLeu: 7.896 ± 0.397
2.137SerMet: 2.137 ± 0.244
4.753SerAsn: 4.753 ± 0.286
2.64SerPro: 2.64 ± 0.275
2.364SerGln: 2.364 ± 0.272
3.671SerArg: 3.671 ± 0.312
6.337SerSer: 6.337 ± 0.5
5.859SerThr: 5.859 ± 0.352
4.929SerVal: 4.929 ± 0.42
0.377SerTrp: 0.377 ± 0.08
3.445SerTyr: 3.445 ± 0.291
0.0SerXaa: 0.0 ± 0.0
Thr
2.49ThrAla: 2.49 ± 0.258
1.71ThrCys: 1.71 ± 0.207
4.652ThrAsp: 4.652 ± 0.372
3.344ThrGlu: 3.344 ± 0.279
3.118ThrPhe: 3.118 ± 0.264
2.439ThrGly: 2.439 ± 0.272
1.685ThrHis: 1.685 ± 0.213
6.991ThrIle: 6.991 ± 0.502
4.476ThrLys: 4.476 ± 0.319
6.588ThrLeu: 6.588 ± 0.428
1.987ThrMet: 1.987 ± 0.214
4.049ThrAsn: 4.049 ± 0.259
2.64ThrPro: 2.64 ± 0.321
1.76ThrGln: 1.76 ± 0.204
3.043ThrArg: 3.043 ± 0.256
4.677ThrSer: 4.677 ± 0.359
4.325ThrThr: 4.325 ± 0.361
4.099ThrVal: 4.099 ± 0.311
0.453ThrTrp: 0.453 ± 0.12
3.068ThrTyr: 3.068 ± 0.294
0.0ThrXaa: 0.0 ± 0.0
Val
2.867ValAla: 2.867 ± 0.237
1.785ValCys: 1.785 ± 0.242
3.621ValAsp: 3.621 ± 0.272
1.961ValGlu: 1.961 ± 0.262
3.395ValPhe: 3.395 ± 0.365
2.59ValGly: 2.59 ± 0.239
1.509ValHis: 1.509 ± 0.209
5.583ValIle: 5.583 ± 0.314
3.671ValLys: 3.671 ± 0.332
5.507ValLeu: 5.507 ± 0.351
1.383ValMet: 1.383 ± 0.16
3.772ValAsn: 3.772 ± 0.267
2.112ValPro: 2.112 ± 0.242
1.383ValGln: 1.383 ± 0.185
2.842ValArg: 2.842 ± 0.24
5.155ValSer: 5.155 ± 0.359
4.476ValThr: 4.476 ± 0.335
3.294ValVal: 3.294 ± 0.307
0.302ValTrp: 0.302 ± 0.099
3.294ValTyr: 3.294 ± 0.304
0.0ValXaa: 0.0 ± 0.0
Trp
0.251TrpAla: 0.251 ± 0.089
0.126TrpCys: 0.126 ± 0.056
0.226TrpAsp: 0.226 ± 0.073
0.226TrpGlu: 0.226 ± 0.068
0.578TrpPhe: 0.578 ± 0.115
0.226TrpGly: 0.226 ± 0.087
0.075TrpHis: 0.075 ± 0.047
0.704TrpIle: 0.704 ± 0.127
0.453TrpLys: 0.453 ± 0.109
0.78TrpLeu: 0.78 ± 0.151
0.277TrpMet: 0.277 ± 0.105
0.402TrpAsn: 0.402 ± 0.096
0.302TrpPro: 0.302 ± 0.096
0.126TrpGln: 0.126 ± 0.057
0.302TrpArg: 0.302 ± 0.089
0.453TrpSer: 0.453 ± 0.106
0.201TrpThr: 0.201 ± 0.062
0.327TrpVal: 0.327 ± 0.081
0.025TrpTrp: 0.025 ± 0.027
0.251TrpTyr: 0.251 ± 0.078
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.861TyrAla: 1.861 ± 0.204
1.182TyrCys: 1.182 ± 0.197
2.816TyrAsp: 2.816 ± 0.236
1.785TyrGlu: 1.785 ± 0.209
2.59TyrPhe: 2.59 ± 0.227
2.213TyrGly: 2.213 ± 0.249
0.956TyrHis: 0.956 ± 0.133
5.457TyrIle: 5.457 ± 0.392
3.319TyrLys: 3.319 ± 0.353
4.074TyrLeu: 4.074 ± 0.313
1.458TyrMet: 1.458 ± 0.183
3.898TyrAsn: 3.898 ± 0.285
1.685TyrPro: 1.685 ± 0.234
0.905TyrGln: 0.905 ± 0.144
1.559TyrArg: 1.559 ± 0.206
3.596TyrSer: 3.596 ± 0.287
3.344TyrThr: 3.344 ± 0.294
3.747TyrVal: 3.747 ± 0.311
0.327TyrTrp: 0.327 ± 0.077
2.288TyrTyr: 2.288 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 130 proteins (39768 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski