Amino acid dipeptide frequency for Waterbuck coronavirus US/OH-WD358-GnC/1994

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
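The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used by Proteome-pI: it assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and that any residue outside the 20 standard amino acids is mapped to Xaa. The names `dipeptide_per_mille`, `label`, and `UNIFORM_PER_MILLE` are made up for this example.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 combinations."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs, scanned per protein so no pair spans two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def label(freq_per_mille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"   # shown in red on the page
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"  # shown in blue on the page
    return "expected"


if __name__ == "__main__":
    toy = ["AAC", "CA"]  # toy sequences, not real proteome data
    freqs = dipeptide_per_mille(toy)
    print(len(freqs), label(freqs["AlaAla"]))
```

Under the overlapping-pair assumption, a proteome of 11 proteins and 10124 amino acids contains 10124 − 11 = 10113 dipeptides, so a single occurrence corresponds to about 0.099‰, which matches the smallest non-zero value in the table.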

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
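As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are illustrative only; the input 8.693 is the ValVal value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n."""
    return per_mille_to_fraction(value_per_mille) * n_dipeptides


if __name__ == "__main__":
    val_val = 8.693  # ValVal frequency in per mille, taken from the table
    fraction = per_mille_to_fraction(val_val)
    print(f"fraction: {fraction:.6f}")
    print(f"percent:  {fraction * 100:.4f}%")
    print(f"fold over uniform expectation: {fold_over_uniform(val_val):.2f}")
```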

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.05 ± 0.457
AlaCys: 1.877 ± 0.582
AlaAsp: 4.149 ± 0.721
AlaGlu: 1.679 ± 0.525
AlaPhe: 3.853 ± 0.462
AlaGly: 3.655 ± 0.254
AlaHis: 0.889 ± 0.404
AlaIle: 3.951 ± 0.66
AlaLys: 4.05 ± 0.686
AlaLeu: 5.73 ± 0.605
AlaMet: 1.087 ± 0.294
AlaAsn: 4.149 ± 0.545
AlaPro: 2.272 ± 1.003
AlaGln: 2.371 ± 0.377
AlaArg: 2.074 ± 0.629
AlaSer: 4.149 ± 1.012
AlaThr: 3.853 ± 0.712
AlaVal: 5.433 ± 1.054
AlaTrp: 1.087 ± 0.335
AlaTyr: 3.062 ± 0.437
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.679 ± 0.436
CysCys: 1.185 ± 0.395
CysAsp: 2.568 ± 0.491
CysGlu: 0.79 ± 0.222
CysPhe: 2.371 ± 0.342
CysGly: 2.173 ± 0.598
CysHis: 0.593 ± 0.125
CysIle: 2.074 ± 0.319
CysLys: 2.272 ± 0.412
CysLeu: 2.371 ± 0.266
CysMet: 0.494 ± 0.326
CysAsn: 2.964 ± 0.613
CysPro: 0.889 ± 0.317
CysGln: 1.284 ± 0.25
CysArg: 1.185 ± 0.389
CysSer: 2.371 ± 0.559
CysThr: 1.778 ± 0.672
CysVal: 3.161 ± 0.426
CysTrp: 0.494 ± 0.239
CysTyr: 1.976 ± 0.563
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.655 ± 0.305
AspCys: 2.865 ± 0.444
AspAsp: 3.853 ± 0.674
AspGlu: 1.976 ± 0.37
AspPhe: 4.347 ± 0.834
AspGly: 4.347 ± 0.729
AspHis: 0.889 ± 0.185
AspIle: 3.26 ± 0.58
AspLys: 3.754 ± 0.759
AspLeu: 6.026 ± 0.688
AspMet: 1.581 ± 0.382
AspAsn: 2.371 ± 0.428
AspPro: 1.383 ± 0.228
AspGln: 1.581 ± 0.472
AspArg: 1.284 ± 0.394
AspSer: 4.05 ± 0.489
AspThr: 2.964 ± 0.577
AspVal: 7.31 ± 1.166
AspTrp: 0.395 ± 0.191
AspTyr: 3.26 ± 0.4
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.161 ± 0.291
GluCys: 0.691 ± 0.234
GluAsp: 2.568 ± 0.552
GluGlu: 2.47 ± 0.539
GluPhe: 1.778 ± 0.253
GluGly: 2.074 ± 0.477
GluHis: 0.593 ± 0.153
GluIle: 2.964 ± 0.244
GluLys: 1.482 ± 0.381
GluLeu: 4.149 ± 0.506
GluMet: 0.79 ± 0.139
GluAsn: 1.877 ± 0.706
GluPro: 1.383 ± 0.434
GluGln: 1.087 ± 0.317
GluArg: 1.284 ± 0.365
GluSer: 1.679 ± 0.597
GluThr: 1.877 ± 0.42
GluVal: 2.272 ± 0.534
GluTrp: 0.198 ± 0.151
GluTyr: 1.976 ± 0.475
GluXaa: 0.099 ± 0.134
Phe
PheAla: 2.865 ± 0.432
PheCys: 1.679 ± 0.301
PheAsp: 3.359 ± 0.769
PheGlu: 2.47 ± 0.52
PhePhe: 1.284 ± 0.444
PheGly: 3.062 ± 0.511
PheHis: 0.691 ± 0.195
PheIle: 3.359 ± 0.405
PheLys: 4.347 ± 0.335
PheLeu: 3.754 ± 0.651
PheMet: 1.284 ± 0.244
PheAsn: 4.544 ± 0.783
PhePro: 1.185 ± 0.394
PheGln: 1.482 ± 0.504
PheArg: 1.581 ± 0.43
PheSer: 3.655 ± 0.565
PheThr: 4.544 ± 1.023
PheVal: 5.927 ± 1.121
PheTrp: 0.691 ± 0.308
PheTyr: 3.655 ± 0.522
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.47 ± 0.297
GlyCys: 2.964 ± 0.414
GlyAsp: 3.359 ± 0.467
GlyGlu: 1.581 ± 0.249
GlyPhe: 4.149 ± 1.119
GlyGly: 3.853 ± 0.516
GlyHis: 0.988 ± 0.272
GlyIle: 3.26 ± 1.122
GlyLys: 2.964 ± 0.671
GlyLeu: 4.347 ± 0.668
GlyMet: 1.581 ± 0.203
GlyAsn: 3.556 ± 1.095
GlyPro: 1.284 ± 0.383
GlyGln: 1.482 ± 0.401
GlyArg: 1.976 ± 0.403
GlySer: 4.84 ± 0.618
GlyThr: 4.149 ± 0.687
GlyVal: 6.915 ± 0.996
GlyTrp: 0.889 ± 0.266
GlyTyr: 3.655 ± 0.857
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.482 ± 0.431
HisCys: 0.593 ± 0.385
HisAsp: 1.087 ± 0.288
HisGlu: 0.494 ± 0.326
HisPhe: 1.284 ± 0.159
HisGly: 0.395 ± 0.18
HisHis: 0.099 ± 0.087
HisIle: 0.593 ± 0.381
HisLys: 1.284 ± 0.531
HisLeu: 1.383 ± 0.472
HisMet: 0.198 ± 0.084
HisAsn: 0.494 ± 0.191
HisPro: 0.494 ± 0.274
HisGln: 0.494 ± 0.12
HisArg: 0.198 ± 0.084
HisSer: 0.889 ± 0.364
HisThr: 0.79 ± 0.147
HisVal: 2.568 ± 1.034
HisTrp: 0.593 ± 0.269
HisTyr: 0.889 ± 0.318
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.359 ± 0.264
IleCys: 2.371 ± 0.585
IleAsp: 3.161 ± 0.349
IleGlu: 1.284 ± 0.389
IlePhe: 1.976 ± 0.597
IleGly: 3.556 ± 0.833
IleHis: 0.494 ± 0.085
IleIle: 3.754 ± 1.13
IleLys: 3.951 ± 0.975
IleLeu: 5.038 ± 0.815
IleMet: 1.284 ± 0.451
IleAsn: 3.359 ± 1.065
IlePro: 1.383 ± 0.324
IleGln: 2.173 ± 0.505
IleArg: 2.272 ± 0.604
IleSer: 3.556 ± 0.303
IleThr: 3.359 ± 0.604
IleVal: 5.038 ± 1.24
IleTrp: 0.691 ± 0.246
IleTyr: 1.679 ± 0.382
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.556 ± 0.801
LysCys: 2.371 ± 0.54
LysAsp: 3.26 ± 0.55
LysGlu: 2.47 ± 0.439
LysPhe: 3.359 ± 0.436
LysGly: 3.853 ± 0.729
LysHis: 1.679 ± 0.661
LysIle: 3.26 ± 0.777
LysLys: 1.877 ± 0.428
LysLeu: 6.52 ± 1.014
LysMet: 0.593 ± 0.188
LysAsn: 2.074 ± 0.183
LysPro: 2.964 ± 0.48
LysGln: 2.47 ± 0.706
LysArg: 2.371 ± 0.375
LysSer: 4.05 ± 0.36
LysThr: 1.679 ± 0.189
LysVal: 5.334 ± 0.602
LysTrp: 0.988 ± 0.157
LysTyr: 2.865 ± 0.72
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.927 ± 1.121
LeuCys: 2.865 ± 0.519
LeuAsp: 5.038 ± 0.66
LeuGlu: 4.05 ± 0.643
LeuPhe: 5.236 ± 0.991
LeuGly: 5.038 ± 1.475
LeuHis: 1.778 ± 0.377
LeuIle: 3.655 ± 0.622
LeuLys: 5.137 ± 0.987
LeuLeu: 7.606 ± 1.182
LeuMet: 1.877 ± 0.299
LeuAsn: 5.334 ± 0.742
LeuPro: 3.457 ± 0.452
LeuGln: 3.754 ± 0.988
LeuArg: 2.766 ± 0.775
LeuSer: 7.508 ± 0.456
LeuThr: 5.532 ± 0.819
LeuVal: 6.026 ± 0.616
LeuTrp: 1.383 ± 0.205
LeuTyr: 5.236 ± 0.497
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.173 ± 0.541
MetCys: 0.691 ± 0.377
MetAsp: 1.087 ± 0.382
MetGlu: 0.691 ± 0.17
MetPhe: 1.482 ± 0.2
MetGly: 0.889 ± 0.327
MetHis: 0.593 ± 0.217
MetIle: 0.988 ± 0.167
MetLys: 0.395 ± 0.211
MetLeu: 2.667 ± 0.556
MetMet: 0.494 ± 0.253
MetAsn: 0.988 ± 0.195
MetPro: 1.679 ± 0.514
MetGln: 0.988 ± 0.236
MetArg: 0.593 ± 0.296
MetSer: 1.877 ± 0.382
MetThr: 1.087 ± 0.359
MetVal: 1.383 ± 0.497
MetTrp: 0.296 ± 0.306
MetTyr: 0.988 ± 0.165
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.556 ± 0.987
AsnCys: 1.482 ± 0.187
AsnAsp: 2.371 ± 0.615
AsnGlu: 1.482 ± 0.257
AsnPhe: 3.359 ± 0.644
AsnGly: 4.742 ± 0.85
AsnHis: 0.988 ± 0.331
AsnIle: 2.47 ± 0.537
AsnLys: 3.26 ± 0.47
AsnLeu: 4.445 ± 0.706
AsnMet: 1.976 ± 0.169
AsnAsn: 3.457 ± 0.594
AsnPro: 2.371 ± 0.638
AsnGln: 1.877 ± 0.773
AsnArg: 1.976 ± 0.465
AsnSer: 3.556 ± 0.764
AsnThr: 3.26 ± 0.615
AsnVal: 5.828 ± 0.54
AsnTrp: 0.395 ± 0.079
AsnTyr: 2.766 ± 1.108
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.964 ± 0.543
ProCys: 1.383 ± 0.255
ProAsp: 1.778 ± 0.458
ProGlu: 1.383 ± 0.365
ProPhe: 1.581 ± 0.217
ProGly: 2.074 ± 0.742
ProHis: 0.79 ± 0.324
ProIle: 2.371 ± 0.56
ProLys: 2.074 ± 0.33
ProLeu: 2.964 ± 0.38
ProMet: 0.395 ± 0.333
ProAsn: 1.581 ± 0.706
ProPro: 1.877 ± 0.471
ProGln: 1.581 ± 0.7
ProArg: 1.185 ± 0.391
ProSer: 2.074 ± 0.743
ProThr: 2.865 ± 0.396
ProVal: 2.371 ± 0.472
ProTrp: 0.395 ± 0.251
ProTyr: 1.482 ± 0.587
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.679 ± 0.277
GlnCys: 0.691 ± 0.244
GlnAsp: 2.173 ± 0.46
GlnGlu: 1.778 ± 0.491
GlnPhe: 2.074 ± 0.541
GlnGly: 2.173 ± 0.538
GlnHis: 0.79 ± 0.207
GlnIle: 2.074 ± 0.21
GlnLys: 1.976 ± 0.66
GlnLeu: 3.359 ± 0.572
GlnMet: 0.395 ± 0.169
GlnAsn: 1.482 ± 0.517
GlnPro: 1.482 ± 0.674
GlnGln: 2.074 ± 0.763
GlnArg: 1.087 ± 0.349
GlnSer: 3.655 ± 0.295
GlnThr: 1.976 ± 0.456
GlnVal: 2.074 ± 0.498
GlnTrp: 0.889 ± 0.213
GlnTyr: 1.383 ± 0.194
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.371 ± 1.089
ArgCys: 0.889 ± 0.405
ArgAsp: 1.581 ± 0.172
ArgGlu: 1.383 ± 0.388
ArgPhe: 1.877 ± 0.429
ArgGly: 2.074 ± 0.512
ArgHis: 0.79 ± 0.241
ArgIle: 1.383 ± 0.34
ArgLys: 1.778 ± 0.536
ArgLeu: 2.865 ± 0.348
ArgMet: 0.395 ± 0.129
ArgAsn: 1.581 ± 0.497
ArgPro: 0.889 ± 0.319
ArgGln: 1.284 ± 0.692
ArgArg: 1.778 ± 0.494
ArgSer: 3.359 ± 0.818
ArgThr: 1.679 ± 0.306
ArgVal: 3.655 ± 1.003
ArgTrp: 0.296 ± 0.23
ArgTyr: 1.877 ± 0.427
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.643 ± 0.484
SerCys: 2.865 ± 0.471
SerAsp: 4.742 ± 0.606
SerGlu: 2.272 ± 0.304
SerPhe: 3.062 ± 0.189
SerGly: 4.248 ± 1.838
SerHis: 1.284 ± 0.387
SerIle: 3.951 ± 0.288
SerLys: 3.754 ± 0.923
SerLeu: 7.113 ± 1.042
SerMet: 1.778 ± 0.514
SerAsn: 3.062 ± 0.334
SerPro: 1.679 ± 0.585
SerGln: 2.667 ± 0.237
SerArg: 3.161 ± 1.803
SerSer: 5.631 ± 0.779
SerThr: 4.544 ± 1.227
SerVal: 6.915 ± 0.778
SerTrp: 0.691 ± 0.333
SerTyr: 3.359 ± 0.378
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.248 ± 0.602
ThrCys: 1.976 ± 0.901
ThrAsp: 3.457 ± 0.465
ThrGlu: 1.679 ± 0.192
ThrPhe: 4.05 ± 1.118
ThrGly: 5.137 ± 1.18
ThrHis: 0.395 ± 0.169
ThrIle: 3.26 ± 0.883
ThrLys: 3.26 ± 0.743
ThrLeu: 4.445 ± 0.567
ThrMet: 1.581 ± 0.299
ThrAsn: 3.062 ± 0.895
ThrPro: 2.272 ± 0.71
ThrGln: 1.581 ± 0.33
ThrArg: 1.679 ± 0.781
ThrSer: 4.643 ± 1.227
ThrThr: 4.742 ± 0.922
ThrVal: 5.137 ± 0.634
ThrTrp: 0.691 ± 0.379
ThrTyr: 2.371 ± 0.406
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.927 ± 0.892
ValCys: 2.964 ± 0.743
ValAsp: 7.606 ± 1.347
ValGlu: 3.853 ± 0.539
ValPhe: 3.951 ± 0.536
ValGly: 3.655 ± 0.342
ValHis: 0.593 ± 0.296
ValIle: 4.347 ± 0.437
ValLys: 5.828 ± 1.261
ValLeu: 8.594 ± 1.298
ValMet: 2.568 ± 0.389
ValAsn: 5.631 ± 1.143
ValPro: 4.149 ± 0.495
ValGln: 3.26 ± 0.604
ValArg: 2.568 ± 0.8
ValSer: 6.026 ± 0.734
ValThr: 4.544 ± 0.629
ValVal: 8.693 ± 1.543
ValTrp: 0.79 ± 0.177
ValTyr: 6.717 ± 0.784
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.395 ± 0.127
TrpCys: 0.395 ± 0.168
TrpAsp: 0.593 ± 0.381
TrpGlu: 0.198 ± 0.121
TrpPhe: 1.087 ± 0.41
TrpGly: 0.296 ± 0.147
TrpHis: 0.395 ± 0.079
TrpIle: 0.593 ± 0.257
TrpLys: 0.395 ± 0.193
TrpLeu: 1.383 ± 0.501
TrpMet: 0.395 ± 0.247
TrpAsn: 0.988 ± 0.438
TrpPro: 0.494 ± 0.256
TrpGln: 0.593 ± 0.287
TrpArg: 0.593 ± 0.195
TrpSer: 0.889 ± 0.289
TrpThr: 0.593 ± 0.158
TrpVal: 1.383 ± 0.365
TrpTrp: 0.099 ± 0.162
TrpTyr: 0.79 ± 0.31
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.26 ± 0.689
TyrCys: 1.877 ± 0.363
TyrAsp: 3.359 ± 0.975
TyrGlu: 2.47 ± 0.366
TyrPhe: 3.26 ± 0.471
TyrGly: 2.964 ± 0.461
TyrHis: 0.988 ± 0.405
TyrIle: 2.173 ± 0.383
TyrLys: 3.754 ± 0.615
TyrLeu: 4.445 ± 0.794
TyrMet: 1.185 ± 0.304
TyrAsn: 3.062 ± 0.526
TyrPro: 1.581 ± 0.551
TyrGln: 1.185 ± 0.25
TyrArg: 2.173 ± 0.679
TyrSer: 2.964 ± 0.774
TyrThr: 3.853 ± 0.642
TyrVal: 4.84 ± 0.629
TyrTrp: 0.593 ± 0.22
TyrTyr: 3.951 ± 0.652
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.099 ± 0.134
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (10124 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
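A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions, not the procedure confirmed by the source: it assumes that each of the 100 rounds resamples whole proteins with replacement, recomputes the dipeptide frequency on the resampled set, and that the reported error is the standard deviation of those estimates. The function name `bootstrap_error` and its parameters are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread (per mille) of one dipeptide frequency.

    proteins: list of one-letter amino acid sequences
    pair:     tuple of two one-letter codes, e.g. ("V", "V")
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Count overlapping pairs once per protein; resampling reuses the counts.
    per_protein = [Counter(zip(seq, seq[1:])) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[pair] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation of the estimates.
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["MVVLSA", "MKVVA", "MSLLVV", "MAAK"]  # toy data, not the proteome
    print(round(bootstrap_error(toy, ("V", "V")), 3))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects how much the statistic depends on which proteins happen to be in the proteome. With only 11 proteins, this explains why some errors in the table are large relative to the values.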

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski