Amino acid dipepetide frequency for Waterbuck coronavirus US/OH-WD358/1994

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.05AlaAla: 4.05 ± 0.61
1.877AlaCys: 1.877 ± 0.674
4.149AlaAsp: 4.149 ± 0.848
1.679AlaGlu: 1.679 ± 0.59
3.853AlaPhe: 3.853 ± 0.511
3.655AlaGly: 3.655 ± 0.283
0.889AlaHis: 0.889 ± 0.338
3.951AlaIle: 3.951 ± 0.69
4.05AlaLys: 4.05 ± 0.695
5.73AlaLeu: 5.73 ± 0.618
1.087AlaMet: 1.087 ± 0.272
4.149AlaAsn: 4.149 ± 0.566
2.272AlaPro: 2.272 ± 0.952
2.371AlaGln: 2.371 ± 0.4
2.074AlaArg: 2.074 ± 0.491
4.149AlaSer: 4.149 ± 1.025
3.853AlaThr: 3.853 ± 0.79
5.433AlaVal: 5.433 ± 0.902
1.087AlaTrp: 1.087 ± 0.375
3.062AlaTyr: 3.062 ± 0.491
0.0AlaXaa: 0.0 ± 0.0
Cys
1.679CysAla: 1.679 ± 0.433
1.185CysCys: 1.185 ± 0.518
2.568CysAsp: 2.568 ± 0.547
0.79CysGlu: 0.79 ± 0.218
2.371CysPhe: 2.371 ± 0.354
2.173CysGly: 2.173 ± 0.645
0.593CysHis: 0.593 ± 0.13
2.074CysIle: 2.074 ± 0.327
2.272CysLys: 2.272 ± 0.465
2.371CysLeu: 2.371 ± 0.291
0.494CysMet: 0.494 ± 0.322
2.964CysAsn: 2.964 ± 0.805
0.889CysPro: 0.889 ± 0.327
1.284CysGln: 1.284 ± 0.295
1.185CysArg: 1.185 ± 0.395
2.371CysSer: 2.371 ± 0.512
1.778CysThr: 1.778 ± 0.815
3.161CysVal: 3.161 ± 0.467
0.494CysTrp: 0.494 ± 0.207
1.976CysTyr: 1.976 ± 0.428
0.0CysXaa: 0.0 ± 0.0
Asp
3.655AspAla: 3.655 ± 0.281
2.865AspCys: 2.865 ± 0.538
3.853AspAsp: 3.853 ± 0.639
1.976AspGlu: 1.976 ± 0.418
4.347AspPhe: 4.347 ± 0.631
4.347AspGly: 4.347 ± 0.668
0.889AspHis: 0.889 ± 0.24
3.26AspIle: 3.26 ± 0.593
3.754AspLys: 3.754 ± 0.713
6.026AspLeu: 6.026 ± 0.708
1.581AspMet: 1.581 ± 0.446
2.371AspAsn: 2.371 ± 0.503
1.383AspPro: 1.383 ± 0.268
1.581AspGln: 1.581 ± 0.549
1.284AspArg: 1.284 ± 0.468
4.05AspSer: 4.05 ± 0.483
2.964AspThr: 2.964 ± 0.505
7.31AspVal: 7.31 ± 1.221
0.395AspTrp: 0.395 ± 0.17
3.26AspTyr: 3.26 ± 0.493
0.0AspXaa: 0.0 ± 0.0
Glu
3.161GluAla: 3.161 ± 0.243
0.691GluCys: 0.691 ± 0.257
2.568GluAsp: 2.568 ± 0.693
2.47GluGlu: 2.47 ± 0.557
1.778GluPhe: 1.778 ± 0.271
2.074GluGly: 2.074 ± 0.459
0.691GluHis: 0.691 ± 0.2
2.964GluIle: 2.964 ± 0.244
1.482GluLys: 1.482 ± 0.315
4.149GluLeu: 4.149 ± 0.548
0.79GluMet: 0.79 ± 0.15
1.877GluAsn: 1.877 ± 0.722
1.383GluPro: 1.383 ± 0.376
1.087GluGln: 1.087 ± 0.297
1.284GluArg: 1.284 ± 0.295
1.679GluSer: 1.679 ± 0.507
1.877GluThr: 1.877 ± 0.491
2.272GluVal: 2.272 ± 0.605
0.198GluTrp: 0.198 ± 0.165
1.976GluTyr: 1.976 ± 0.604
0.0GluXaa: 0.0 ± 0.0
Phe
2.865PheAla: 2.865 ± 0.556
1.679PheCys: 1.679 ± 0.313
3.359PheAsp: 3.359 ± 0.59
2.47PheGlu: 2.47 ± 0.516
1.284PhePhe: 1.284 ± 0.494
3.062PheGly: 3.062 ± 0.701
0.691PheHis: 0.691 ± 0.223
3.359PheIle: 3.359 ± 0.427
4.347PheLys: 4.347 ± 0.298
3.754PheLeu: 3.754 ± 0.693
1.284PheMet: 1.284 ± 0.285
4.544PheAsn: 4.544 ± 0.691
1.185PhePro: 1.185 ± 0.335
1.482PheGln: 1.482 ± 0.485
1.581PheArg: 1.581 ± 0.411
3.655PheSer: 3.655 ± 0.54
4.544PheThr: 4.544 ± 1.027
5.927PheVal: 5.927 ± 1.137
0.691PheTrp: 0.691 ± 0.313
3.655PheTyr: 3.655 ± 0.454
0.0PheXaa: 0.0 ± 0.0
Gly
2.47GlyAla: 2.47 ± 0.321
2.964GlyCys: 2.964 ± 0.354
3.359GlyAsp: 3.359 ± 0.458
1.581GlyGlu: 1.581 ± 0.289
4.149GlyPhe: 4.149 ± 0.849
3.853GlyGly: 3.853 ± 0.467
0.988GlyHis: 0.988 ± 0.312
3.26GlyIle: 3.26 ± 0.969
2.964GlyLys: 2.964 ± 0.831
4.347GlyLeu: 4.347 ± 0.576
1.581GlyMet: 1.581 ± 0.243
3.556GlyAsn: 3.556 ± 1.045
1.284GlyPro: 1.284 ± 0.414
1.482GlyGln: 1.482 ± 0.463
1.976GlyArg: 1.976 ± 0.366
4.84GlySer: 4.84 ± 0.501
4.149GlyThr: 4.149 ± 0.557
6.915GlyVal: 6.915 ± 1.018
0.889GlyTrp: 0.889 ± 0.269
3.655GlyTyr: 3.655 ± 0.809
0.0GlyXaa: 0.0 ± 0.0
His
1.482HisAla: 1.482 ± 0.458
0.593HisCys: 0.593 ± 0.38
1.087HisAsp: 1.087 ± 0.289
0.494HisGlu: 0.494 ± 0.322
1.284HisPhe: 1.284 ± 0.157
0.395HisGly: 0.395 ± 0.217
0.099HisHis: 0.099 ± 0.092
0.691HisIle: 0.691 ± 0.424
1.284HisLys: 1.284 ± 0.427
1.383HisLeu: 1.383 ± 0.568
0.198HisMet: 0.198 ± 0.068
0.494HisAsn: 0.494 ± 0.198
0.494HisPro: 0.494 ± 0.257
0.494HisGln: 0.494 ± 0.127
0.198HisArg: 0.198 ± 0.068
0.889HisSer: 0.889 ± 0.293
0.79HisThr: 0.79 ± 0.168
2.568HisVal: 2.568 ± 1.078
0.593HisTrp: 0.593 ± 0.249
0.889HisTyr: 0.889 ± 0.268
0.0HisXaa: 0.0 ± 0.0
Ile
3.359IleAla: 3.359 ± 0.323
2.371IleCys: 2.371 ± 0.676
3.161IleAsp: 3.161 ± 0.35
1.284IleGlu: 1.284 ± 0.261
1.976IlePhe: 1.976 ± 0.696
3.556IleGly: 3.556 ± 1.068
0.494IleHis: 0.494 ± 0.101
3.754IleIle: 3.754 ± 1.274
3.951IleLys: 3.951 ± 0.96
5.038IleLeu: 5.038 ± 0.873
1.284IleMet: 1.284 ± 0.486
3.359IleAsn: 3.359 ± 1.321
1.383IlePro: 1.383 ± 0.336
2.173IleGln: 2.173 ± 0.519
2.272IleArg: 2.272 ± 0.654
3.556IleSer: 3.556 ± 0.346
3.359IleThr: 3.359 ± 0.407
5.038IleVal: 5.038 ± 1.256
0.691IleTrp: 0.691 ± 0.275
1.679IleTyr: 1.679 ± 0.261
0.0IleXaa: 0.0 ± 0.0
Lys
3.556LysAla: 3.556 ± 0.664
2.371LysCys: 2.371 ± 0.599
3.26LysAsp: 3.26 ± 0.557
2.47LysGlu: 2.47 ± 0.475
3.359LysPhe: 3.359 ± 0.427
3.853LysGly: 3.853 ± 0.837
1.679LysHis: 1.679 ± 0.596
3.26LysIle: 3.26 ± 0.911
1.877LysLys: 1.877 ± 0.418
6.52LysLeu: 6.52 ± 0.912
0.593LysMet: 0.593 ± 0.207
2.074LysAsn: 2.074 ± 0.168
2.964LysPro: 2.964 ± 0.533
2.47LysGln: 2.47 ± 0.863
2.371LysArg: 2.371 ± 0.396
4.05LysSer: 4.05 ± 0.3
1.679LysThr: 1.679 ± 0.185
5.334LysVal: 5.334 ± 0.523
0.988LysTrp: 0.988 ± 0.174
2.865LysTyr: 2.865 ± 0.673
0.0LysXaa: 0.0 ± 0.0
Leu
5.927LeuAla: 5.927 ± 1.299
2.865LeuCys: 2.865 ± 0.443
5.038LeuAsp: 5.038 ± 0.66
4.05LeuGlu: 4.05 ± 0.657
5.236LeuPhe: 5.236 ± 0.969
5.038LeuGly: 5.038 ± 1.521
1.778LeuHis: 1.778 ± 0.379
3.655LeuIle: 3.655 ± 0.611
5.137LeuLys: 5.137 ± 0.862
7.606LeuLeu: 7.606 ± 1.185
1.877LeuMet: 1.877 ± 0.361
5.334LeuAsn: 5.334 ± 0.699
3.457LeuPro: 3.457 ± 0.485
3.754LeuGln: 3.754 ± 0.991
2.766LeuArg: 2.766 ± 0.738
7.508LeuSer: 7.508 ± 0.527
5.532LeuThr: 5.532 ± 0.774
6.026LeuVal: 6.026 ± 0.444
1.383LeuTrp: 1.383 ± 0.221
5.236LeuTyr: 5.236 ± 0.521
0.0LeuXaa: 0.0 ± 0.0
Met
2.173MetAla: 2.173 ± 0.697
0.691MetCys: 0.691 ± 0.46
1.087MetAsp: 1.087 ± 0.384
0.691MetGlu: 0.691 ± 0.231
1.482MetPhe: 1.482 ± 0.238
0.889MetGly: 0.889 ± 0.297
0.593MetHis: 0.593 ± 0.227
0.988MetIle: 0.988 ± 0.221
0.395MetLys: 0.395 ± 0.211
2.667MetLeu: 2.667 ± 0.465
0.494MetMet: 0.494 ± 0.272
0.988MetAsn: 0.988 ± 0.236
1.679MetPro: 1.679 ± 0.337
0.988MetGln: 0.988 ± 0.273
0.593MetArg: 0.593 ± 0.31
1.877MetSer: 1.877 ± 0.266
1.087MetThr: 1.087 ± 0.281
1.383MetVal: 1.383 ± 0.381
0.296MetTrp: 0.296 ± 0.326
0.988MetTyr: 0.988 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
3.556AsnAla: 3.556 ± 1.102
1.482AsnCys: 1.482 ± 0.218
2.371AsnAsp: 2.371 ± 0.695
1.482AsnGlu: 1.482 ± 0.28
3.359AsnPhe: 3.359 ± 0.609
4.742AsnGly: 4.742 ± 0.869
0.988AsnHis: 0.988 ± 0.285
2.47AsnIle: 2.47 ± 0.545
3.26AsnLys: 3.26 ± 0.565
4.445AsnLeu: 4.445 ± 0.77
1.976AsnMet: 1.976 ± 0.196
3.457AsnAsn: 3.457 ± 0.658
2.371AsnPro: 2.371 ± 0.601
1.877AsnGln: 1.877 ± 0.66
1.976AsnArg: 1.976 ± 0.496
3.556AsnSer: 3.556 ± 0.671
3.26AsnThr: 3.26 ± 0.678
5.828AsnVal: 5.828 ± 0.558
0.395AsnTrp: 0.395 ± 0.087
2.766AsnTyr: 2.766 ± 1.112
0.0AsnXaa: 0.0 ± 0.0
Pro
2.964ProAla: 2.964 ± 0.471
1.383ProCys: 1.383 ± 0.272
1.778ProAsp: 1.778 ± 0.443
1.383ProGlu: 1.383 ± 0.363
1.581ProPhe: 1.581 ± 0.26
2.074ProGly: 2.074 ± 0.807
0.79ProHis: 0.79 ± 0.311
2.371ProIle: 2.371 ± 0.433
2.074ProLys: 2.074 ± 0.429
2.964ProLeu: 2.964 ± 0.42
0.395ProMet: 0.395 ± 0.306
1.581ProAsn: 1.581 ± 0.662
1.877ProPro: 1.877 ± 0.5
1.581ProGln: 1.581 ± 0.698
1.185ProArg: 1.185 ± 0.422
2.074ProSer: 2.074 ± 0.613
2.865ProThr: 2.865 ± 0.468
2.371ProVal: 2.371 ± 0.494
0.395ProTrp: 0.395 ± 0.242
1.482ProTyr: 1.482 ± 0.599
0.0ProXaa: 0.0 ± 0.0
Gln
1.679GlnAla: 1.679 ± 0.277
0.691GlnCys: 0.691 ± 0.255
2.173GlnAsp: 2.173 ± 0.397
1.778GlnGlu: 1.778 ± 0.439
2.074GlnPhe: 2.074 ± 0.593
2.173GlnGly: 2.173 ± 0.456
0.79GlnHis: 0.79 ± 0.202
2.074GlnIle: 2.074 ± 0.246
1.976GlnLys: 1.976 ± 0.674
3.359GlnLeu: 3.359 ± 0.598
0.395GlnMet: 0.395 ± 0.137
1.482GlnAsn: 1.482 ± 0.49
1.482GlnPro: 1.482 ± 0.58
2.074GlnGln: 2.074 ± 0.853
1.087GlnArg: 1.087 ± 0.39
3.655GlnSer: 3.655 ± 0.347
1.976GlnThr: 1.976 ± 0.468
2.074GlnVal: 2.074 ± 0.558
0.889GlnTrp: 0.889 ± 0.182
1.383GlnTyr: 1.383 ± 0.204
0.0GlnXaa: 0.0 ± 0.0
Arg
2.371ArgAla: 2.371 ± 1.169
0.889ArgCys: 0.889 ± 0.442
1.581ArgAsp: 1.581 ± 0.163
1.383ArgGlu: 1.383 ± 0.336
1.877ArgPhe: 1.877 ± 0.465
2.074ArgGly: 2.074 ± 0.556
0.79ArgHis: 0.79 ± 0.23
1.383ArgIle: 1.383 ± 0.461
1.778ArgLys: 1.778 ± 0.495
2.865ArgLeu: 2.865 ± 0.468
0.395ArgMet: 0.395 ± 0.169
1.581ArgAsn: 1.581 ± 0.43
0.889ArgPro: 0.889 ± 0.395
1.284ArgGln: 1.284 ± 0.75
1.778ArgArg: 1.778 ± 0.454
3.359ArgSer: 3.359 ± 0.868
1.679ArgThr: 1.679 ± 0.305
3.655ArgVal: 3.655 ± 0.961
0.296ArgTrp: 0.296 ± 0.246
1.877ArgTyr: 1.877 ± 0.485
0.0ArgXaa: 0.0 ± 0.0
Ser
4.643SerAla: 4.643 ± 0.517
2.865SerCys: 2.865 ± 0.518
4.742SerAsp: 4.742 ± 0.682
2.272SerGlu: 2.272 ± 0.326
3.062SerPhe: 3.062 ± 0.193
4.248SerGly: 4.248 ± 1.599
1.284SerHis: 1.284 ± 0.357
3.951SerIle: 3.951 ± 0.336
3.754SerLys: 3.754 ± 1.005
7.113SerLeu: 7.113 ± 1.105
1.778SerMet: 1.778 ± 0.496
3.062SerAsn: 3.062 ± 0.31
1.679SerPro: 1.679 ± 0.627
2.667SerGln: 2.667 ± 0.249
3.161SerArg: 3.161 ± 1.692
5.631SerSer: 5.631 ± 0.784
4.544SerThr: 4.544 ± 1.139
6.915SerVal: 6.915 ± 0.721
0.691SerTrp: 0.691 ± 0.327
3.359SerTyr: 3.359 ± 0.349
0.0SerXaa: 0.0 ± 0.0
Thr
4.248ThrAla: 4.248 ± 0.656
1.976ThrCys: 1.976 ± 1.042
3.457ThrAsp: 3.457 ± 0.503
1.679ThrGlu: 1.679 ± 0.222
4.05ThrPhe: 4.05 ± 1.193
5.137ThrGly: 5.137 ± 1.038
0.395ThrHis: 0.395 ± 0.137
3.26ThrIle: 3.26 ± 0.784
3.26ThrLys: 3.26 ± 0.889
4.445ThrLeu: 4.445 ± 0.537
1.581ThrMet: 1.581 ± 0.278
3.062ThrAsn: 3.062 ± 0.794
2.272ThrPro: 2.272 ± 0.748
1.581ThrGln: 1.581 ± 0.375
1.679ThrArg: 1.679 ± 0.789
4.643ThrSer: 4.643 ± 0.951
4.742ThrThr: 4.742 ± 0.617
5.137ThrVal: 5.137 ± 0.561
0.691ThrTrp: 0.691 ± 0.262
2.371ThrTyr: 2.371 ± 0.421
0.0ThrXaa: 0.0 ± 0.0
Val
5.927ValAla: 5.927 ± 0.815
2.964ValCys: 2.964 ± 0.611
7.606ValAsp: 7.606 ± 1.15
3.853ValGlu: 3.853 ± 0.537
3.951ValPhe: 3.951 ± 0.485
3.655ValGly: 3.655 ± 0.366
0.593ValHis: 0.593 ± 0.31
4.347ValIle: 4.347 ± 0.452
5.828ValLys: 5.828 ± 1.241
8.594ValLeu: 8.594 ± 1.265
2.568ValMet: 2.568 ± 0.371
5.631ValAsn: 5.631 ± 1.13
4.149ValPro: 4.149 ± 0.663
3.26ValGln: 3.26 ± 0.645
2.568ValArg: 2.568 ± 0.745
6.026ValSer: 6.026 ± 0.724
4.544ValThr: 4.544 ± 0.536
8.693ValVal: 8.693 ± 1.522
0.79ValTrp: 0.79 ± 0.215
6.717ValTyr: 6.717 ± 0.982
0.0ValXaa: 0.0 ± 0.0
Trp
0.395TrpAla: 0.395 ± 0.154
0.395TrpCys: 0.395 ± 0.167
0.593TrpAsp: 0.593 ± 0.455
0.198TrpGlu: 0.198 ± 0.149
1.087TrpPhe: 1.087 ± 0.382
0.296TrpGly: 0.296 ± 0.139
0.395TrpHis: 0.395 ± 0.087
0.593TrpIle: 0.593 ± 0.309
0.395TrpLys: 0.395 ± 0.186
1.383TrpLeu: 1.383 ± 0.551
0.395TrpMet: 0.395 ± 0.221
0.988TrpAsn: 0.988 ± 0.534
0.494TrpPro: 0.494 ± 0.238
0.593TrpGln: 0.593 ± 0.327
0.593TrpArg: 0.593 ± 0.198
0.889TrpSer: 0.889 ± 0.271
0.593TrpThr: 0.593 ± 0.198
1.383TrpVal: 1.383 ± 0.389
0.099TrpTrp: 0.099 ± 0.169
0.79TrpTyr: 0.79 ± 0.34
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.26TyrAla: 3.26 ± 0.667
1.877TyrCys: 1.877 ± 0.348
3.359TyrAsp: 3.359 ± 0.922
2.47TyrGlu: 2.47 ± 0.354
3.26TyrPhe: 3.26 ± 0.473
2.964TyrGly: 2.964 ± 0.459
0.988TyrHis: 0.988 ± 0.457
2.173TyrIle: 2.173 ± 0.343
3.754TyrLys: 3.754 ± 0.573
4.445TyrLeu: 4.445 ± 0.723
1.185TyrMet: 1.185 ± 0.345
3.062TyrAsn: 3.062 ± 0.559
1.581TyrPro: 1.581 ± 0.668
1.185TyrGln: 1.185 ± 0.256
2.173TyrArg: 2.173 ± 0.775
2.964TyrSer: 2.964 ± 0.918
3.853TyrThr: 3.853 ± 0.686
4.84TyrVal: 4.84 ± 0.757
0.593TyrTrp: 0.593 ± 0.248
3.951TyrTyr: 3.951 ± 0.743
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (10124 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski