Amino acid dipepetide frequency for Severe acute respiratory syndrome coronavirus 2 (2019-nCoV) (SARS-CoV-2)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.402AlaAla: 5.402 ± 0.442
2.078AlaCys: 2.078 ± 0.437
3.048AlaAsp: 3.048 ± 0.36
2.078AlaGlu: 2.078 ± 0.327
3.602AlaPhe: 3.602 ± 0.596
3.809AlaGly: 3.809 ± 0.474
0.693AlaHis: 0.693 ± 0.227
3.117AlaIle: 3.117 ± 0.463
3.74AlaLys: 3.74 ± 1.213
6.441AlaLeu: 6.441 ± 0.902
2.424AlaMet: 2.424 ± 0.334
3.532AlaAsn: 3.532 ± 0.519
2.286AlaPro: 2.286 ± 0.228
2.147AlaGln: 2.147 ± 0.359
3.463AlaArg: 3.463 ± 0.339
5.264AlaSer: 5.264 ± 0.415
5.472AlaThr: 5.472 ± 0.508
5.264AlaVal: 5.264 ± 0.73
0.97AlaTrp: 0.97 ± 0.238
3.809AlaTyr: 3.809 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
2.563CysAla: 2.563 ± 0.566
1.454CysCys: 1.454 ± 0.694
1.801CysAsp: 1.801 ± 0.47
1.247CysGlu: 1.247 ± 0.254
1.177CysPhe: 1.177 ± 0.666
2.424CysGly: 2.424 ± 0.593
0.762CysHis: 0.762 ± 0.131
1.524CysIle: 1.524 ± 0.383
0.831CysLys: 0.831 ± 0.219
2.701CysLeu: 2.701 ± 0.395
0.346CysMet: 0.346 ± 0.096
1.593CysAsn: 1.593 ± 0.25
1.039CysPro: 1.039 ± 0.233
0.485CysGln: 0.485 ± 0.361
1.108CysArg: 1.108 ± 0.32
2.009CysSer: 2.009 ± 0.523
2.84CysThr: 2.84 ± 0.454
3.325CysVal: 3.325 ± 0.35
0.416CysTrp: 0.416 ± 0.751
1.385CysTyr: 1.385 ± 0.186
0.0CysXaa: 0.0 ± 0.0
Asp
3.602AspAla: 3.602 ± 0.52
1.108AspCys: 1.108 ± 0.21
2.493AspAsp: 2.493 ± 0.341
2.216AspGlu: 2.216 ± 0.588
3.186AspPhe: 3.186 ± 0.591
4.156AspGly: 4.156 ± 0.567
0.693AspHis: 0.693 ± 0.296
2.978AspIle: 2.978 ± 0.503
2.563AspLys: 2.563 ± 0.484
4.848AspLeu: 4.848 ± 0.523
1.177AspMet: 1.177 ± 0.327
3.463AspAsn: 3.463 ± 0.7
1.593AspPro: 1.593 ± 0.616
1.385AspGln: 1.385 ± 0.328
1.177AspArg: 1.177 ± 0.308
3.325AspSer: 3.325 ± 0.541
3.463AspThr: 3.463 ± 0.941
3.671AspVal: 3.671 ± 0.646
0.623AspTrp: 0.623 ± 0.223
2.77AspTyr: 2.77 ± 0.513
0.0AspXaa: 0.0 ± 0.0
Glu
3.325GluAla: 3.325 ± 0.516
1.593GluCys: 1.593 ± 0.214
2.216GluAsp: 2.216 ± 0.341
4.433GluGlu: 4.433 ± 0.756
2.286GluPhe: 2.286 ± 0.465
3.048GluGly: 3.048 ± 0.481
1.039GluHis: 1.039 ± 0.522
3.394GluIle: 3.394 ± 0.456
2.493GluLys: 2.493 ± 0.486
4.571GluLeu: 4.571 ± 0.588
1.177GluMet: 1.177 ± 0.308
1.939GluAsn: 1.939 ± 0.272
1.316GluPro: 1.316 ± 0.937
2.078GluGln: 2.078 ± 0.329
1.177GluArg: 1.177 ± 0.255
2.355GluSer: 2.355 ± 0.239
3.325GluThr: 3.325 ± 0.594
4.086GluVal: 4.086 ± 0.804
0.416GluTrp: 0.416 ± 0.159
1.801GluTyr: 1.801 ± 0.397
0.0GluXaa: 0.0 ± 0.0
Phe
3.117PheAla: 3.117 ± 0.913
2.078PheCys: 2.078 ± 0.535
3.048PheAsp: 3.048 ± 0.931
1.454PheGlu: 1.454 ± 0.357
2.216PhePhe: 2.216 ± 0.343
2.632PheGly: 2.632 ± 0.553
0.623PheHis: 0.623 ± 0.534
2.424PheIle: 2.424 ± 0.43
3.602PheLys: 3.602 ± 0.651
6.095PheLeu: 6.095 ± 1.918
0.831PheMet: 0.831 ± 0.195
3.117PheAsn: 3.117 ± 0.735
1.524PhePro: 1.524 ± 0.378
1.177PheGln: 1.177 ± 0.726
1.177PheArg: 1.177 ± 0.336
3.463PheSer: 3.463 ± 0.431
3.809PheThr: 3.809 ± 0.336
4.156PheVal: 4.156 ± 0.558
0.346PheTrp: 0.346 ± 0.16
2.978PheTyr: 2.978 ± 0.575
0.0PheXaa: 0.0 ± 0.0
Gly
4.225GlyAla: 4.225 ± 0.517
1.662GlyCys: 1.662 ± 0.318
4.086GlyAsp: 4.086 ± 0.442
2.493GlyGlu: 2.493 ± 0.435
3.048GlyPhe: 3.048 ± 0.489
4.156GlyGly: 4.156 ± 0.571
1.247GlyHis: 1.247 ± 0.317
2.493GlyIle: 2.493 ± 0.497
2.84GlyLys: 2.84 ± 0.508
4.156GlyLeu: 4.156 ± 0.582
0.97GlyMet: 0.97 ± 0.249
2.563GlyAsn: 2.563 ± 0.237
2.078GlyPro: 2.078 ± 0.426
2.009GlyGln: 2.009 ± 0.353
1.454GlyArg: 1.454 ± 0.23
4.294GlySer: 4.294 ± 0.469
5.402GlyThr: 5.402 ± 0.708
6.303GlyVal: 6.303 ± 0.791
0.554GlyTrp: 0.554 ± 0.32
2.701GlyTyr: 2.701 ± 0.316
0.0GlyXaa: 0.0 ± 0.0
His
1.385HisAla: 1.385 ± 0.241
0.416HisCys: 0.416 ± 0.159
0.554HisAsp: 0.554 ± 0.404
0.762HisGlu: 0.762 ± 0.266
1.524HisPhe: 1.524 ± 0.353
1.247HisGly: 1.247 ± 0.174
0.554HisHis: 0.554 ± 0.242
0.9HisIle: 0.9 ± 0.239
0.554HisLys: 0.554 ± 0.15
1.316HisLeu: 1.316 ± 0.397
0.416HisMet: 0.416 ± 0.149
1.524HisAsn: 1.524 ± 0.35
0.623HisPro: 0.623 ± 0.213
0.346HisGln: 0.346 ± 0.52
0.208HisArg: 0.208 ± 0.11
1.662HisSer: 1.662 ± 0.23
1.732HisThr: 1.732 ± 0.288
2.216HisVal: 2.216 ± 0.348
0.277HisTrp: 0.277 ± 0.141
0.554HisTyr: 0.554 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
4.086IleAla: 4.086 ± 1.147
1.039IleCys: 1.039 ± 0.337
3.394IleAsp: 3.394 ± 0.246
1.454IleGlu: 1.454 ± 0.229
1.732IlePhe: 1.732 ± 0.447
2.978IleGly: 2.978 ± 0.908
0.623IleHis: 0.623 ± 0.389
3.74IleIle: 3.74 ± 1.067
4.294IleLys: 4.294 ± 0.543
4.086IleLeu: 4.086 ± 0.487
1.039IleMet: 1.039 ± 0.59
2.77IleAsn: 2.77 ± 0.374
1.939IlePro: 1.939 ± 0.28
2.424IleGln: 2.424 ± 0.409
1.454IleArg: 1.454 ± 0.446
3.394IleSer: 3.394 ± 0.565
4.502IleThr: 4.502 ± 0.623
4.571IleVal: 4.571 ± 0.792
0.554IleTrp: 0.554 ± 0.245
1.177IleTyr: 1.177 ± 0.555
0.0IleXaa: 0.0 ± 0.0
Lys
3.532LysAla: 3.532 ± 0.73
1.732LysCys: 1.732 ± 0.272
3.048LysAsp: 3.048 ± 0.697
2.493LysGlu: 2.493 ± 0.274
2.424LysPhe: 2.424 ± 0.24
4.779LysGly: 4.779 ± 0.82
2.355LysHis: 2.355 ± 0.43
2.355LysIle: 2.355 ± 0.359
3.463LysLys: 3.463 ± 0.734
6.164LysLeu: 6.164 ± 0.853
1.316LysMet: 1.316 ± 0.236
2.632LysAsn: 2.632 ± 0.585
3.463LysPro: 3.463 ± 0.585
2.009LysGln: 2.009 ± 0.329
2.216LysArg: 2.216 ± 0.256
4.225LysSer: 4.225 ± 0.482
3.394LysThr: 3.394 ± 0.548
4.017LysVal: 4.017 ± 0.743
0.97LysTrp: 0.97 ± 0.27
2.355LysTyr: 2.355 ± 0.498
0.0LysXaa: 0.0 ± 0.0
Leu
7.272LeuAla: 7.272 ± 0.89
2.355LeuCys: 2.355 ± 1.042
4.848LeuAsp: 4.848 ± 0.544
4.502LeuGlu: 4.502 ± 0.54
3.048LeuPhe: 3.048 ± 0.918
4.987LeuGly: 4.987 ± 0.612
2.009LeuHis: 2.009 ± 0.35
4.363LeuIle: 4.363 ± 1.412
8.034LeuLys: 8.034 ± 1.376
10.389LeuLeu: 10.389 ± 3.315
2.632LeuMet: 2.632 ± 0.479
6.303LeuAsn: 6.303 ± 0.964
4.987LeuPro: 4.987 ± 0.794
5.125LeuGln: 5.125 ± 0.779
3.809LeuArg: 3.809 ± 0.643
5.887LeuSer: 5.887 ± 0.44
6.649LeuThr: 6.649 ± 0.362
5.957LeuVal: 5.957 ± 1.292
1.108LeuTrp: 1.108 ± 0.289
3.671LeuTyr: 3.671 ± 0.772
0.0LeuXaa: 0.0 ± 0.0
Met
1.593MetAla: 1.593 ± 0.613
0.831MetCys: 0.831 ± 0.289
1.039MetAsp: 1.039 ± 0.197
0.97MetGlu: 0.97 ± 0.417
1.177MetPhe: 1.177 ± 0.241
0.693MetGly: 0.693 ± 0.212
0.346MetHis: 0.346 ± 0.161
0.485MetIle: 0.485 ± 0.241
0.762MetLys: 0.762 ± 0.331
2.424MetLeu: 2.424 ± 0.345
1.039MetMet: 1.039 ± 0.298
0.693MetAsn: 0.693 ± 0.212
1.316MetPro: 1.316 ± 0.382
1.039MetGln: 1.039 ± 0.289
1.039MetArg: 1.039 ± 0.426
2.424MetSer: 2.424 ± 0.411
1.177MetThr: 1.177 ± 0.214
1.662MetVal: 1.662 ± 0.504
0.554MetTrp: 0.554 ± 0.268
1.247MetTyr: 1.247 ± 0.273
0.0MetXaa: 0.0 ± 0.0
Asn
3.186AsnAla: 3.186 ± 0.579
1.593AsnCys: 1.593 ± 0.386
2.147AsnAsp: 2.147 ± 0.511
1.87AsnGlu: 1.87 ± 0.308
2.563AsnPhe: 2.563 ± 0.764
4.294AsnGly: 4.294 ± 0.715
0.623AsnHis: 0.623 ± 0.184
2.978AsnIle: 2.978 ± 0.539
3.048AsnLys: 3.048 ± 0.276
5.818AsnLeu: 5.818 ± 0.76
1.177AsnMet: 1.177 ± 0.329
3.463AsnAsn: 3.463 ± 0.53
1.87AsnPro: 1.87 ± 0.413
1.454AsnGln: 1.454 ± 0.373
1.801AsnArg: 1.801 ± 0.363
4.71AsnSer: 4.71 ± 0.7
3.186AsnThr: 3.186 ± 0.416
4.571AsnVal: 4.571 ± 0.692
0.554AsnTrp: 0.554 ± 0.148
2.701AsnTyr: 2.701 ± 0.254
0.0AsnXaa: 0.0 ± 0.0
Pro
2.355ProAla: 2.355 ± 0.377
1.039ProCys: 1.039 ± 0.257
1.732ProAsp: 1.732 ± 0.387
1.316ProGlu: 1.316 ± 0.296
2.632ProPhe: 2.632 ± 0.891
1.939ProGly: 1.939 ± 0.439
0.9ProHis: 0.9 ± 0.168
2.77ProIle: 2.77 ± 1.078
2.563ProLys: 2.563 ± 0.643
4.433ProLeu: 4.433 ± 0.476
0.485ProMet: 0.485 ± 0.323
2.701ProAsn: 2.701 ± 0.668
1.524ProPro: 1.524 ± 0.285
1.662ProGln: 1.662 ± 0.762
1.524ProArg: 1.524 ± 0.423
2.355ProSer: 2.355 ± 0.399
3.117ProThr: 3.117 ± 0.401
2.978ProVal: 2.978 ± 0.471
0.277ProTrp: 0.277 ± 0.082
1.454ProTyr: 1.454 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
2.632GlnAla: 2.632 ± 0.403
0.831GlnCys: 0.831 ± 0.226
1.454GlnAsp: 1.454 ± 0.378
2.216GlnGlu: 2.216 ± 0.771
1.385GlnPhe: 1.385 ± 0.277
2.009GlnGly: 2.009 ± 0.794
0.9GlnHis: 0.9 ± 0.356
2.078GlnIle: 2.078 ± 0.715
1.662GlnLys: 1.662 ± 0.463
4.294GlnLeu: 4.294 ± 0.862
0.97GlnMet: 0.97 ± 0.204
1.316GlnAsn: 1.316 ± 0.327
2.701GlnPro: 2.701 ± 0.467
2.009GlnGln: 2.009 ± 0.485
1.108GlnArg: 1.108 ± 0.393
2.286GlnSer: 2.286 ± 0.585
3.186GlnThr: 3.186 ± 0.573
2.078GlnVal: 2.078 ± 0.298
0.623GlnTrp: 0.623 ± 0.158
1.247GlnTyr: 1.247 ± 0.293
0.0GlnXaa: 0.0 ± 0.0
Arg
2.632ArgAla: 2.632 ± 0.363
1.039ArgCys: 1.039 ± 0.475
1.593ArgAsp: 1.593 ± 0.39
2.355ArgGlu: 2.355 ± 0.665
1.87ArgPhe: 1.87 ± 0.355
1.732ArgGly: 1.732 ± 0.636
0.831ArgHis: 0.831 ± 0.297
1.801ArgIle: 1.801 ± 0.71
2.563ArgLys: 2.563 ± 0.355
2.147ArgLeu: 2.147 ± 0.728
0.416ArgMet: 0.416 ± 0.373
1.454ArgAsn: 1.454 ± 0.706
0.762ArgPro: 0.762 ± 0.165
1.524ArgGln: 1.524 ± 0.314
1.247ArgArg: 1.247 ± 0.423
2.632ArgSer: 2.632 ± 0.515
1.939ArgThr: 1.939 ± 0.273
3.602ArgVal: 3.602 ± 0.62
0.416ArgTrp: 0.416 ± 0.254
1.524ArgTyr: 1.524 ± 0.328
0.0ArgXaa: 0.0 ± 0.0
Ser
6.164SerAla: 6.164 ± 0.75
1.939SerCys: 1.939 ± 0.535
3.532SerAsp: 3.532 ± 0.749
4.225SerGlu: 4.225 ± 0.611
4.363SerPhe: 4.363 ± 0.845
3.74SerGly: 3.74 ± 0.588
1.524SerHis: 1.524 ± 0.437
2.493SerIle: 2.493 ± 0.311
3.463SerLys: 3.463 ± 0.979
6.372SerLeu: 6.372 ± 0.848
1.593SerMet: 1.593 ± 0.218
2.563SerAsn: 2.563 ± 1.073
2.147SerPro: 2.147 ± 0.948
1.939SerGln: 1.939 ± 0.345
2.078SerArg: 2.078 ± 1.267
4.571SerSer: 4.571 ± 1.048
5.195SerThr: 5.195 ± 0.469
5.125SerVal: 5.125 ± 0.624
0.831SerTrp: 0.831 ± 0.137
3.186SerTyr: 3.186 ± 0.433
0.0SerXaa: 0.0 ± 0.0
Thr
3.394ThrAla: 3.394 ± 0.437
3.186ThrCys: 3.186 ± 0.727
3.325ThrAsp: 3.325 ± 0.347
3.948ThrGlu: 3.948 ± 0.305
4.433ThrPhe: 4.433 ± 0.679
4.017ThrGly: 4.017 ± 0.399
1.108ThrHis: 1.108 ± 0.308
5.402ThrIle: 5.402 ± 0.865
3.809ThrLys: 3.809 ± 0.633
6.857ThrLeu: 6.857 ± 0.463
1.108ThrMet: 1.108 ± 0.262
3.879ThrAsn: 3.879 ± 0.632
3.463ThrPro: 3.463 ± 0.462
3.048ThrGln: 3.048 ± 0.993
2.909ThrArg: 2.909 ± 0.377
4.987ThrSer: 4.987 ± 0.798
6.649ThrThr: 6.649 ± 0.839
5.957ThrVal: 5.957 ± 0.605
0.693ThrTrp: 0.693 ± 0.23
2.701ThrTyr: 2.701 ± 0.533
0.0ThrXaa: 0.0 ± 0.0
Val
5.402ValAla: 5.402 ± 0.373
2.563ValCys: 2.563 ± 0.466
4.571ValAsp: 4.571 ± 0.626
5.125ValGlu: 5.125 ± 1.172
3.325ValPhe: 3.325 ± 0.497
4.156ValGly: 4.156 ± 0.603
0.9ValHis: 0.9 ± 0.217
2.978ValIle: 2.978 ± 0.367
4.086ValLys: 4.086 ± 0.36
8.658ValLeu: 8.658 ± 1.088
2.009ValMet: 2.009 ± 0.321
4.571ValAsn: 4.571 ± 0.627
2.978ValPro: 2.978 ± 0.307
3.394ValGln: 3.394 ± 0.442
2.978ValArg: 2.978 ± 0.488
4.086ValSer: 4.086 ± 0.581
6.441ValThr: 6.441 ± 0.687
8.104ValVal: 8.104 ± 0.775
0.416ValTrp: 0.416 ± 0.156
4.641ValTyr: 4.641 ± 0.394
0.0ValXaa: 0.0 ± 0.0
Trp
0.693TrpAla: 0.693 ± 0.24
0.208TrpCys: 0.208 ± 0.074
0.623TrpAsp: 0.623 ± 0.263
0.485TrpGlu: 0.485 ± 0.165
0.97TrpPhe: 0.97 ± 0.231
0.139TrpGly: 0.139 ± 0.09
0.346TrpHis: 0.346 ± 0.18
0.416TrpIle: 0.416 ± 0.305
0.554TrpLys: 0.554 ± 0.166
1.939TrpLeu: 1.939 ± 0.503
0.208TrpMet: 0.208 ± 0.084
1.247TrpAsn: 1.247 ± 0.24
0.416TrpPro: 0.416 ± 0.361
0.277TrpGln: 0.277 ± 0.184
0.277TrpArg: 0.277 ± 0.227
0.623TrpSer: 0.623 ± 0.181
0.554TrpThr: 0.554 ± 0.206
0.554TrpVal: 0.554 ± 0.139
0.069TrpTrp: 0.069 ± 0.045
0.485TrpTyr: 0.485 ± 0.295
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.078TyrAla: 2.078 ± 0.312
2.286TyrCys: 2.286 ± 0.518
1.939TyrAsp: 1.939 ± 0.599
2.424TyrGlu: 2.424 ± 0.322
2.909TyrPhe: 2.909 ± 0.354
1.801TyrGly: 1.801 ± 0.333
0.693TyrHis: 0.693 ± 0.345
2.563TyrIle: 2.563 ± 0.592
4.017TyrLys: 4.017 ± 0.689
4.225TyrLeu: 4.225 ± 0.688
1.108TyrMet: 1.108 ± 0.417
2.286TyrAsn: 2.286 ± 0.305
1.732TyrPro: 1.732 ± 0.444
1.454TyrGln: 1.454 ± 0.765
1.87TyrArg: 1.87 ± 0.284
2.563TyrSer: 2.563 ± 1.071
2.84TyrThr: 2.84 ± 0.222
3.186TyrVal: 3.186 ± 0.579
0.416TyrTrp: 0.416 ± 0.149
2.563TyrTyr: 2.563 ± 0.203
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 17 proteins (14439 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski