Amino acid dipeptide frequency for Severe acute respiratory syndrome coronavirus (SARS-CoV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
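The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function name `dipeptide_permille`, the use of one-letter codes, the mapping of every non-standard letter to `X`, and the choice to count overlapping pairs within each protein separately are all assumptions, since the page does not say how the counts were made.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Frequency expected for each of the 400 dipeptides if all were equally likely.
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(value_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if value_permille > UNIFORM_PERMILLE:
        return "over"
    if value_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Toy input, not real SARS-CoV sequences.
freqs = dipeptide_permille(["MALLV", "ALLK"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With only seven pairs in the toy input every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only for a whole proteome.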

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
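As a worked example of the unit conversion, the short Python sketch below turns a per mille value from the table into a fraction and a percentage, and compares it with the 2.5‰ expected for equally likely dipeptides. The helper names are hypothetical and chosen only for this illustration.

```python
# Expected frequency of each of the 400 dipeptides under equal likelihood, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PERMILLE


leu_leu = 10.573  # LeuLeu value taken from the table
print(permille_to_fraction(leu_leu))
print(permille_to_percent(leu_leu))
print(fold_vs_uniform(leu_leu))
```

Read this way, the LeuLeu entry corresponds to roughly 1.06% of all dipeptides, a little over four times the uniform expectation.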

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.469AlaAla: 6.469 ± 0.454
2.295AlaCys: 2.295 ± 0.354
2.991AlaAsp: 2.991 ± 0.538
2.504AlaGlu: 2.504 ± 0.611
2.852AlaPhe: 2.852 ± 0.661
4.452AlaGly: 4.452 ± 0.559
0.904AlaHis: 0.904 ± 0.369
4.104AlaIle: 4.104 ± 0.367
4.035AlaLys: 4.035 ± 1.192
7.582AlaLeu: 7.582 ± 0.816
2.574AlaMet: 2.574 ± 0.413
3.687AlaAsn: 3.687 ± 0.777
2.643AlaPro: 2.643 ± 0.336
2.226AlaGln: 2.226 ± 0.304
3.13AlaArg: 3.13 ± 0.476
5.008AlaSer: 5.008 ± 1.043
4.869AlaThr: 4.869 ± 0.495
4.661AlaVal: 4.661 ± 0.857
1.391AlaTrp: 1.391 ± 0.326
3.826AlaTyr: 3.826 ± 0.474
0.0AlaXaa: 0.0 ± 0.0
Cys
2.574CysAla: 2.574 ± 0.324
1.6CysCys: 1.6 ± 0.301
2.156CysAsp: 2.156 ± 0.457
1.183CysGlu: 1.183 ± 0.39
1.391CysPhe: 1.391 ± 0.248
2.504CysGly: 2.504 ± 0.594
0.556CysHis: 0.556 ± 0.13
1.878CysIle: 1.878 ± 0.497
0.974CysLys: 0.974 ± 0.366
2.852CysLeu: 2.852 ± 0.539
0.696CysMet: 0.696 ± 0.192
1.391CysAsn: 1.391 ± 0.216
0.835CysPro: 0.835 ± 0.163
0.626CysGln: 0.626 ± 0.216
1.043CysArg: 1.043 ± 0.362
1.809CysSer: 1.809 ± 0.466
2.574CysThr: 2.574 ± 0.49
3.061CysVal: 3.061 ± 0.677
0.417CysTrp: 0.417 ± 0.515
1.669CysTyr: 1.669 ± 0.354
0.0CysXaa: 0.0 ± 0.0
Asp
4.591AspAla: 4.591 ± 0.915
1.252AspCys: 1.252 ± 0.297
2.643AspAsp: 2.643 ± 0.428
2.852AspGlu: 2.852 ± 0.308
2.504AspPhe: 2.504 ± 0.463
3.826AspGly: 3.826 ± 0.727
0.835AspHis: 0.835 ± 0.248
2.922AspIle: 2.922 ± 0.62
2.574AspLys: 2.574 ± 0.507
4.661AspLeu: 4.661 ± 0.524
1.183AspMet: 1.183 ± 0.274
3.2AspAsn: 3.2 ± 0.523
1.6AspPro: 1.6 ± 0.956
1.461AspGln: 1.461 ± 0.272
1.322AspArg: 1.322 ± 0.382
3.061AspSer: 3.061 ± 0.403
3.687AspThr: 3.687 ± 1.063
4.174AspVal: 4.174 ± 0.892
0.487AspTrp: 0.487 ± 0.203
3.408AspTyr: 3.408 ± 0.509
0.0AspXaa: 0.0 ± 0.0
Glu
3.478GluAla: 3.478 ± 0.614
1.6GluCys: 1.6 ± 0.317
2.713GluAsp: 2.713 ± 0.546
4.73GluGlu: 4.73 ± 0.926
1.948GluPhe: 1.948 ± 0.532
2.782GluGly: 2.782 ± 0.557
1.391GluHis: 1.391 ± 0.43
3.2GluIle: 3.2 ± 0.523
1.948GluLys: 1.948 ± 0.539
4.661GluLeu: 4.661 ± 0.706
0.974GluMet: 0.974 ± 0.324
1.809GluAsn: 1.809 ± 0.434
2.087GluPro: 2.087 ± 0.376
2.017GluGln: 2.017 ± 0.224
1.252GluArg: 1.252 ± 0.301
2.435GluSer: 2.435 ± 0.362
2.852GluThr: 2.852 ± 0.739
3.895GluVal: 3.895 ± 0.544
0.556GluTrp: 0.556 ± 0.193
1.878GluTyr: 1.878 ± 0.615
0.0GluXaa: 0.0 ± 0.0
Phe
2.852PheAla: 2.852 ± 0.759
1.878PheCys: 1.878 ± 0.525
2.713PheAsp: 2.713 ± 0.784
1.669PheGlu: 1.669 ± 0.453
2.226PhePhe: 2.226 ± 0.286
2.852PheGly: 2.852 ± 0.785
0.765PheHis: 0.765 ± 0.342
2.295PheIle: 2.295 ± 0.387
2.991PheLys: 2.991 ± 0.574
5.634PheLeu: 5.634 ± 1.244
0.974PheMet: 0.974 ± 0.34
2.782PheAsn: 2.782 ± 1.071
1.878PhePro: 1.878 ± 0.273
0.974PheGln: 0.974 ± 0.704
1.53PheArg: 1.53 ± 0.425
3.13PheSer: 3.13 ± 0.632
3.826PheThr: 3.826 ± 0.717
3.617PheVal: 3.617 ± 0.709
0.348PheTrp: 0.348 ± 0.171
2.574PheTyr: 2.574 ± 0.432
0.0PheXaa: 0.0 ± 0.0
Gly
4.591GlyAla: 4.591 ± 0.675
1.53GlyCys: 1.53 ± 0.395
3.687GlyAsp: 3.687 ± 0.497
2.156GlyGlu: 2.156 ± 0.352
3.339GlyPhe: 3.339 ± 0.507
4.035GlyGly: 4.035 ± 0.837
1.53GlyHis: 1.53 ± 0.477
3.617GlyIle: 3.617 ± 0.661
2.852GlyLys: 2.852 ± 0.483
3.478GlyLeu: 3.478 ± 0.698
1.043GlyMet: 1.043 ± 0.322
2.852GlyAsn: 2.852 ± 0.582
2.226GlyPro: 2.226 ± 0.68
2.226GlyGln: 2.226 ± 0.48
1.739GlyArg: 1.739 ± 0.259
3.895GlySer: 3.895 ± 0.255
5.426GlyThr: 5.426 ± 1.105
6.26GlyVal: 6.26 ± 0.926
0.348GlyTrp: 0.348 ± 0.285
2.852GlyTyr: 2.852 ± 0.404
0.0GlyXaa: 0.0 ± 0.0
His
1.391HisAla: 1.391 ± 0.298
0.765HisCys: 0.765 ± 0.24
0.835HisAsp: 0.835 ± 0.364
1.113HisGlu: 1.113 ± 0.235
1.183HisPhe: 1.183 ± 0.425
1.461HisGly: 1.461 ± 0.225
0.626HisHis: 0.626 ± 0.207
1.113HisIle: 1.113 ± 0.448
0.765HisLys: 0.765 ± 0.273
2.226HisLeu: 2.226 ± 0.311
0.417HisMet: 0.417 ± 0.184
0.974HisAsn: 0.974 ± 0.321
0.556HisPro: 0.556 ± 0.192
0.487HisGln: 0.487 ± 0.211
0.278HisArg: 0.278 ± 0.26
1.669HisSer: 1.669 ± 0.351
2.226HisThr: 2.226 ± 0.594
1.669HisVal: 1.669 ± 0.339
0.348HisTrp: 0.348 ± 0.152
0.696HisTyr: 0.696 ± 0.141
0.0HisXaa: 0.0 ± 0.0
Ile
3.617IleAla: 3.617 ± 1.091
1.53IleCys: 1.53 ± 0.363
3.2IleAsp: 3.2 ± 0.395
1.669IleGlu: 1.669 ± 0.48
1.53IlePhe: 1.53 ± 0.365
3.2IleGly: 3.2 ± 0.907
0.487IleHis: 0.487 ± 0.144
3.061IleIle: 3.061 ± 0.891
3.826IleLys: 3.826 ± 0.723
4.452IleLeu: 4.452 ± 0.542
1.669IleMet: 1.669 ± 0.353
2.852IleAsn: 2.852 ± 0.436
1.809IlePro: 1.809 ± 0.386
2.504IleGln: 2.504 ± 0.821
1.6IleArg: 1.6 ± 0.263
3.269IleSer: 3.269 ± 0.648
4.591IleThr: 4.591 ± 0.554
4.521IleVal: 4.521 ± 0.813
0.487IleTrp: 0.487 ± 0.181
0.974IleTyr: 0.974 ± 0.497
0.0IleXaa: 0.0 ± 0.0
Lys
2.852LysAla: 2.852 ± 0.763
1.809LysCys: 1.809 ± 0.41
2.852LysAsp: 2.852 ± 0.545
2.782LysGlu: 2.782 ± 0.451
2.782LysPhe: 2.782 ± 0.526
4.869LysGly: 4.869 ± 0.807
1.809LysHis: 1.809 ± 0.356
2.295LysIle: 2.295 ± 0.514
3.13LysLys: 3.13 ± 1.287
6.608LysLeu: 6.608 ± 0.459
1.6LysMet: 1.6 ± 0.256
2.087LysAsn: 2.087 ± 0.318
3.617LysPro: 3.617 ± 0.441
1.391LysGln: 1.391 ± 0.799
2.574LysArg: 2.574 ± 0.306
4.174LysSer: 4.174 ± 0.823
3.687LysThr: 3.687 ± 0.541
3.339LysVal: 3.339 ± 0.463
0.696LysTrp: 0.696 ± 0.177
2.087LysTyr: 2.087 ± 0.484
0.0LysXaa: 0.0 ± 0.0
Leu
6.817LeuAla: 6.817 ± 0.913
2.922LeuCys: 2.922 ± 0.508
5.426LeuAsp: 5.426 ± 0.811
4.73LeuGlu: 4.73 ± 0.989
3.269LeuPhe: 3.269 ± 0.814
5.147LeuGly: 5.147 ± 0.507
1.739LeuHis: 1.739 ± 0.376
3.408LeuIle: 3.408 ± 1.381
7.026LeuLys: 7.026 ± 0.825
10.573LeuLeu: 10.573 ± 1.889
2.713LeuMet: 2.713 ± 0.654
6.469LeuAsn: 6.469 ± 0.645
4.661LeuPro: 4.661 ± 0.933
4.382LeuGln: 4.382 ± 0.325
4.869LeuArg: 4.869 ± 0.538
7.443LeuSer: 7.443 ± 1.307
5.774LeuThr: 5.774 ± 0.675
6.121LeuVal: 6.121 ± 1.375
1.043LeuTrp: 1.043 ± 0.414
3.548LeuTyr: 3.548 ± 0.639
0.0LeuXaa: 0.0 ± 0.0
Met
1.739MetAla: 1.739 ± 0.744
0.974MetCys: 0.974 ± 0.228
1.739MetAsp: 1.739 ± 0.379
0.835MetGlu: 0.835 ± 0.574
0.904MetPhe: 0.904 ± 0.185
0.974MetGly: 0.974 ± 0.254
0.417MetHis: 0.417 ± 0.167
0.556MetIle: 0.556 ± 0.28
0.974MetLys: 0.974 ± 0.301
2.991MetLeu: 2.991 ± 0.575
0.696MetMet: 0.696 ± 0.286
0.904MetAsn: 0.904 ± 0.194
1.391MetPro: 1.391 ± 0.319
1.183MetGln: 1.183 ± 0.354
0.835MetArg: 0.835 ± 0.293
2.295MetSer: 2.295 ± 0.469
1.461MetThr: 1.461 ± 0.301
1.6MetVal: 1.6 ± 0.401
0.696MetTrp: 0.696 ± 0.409
1.322MetTyr: 1.322 ± 0.216
0.0MetXaa: 0.0 ± 0.0
Asn
4.035AsnAla: 4.035 ± 0.622
1.669AsnCys: 1.669 ± 0.457
1.669AsnAsp: 1.669 ± 0.285
1.809AsnGlu: 1.809 ± 0.392
2.087AsnPhe: 2.087 ± 1.246
4.382AsnGly: 4.382 ± 0.617
1.53AsnHis: 1.53 ± 0.403
2.504AsnIle: 2.504 ± 0.493
2.782AsnLys: 2.782 ± 0.482
4.73AsnLeu: 4.73 ± 0.696
1.461AsnMet: 1.461 ± 0.373
3.269AsnAsn: 3.269 ± 0.412
1.739AsnPro: 1.739 ± 0.341
1.322AsnGln: 1.322 ± 0.713
1.809AsnArg: 1.809 ± 0.586
3.687AsnSer: 3.687 ± 0.949
3.2AsnThr: 3.2 ± 0.74
4.521AsnVal: 4.521 ± 0.458
0.417AsnTrp: 0.417 ± 0.198
2.435AsnTyr: 2.435 ± 0.351
0.0AsnXaa: 0.0 ± 0.0
Pro
2.922ProAla: 2.922 ± 0.395
1.322ProCys: 1.322 ± 0.409
1.739ProAsp: 1.739 ± 0.563
1.53ProGlu: 1.53 ± 0.31
2.087ProPhe: 2.087 ± 0.832
1.739ProGly: 1.739 ± 0.386
0.765ProHis: 0.765 ± 0.287
2.713ProIle: 2.713 ± 0.361
2.991ProLys: 2.991 ± 0.687
4.661ProLeu: 4.661 ± 0.483
0.556ProMet: 0.556 ± 0.377
2.365ProAsn: 2.365 ± 0.236
1.669ProPro: 1.669 ± 0.318
1.391ProGln: 1.391 ± 1.114
1.53ProArg: 1.53 ± 0.561
2.365ProSer: 2.365 ± 0.43
2.991ProThr: 2.991 ± 0.373
3.548ProVal: 3.548 ± 0.848
0.278ProTrp: 0.278 ± 0.093
1.043ProTyr: 1.043 ± 0.149
0.0ProXaa: 0.0 ± 0.0
Gln
2.922GlnAla: 2.922 ± 0.286
1.043GlnCys: 1.043 ± 0.258
1.669GlnAsp: 1.669 ± 0.565
2.017GlnGlu: 2.017 ± 0.535
1.53GlnPhe: 1.53 ± 0.564
2.017GlnGly: 2.017 ± 0.932
0.835GlnHis: 0.835 ± 0.499
1.669GlnIle: 1.669 ± 1.194
1.461GlnLys: 1.461 ± 0.478
3.965GlnLeu: 3.965 ± 0.84
1.043GlnMet: 1.043 ± 0.311
1.391GlnAsn: 1.391 ± 0.524
2.156GlnPro: 2.156 ± 0.282
1.878GlnGln: 1.878 ± 0.518
1.669GlnArg: 1.669 ± 0.594
2.156GlnSer: 2.156 ± 0.251
2.574GlnThr: 2.574 ± 0.618
2.643GlnVal: 2.643 ± 0.582
0.696GlnTrp: 0.696 ± 0.205
1.113GlnTyr: 1.113 ± 0.232
0.0GlnXaa: 0.0 ± 0.0
Arg
3.548ArgAla: 3.548 ± 0.443
1.252ArgCys: 1.252 ± 0.311
2.017ArgAsp: 2.017 ± 0.433
2.435ArgGlu: 2.435 ± 0.527
1.461ArgPhe: 1.461 ± 0.298
2.156ArgGly: 2.156 ± 1.351
1.183ArgHis: 1.183 ± 0.245
1.809ArgIle: 1.809 ± 0.667
2.017ArgLys: 2.017 ± 0.461
2.922ArgLeu: 2.922 ± 0.426
0.696ArgMet: 0.696 ± 0.616
1.739ArgAsn: 1.739 ± 0.76
1.043ArgPro: 1.043 ± 0.355
1.739ArgGln: 1.739 ± 0.764
1.043ArgArg: 1.043 ± 0.965
2.643ArgSer: 2.643 ± 0.425
1.53ArgThr: 1.53 ± 0.576
3.478ArgVal: 3.478 ± 0.584
0.487ArgTrp: 0.487 ± 0.351
1.461ArgTyr: 1.461 ± 0.242
0.0ArgXaa: 0.0 ± 0.0
Ser
5.843SerAla: 5.843 ± 1.015
1.669SerCys: 1.669 ± 0.347
3.687SerAsp: 3.687 ± 0.896
3.756SerGlu: 3.756 ± 0.598
4.035SerPhe: 4.035 ± 0.671
3.965SerGly: 3.965 ± 0.928
1.391SerHis: 1.391 ± 0.513
2.713SerIle: 2.713 ± 0.447
3.408SerLys: 3.408 ± 0.466
6.539SerLeu: 6.539 ± 0.926
1.391SerMet: 1.391 ± 0.306
2.991SerAsn: 2.991 ± 0.693
2.295SerPro: 2.295 ± 0.835
2.365SerGln: 2.365 ± 0.604
2.017SerArg: 2.017 ± 1.71
3.895SerSer: 3.895 ± 0.706
5.008SerThr: 5.008 ± 1.143
5.704SerVal: 5.704 ± 0.892
0.904SerTrp: 0.904 ± 0.13
3.13SerTyr: 3.13 ± 0.738
0.0SerXaa: 0.0 ± 0.0
Thr
3.617ThrAla: 3.617 ± 1.226
2.922ThrCys: 2.922 ± 0.85
3.061ThrAsp: 3.061 ± 0.834
3.895ThrGlu: 3.895 ± 0.465
4.591ThrPhe: 4.591 ± 0.625
4.104ThrGly: 4.104 ± 0.469
1.739ThrHis: 1.739 ± 0.475
4.661ThrIle: 4.661 ± 0.736
3.617ThrLys: 3.617 ± 0.37
6.4ThrLeu: 6.4 ± 0.443
1.669ThrMet: 1.669 ± 0.462
3.339ThrAsn: 3.339 ± 0.295
2.922ThrPro: 2.922 ± 0.446
3.339ThrGln: 3.339 ± 0.965
2.782ThrArg: 2.782 ± 0.548
5.704ThrSer: 5.704 ± 0.943
6.608ThrThr: 6.608 ± 0.943
5.078ThrVal: 5.078 ± 0.509
0.348ThrTrp: 0.348 ± 0.152
2.365ThrTyr: 2.365 ± 0.422
0.0ThrXaa: 0.0 ± 0.0
Val
5.704ValAla: 5.704 ± 0.824
1.878ValCys: 1.878 ± 0.551
4.939ValAsp: 4.939 ± 1.132
4.174ValGlu: 4.174 ± 1.353
4.104ValPhe: 4.104 ± 0.494
3.061ValGly: 3.061 ± 0.698
0.904ValHis: 0.904 ± 0.329
4.104ValIle: 4.104 ± 0.521
5.147ValLys: 5.147 ± 0.761
7.791ValLeu: 7.791 ± 0.581
1.6ValMet: 1.6 ± 0.212
3.2ValAsn: 3.2 ± 0.684
2.991ValPro: 2.991 ± 0.308
3.269ValGln: 3.269 ± 0.434
3.2ValArg: 3.2 ± 0.303
4.591ValSer: 4.591 ± 0.664
6.817ValThr: 6.817 ± 0.792
7.026ValVal: 7.026 ± 1.252
0.556ValTrp: 0.556 ± 0.135
4.174ValTyr: 4.174 ± 0.502
0.0ValXaa: 0.0 ± 0.0
Trp
0.696TrpAla: 0.696 ± 0.273
0.209TrpCys: 0.209 ± 0.084
0.348TrpAsp: 0.348 ± 0.226
0.626TrpGlu: 0.626 ± 0.148
1.043TrpPhe: 1.043 ± 0.23
0.209TrpGly: 0.209 ± 0.123
0.348TrpHis: 0.348 ± 0.236
0.556TrpIle: 0.556 ± 0.22
0.556TrpLys: 0.556 ± 0.193
1.6TrpLeu: 1.6 ± 0.687
0.139TrpMet: 0.139 ± 0.058
1.252TrpAsn: 1.252 ± 0.251
0.417TrpPro: 0.417 ± 0.451
0.278TrpGln: 0.278 ± 0.139
0.209TrpArg: 0.209 ± 0.123
0.835TrpSer: 0.835 ± 0.262
0.417TrpThr: 0.417 ± 0.129
0.765TrpVal: 0.765 ± 0.256
0.07TrpTrp: 0.07 ± 0.045
0.348TrpTyr: 0.348 ± 0.359
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.948TyrAla: 1.948 ± 0.408
1.669TyrCys: 1.669 ± 0.452
2.156TyrAsp: 2.156 ± 0.579
1.739TyrGlu: 1.739 ± 0.314
2.643TyrPhe: 2.643 ± 0.455
1.948TyrGly: 1.948 ± 0.291
1.043TyrHis: 1.043 ± 0.342
1.669TyrIle: 1.669 ± 0.285
3.965TyrLys: 3.965 ± 0.44
3.756TyrLeu: 3.756 ± 0.598
1.322TyrMet: 1.322 ± 0.445
2.574TyrAsn: 2.574 ± 0.277
1.669TyrPro: 1.669 ± 0.429
1.391TyrGln: 1.391 ± 0.581
2.226TyrArg: 2.226 ± 0.349
2.643TyrSer: 2.643 ± 0.764
2.643TyrThr: 2.643 ± 0.51
3.548TyrVal: 3.548 ± 0.697
0.348TyrTrp: 0.348 ± 0.103
2.226TyrTyr: 2.226 ± 0.355
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (14377 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
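A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the sample standard deviation over replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide, counting overlapping pairs per protein."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_permille(sample, pair))
    return statistics.stdev(values)


# Toy sequences in one-letter code, not real SARS-CoV proteins.
toy = ["MALLVLL", "ALLKGG", "GGGGSV", "MKTLLA"]
print(pair_permille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much a statistic depends on which proteins are in the set; with only 15 proteins this can be sizeable.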

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski