Amino acid dipeptide frequency for BtVs-BetaCoV/SC2013

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred uniformly at random in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
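
As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It assumes overlapping residue pairs are counted within each protein and that the counts are normalized by the total number of pairs; the database's own normalization may differ. The function name `dipeptide_per_mille` and the toy sequences are hypothetical and do not come from this proteome.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted within each protein and
    normalized by the total number of pairs over all proteins.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is mapped to X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
UNIFORM_EXPECTATION = 1000.0 / 400

toy_proteins = ["MALLV", "LLA"]  # hypothetical sequences for illustration only
freqs = dipeptide_per_mille(toy_proteins)
overrepresented = sorted(dp for dp, f in freqs.items() if f > UNIFORM_EXPECTATION)
print(overrepresented)
```

With only six pairs in the toy input, every observed dipeptide exceeds the uniform expectation; on a real proteome the comparison becomes meaningful.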

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
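
The conversion can be sketched in a few lines of Python. The two values used here (LeuLeu 8.808‰ and TrpTrp 0.198‰) are taken from the table below; the helper name `per_mille_to_fraction` is hypothetical, and the ratio is computed against the uniform expectation of 1/400.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


UNIFORM_FRACTION = 1 / 400  # 0.25% expressed as a fraction

# Values copied from the table on this page.
table_values = {"LeuLeu": 8.808, "TrpTrp": 0.198}

ratios = {}
for name, per_mille in table_values.items():
    fraction = per_mille_to_fraction(per_mille)
    ratios[name] = fraction / UNIFORM_FRACTION
    print(f"{name}: fraction {fraction:.6f}, {ratios[name]:.2f}x the uniform expectation")
```

A ratio above 1 corresponds to an overrepresented dipeptide and a ratio below 1 to an underrepresented one.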

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.828AlaAla: 6.828 ± 1.568
2.672AlaCys: 2.672 ± 0.666
2.969AlaAsp: 2.969 ± 0.817
2.177AlaGlu: 2.177 ± 0.89
3.365AlaPhe: 3.365 ± 1.148
3.958AlaGly: 3.958 ± 0.751
1.188AlaHis: 1.188 ± 0.327
4.057AlaIle: 4.057 ± 1.015
3.859AlaLys: 3.859 ± 0.973
6.927AlaLeu: 6.927 ± 0.827
2.078AlaMet: 2.078 ± 0.674
6.037AlaAsn: 6.037 ± 1.153
3.266AlaPro: 3.266 ± 2.096
2.87AlaGln: 2.87 ± 1.195
3.266AlaArg: 3.266 ± 1.191
6.136AlaSer: 6.136 ± 1.067
5.443AlaThr: 5.443 ± 0.789
7.323AlaVal: 7.323 ± 1.0
0.99AlaTrp: 0.99 ± 0.334
3.563AlaTyr: 3.563 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
2.078CysAla: 2.078 ± 0.677
1.089CysCys: 1.089 ± 0.261
2.078CysAsp: 2.078 ± 0.675
0.99CysGlu: 0.99 ± 0.377
1.286CysPhe: 1.286 ± 0.549
2.177CysGly: 2.177 ± 0.82
0.396CysHis: 0.396 ± 0.164
1.286CysIle: 1.286 ± 0.48
1.781CysLys: 1.781 ± 0.485
2.375CysLeu: 2.375 ± 1.06
0.99CysMet: 0.99 ± 0.539
1.88CysAsn: 1.88 ± 0.524
0.891CysPro: 0.891 ± 0.349
0.891CysGln: 0.891 ± 0.576
1.385CysArg: 1.385 ± 0.377
1.484CysSer: 1.484 ± 0.514
2.474CysThr: 2.474 ± 0.842
2.87CysVal: 2.87 ± 0.833
0.297CysTrp: 0.297 ± 0.161
2.375CysTyr: 2.375 ± 1.016
0.0CysXaa: 0.0 ± 0.0
Asp
4.354AspAla: 4.354 ± 0.683
1.484AspCys: 1.484 ± 0.639
2.672AspAsp: 2.672 ± 0.551
2.771AspGlu: 2.771 ± 0.529
3.167AspPhe: 3.167 ± 0.802
4.651AspGly: 4.651 ± 1.1
0.297AspHis: 0.297 ± 0.161
2.771AspIle: 2.771 ± 1.129
2.078AspLys: 2.078 ± 0.795
5.245AspLeu: 5.245 ± 0.595
1.188AspMet: 1.188 ± 0.364
2.375AspAsn: 2.375 ± 0.603
3.068AspPro: 3.068 ± 0.579
1.484AspGln: 1.484 ± 0.31
2.078AspArg: 2.078 ± 1.041
2.969AspSer: 2.969 ± 0.34
2.177AspThr: 2.177 ± 0.804
4.651AspVal: 4.651 ± 1.486
0.891AspTrp: 0.891 ± 0.397
2.969AspTyr: 2.969 ± 0.688
0.0AspXaa: 0.0 ± 0.0
Glu
3.859GluAla: 3.859 ± 0.917
1.188GluCys: 1.188 ± 0.553
2.375GluAsp: 2.375 ± 0.726
1.979GluGlu: 1.979 ± 0.613
1.88GluPhe: 1.88 ± 0.546
1.583GluGly: 1.583 ± 0.641
1.188GluHis: 1.188 ± 0.338
1.88GluIle: 1.88 ± 0.71
1.682GluLys: 1.682 ± 0.607
4.057GluLeu: 4.057 ± 1.3
0.297GluMet: 0.297 ± 0.343
1.682GluAsn: 1.682 ± 0.668
2.276GluPro: 2.276 ± 0.43
1.979GluGln: 1.979 ± 0.751
1.682GluArg: 1.682 ± 0.558
2.177GluSer: 2.177 ± 0.55
2.177GluThr: 2.177 ± 0.398
2.672GluVal: 2.672 ± 0.668
0.792GluTrp: 0.792 ± 0.436
2.078GluTyr: 2.078 ± 0.563
0.0GluXaa: 0.0 ± 0.0
Phe
3.167PheAla: 3.167 ± 0.697
1.682PheCys: 1.682 ± 0.334
2.573PheAsp: 2.573 ± 0.626
2.474PheGlu: 2.474 ± 0.517
2.078PhePhe: 2.078 ± 0.88
2.672PheGly: 2.672 ± 0.904
0.792PheHis: 0.792 ± 0.56
2.771PheIle: 2.771 ± 0.644
3.068PheLys: 3.068 ± 0.641
2.969PheLeu: 2.969 ± 0.62
1.089PheMet: 1.089 ± 0.323
2.969PheAsn: 2.969 ± 0.614
1.286PhePro: 1.286 ± 0.591
1.484PheGln: 1.484 ± 0.64
1.385PheArg: 1.385 ± 0.311
3.958PheSer: 3.958 ± 1.308
3.859PheThr: 3.859 ± 1.168
6.333PheVal: 6.333 ± 1.281
0.495PheTrp: 0.495 ± 0.269
2.276PheTyr: 2.276 ± 0.38
0.0PheXaa: 0.0 ± 0.0
Gly
4.651GlyAla: 4.651 ± 0.817
1.979GlyCys: 1.979 ± 0.753
4.156GlyAsp: 4.156 ± 0.751
1.682GlyGlu: 1.682 ± 0.668
2.672GlyPhe: 2.672 ± 1.074
3.464GlyGly: 3.464 ± 0.804
1.188GlyHis: 1.188 ± 0.333
2.87GlyIle: 2.87 ± 0.438
3.068GlyLys: 3.068 ± 0.581
5.938GlyLeu: 5.938 ± 1.124
0.693GlyMet: 0.693 ± 0.715
2.87GlyAsn: 2.87 ± 0.679
1.979GlyPro: 1.979 ± 0.511
1.979GlyGln: 1.979 ± 0.493
1.583GlyArg: 1.583 ± 1.499
4.651GlySer: 4.651 ± 0.424
5.146GlyThr: 5.146 ± 1.231
4.948GlyVal: 4.948 ± 1.193
0.396GlyTrp: 0.396 ± 0.164
3.068GlyTyr: 3.068 ± 0.858
0.0GlyXaa: 0.0 ± 0.0
His
1.88HisAla: 1.88 ± 0.71
0.297HisCys: 0.297 ± 0.252
0.891HisAsp: 0.891 ± 0.526
0.693HisGlu: 0.693 ± 0.346
0.891HisPhe: 0.891 ± 0.417
1.188HisGly: 1.188 ± 0.562
0.198HisHis: 0.198 ± 0.295
1.583HisIle: 1.583 ± 0.82
1.089HisLys: 1.089 ± 0.279
1.979HisLeu: 1.979 ± 1.137
0.495HisMet: 0.495 ± 0.464
0.792HisAsn: 0.792 ± 0.534
0.792HisPro: 0.792 ± 0.691
0.99HisGln: 0.99 ± 0.275
0.99HisArg: 0.99 ± 0.49
1.286HisSer: 1.286 ± 0.399
1.484HisThr: 1.484 ± 0.462
1.88HisVal: 1.88 ± 0.478
0.396HisTrp: 0.396 ± 0.215
0.792HisTyr: 0.792 ± 0.33
0.0HisXaa: 0.0 ± 0.0
Ile
4.156IleAla: 4.156 ± 0.861
0.594IleCys: 0.594 ± 0.224
2.672IleAsp: 2.672 ± 0.886
1.188IleGlu: 1.188 ± 0.236
1.979IlePhe: 1.979 ± 0.705
2.87IleGly: 2.87 ± 0.578
0.495IleHis: 0.495 ± 0.248
2.177IleIle: 2.177 ± 0.922
2.474IleLys: 2.474 ± 0.66
4.057IleLeu: 4.057 ± 1.373
0.693IleMet: 0.693 ± 0.376
1.979IleAsn: 1.979 ± 0.706
2.771IlePro: 2.771 ± 0.923
1.089IleGln: 1.089 ± 0.742
2.276IleArg: 2.276 ± 0.433
3.068IleSer: 3.068 ± 1.226
2.969IleThr: 2.969 ± 0.604
5.047IleVal: 5.047 ± 1.29
0.198IleTrp: 0.198 ± 0.163
1.781IleTyr: 1.781 ± 0.308
0.0IleXaa: 0.0 ± 0.0
Lys
3.761LysAla: 3.761 ± 0.876
1.88LysCys: 1.88 ± 0.71
3.167LysAsp: 3.167 ± 0.915
2.078LysGlu: 2.078 ± 0.394
2.177LysPhe: 2.177 ± 0.607
3.068LysGly: 3.068 ± 1.204
2.177LysHis: 2.177 ± 1.183
2.276LysIle: 2.276 ± 0.516
2.375LysLys: 2.375 ± 0.678
5.938LysLeu: 5.938 ± 0.835
1.682LysMet: 1.682 ± 0.802
2.078LysAsn: 2.078 ± 0.543
2.969LysPro: 2.969 ± 0.87
2.969LysGln: 2.969 ± 0.808
1.979LysArg: 1.979 ± 0.623
3.266LysSer: 3.266 ± 0.859
1.88LysThr: 1.88 ± 0.406
4.057LysVal: 4.057 ± 0.655
0.495LysTrp: 0.495 ± 0.392
2.474LysTyr: 2.474 ± 0.978
0.0LysXaa: 0.0 ± 0.0
Leu
7.323LeuAla: 7.323 ± 0.758
3.662LeuCys: 3.662 ± 0.758
4.057LeuAsp: 4.057 ± 0.593
3.761LeuGlu: 3.761 ± 1.291
4.651LeuPhe: 4.651 ± 0.393
4.255LeuGly: 4.255 ± 1.066
2.276LeuHis: 2.276 ± 1.35
3.464LeuIle: 3.464 ± 1.252
5.146LeuLys: 5.146 ± 0.753
8.808LeuLeu: 8.808 ± 1.352
1.88LeuMet: 1.88 ± 0.954
5.542LeuAsn: 5.542 ± 0.576
4.255LeuPro: 4.255 ± 2.278
4.057LeuGln: 4.057 ± 1.245
3.662LeuArg: 3.662 ± 0.587
7.125LeuSer: 7.125 ± 1.956
6.63LeuThr: 6.63 ± 1.115
6.828LeuVal: 6.828 ± 1.002
1.484LeuTrp: 1.484 ± 0.57
4.354LeuTyr: 4.354 ± 1.019
0.0LeuXaa: 0.0 ± 0.0
Met
1.385MetAla: 1.385 ± 0.891
1.089MetCys: 1.089 ± 0.591
0.792MetAsp: 0.792 ± 0.482
1.385MetGlu: 1.385 ± 0.691
0.792MetPhe: 0.792 ± 0.31
0.891MetGly: 0.891 ± 0.232
0.891MetHis: 0.891 ± 0.397
0.396MetIle: 0.396 ± 0.215
0.495MetLys: 0.495 ± 0.393
3.167MetLeu: 3.167 ± 1.063
0.495MetMet: 0.495 ± 0.188
0.891MetAsn: 0.891 ± 0.358
0.891MetPro: 0.891 ± 0.421
1.484MetGln: 1.484 ± 0.689
1.286MetArg: 1.286 ± 0.87
0.891MetSer: 0.891 ± 0.305
1.88MetThr: 1.88 ± 0.332
1.484MetVal: 1.484 ± 0.563
0.297MetTrp: 0.297 ± 0.29
0.99MetTyr: 0.99 ± 0.409
0.0MetXaa: 0.0 ± 0.0
Asn
4.453AsnAla: 4.453 ± 0.908
1.583AsnCys: 1.583 ± 0.527
1.88AsnAsp: 1.88 ± 0.616
2.078AsnGlu: 2.078 ± 0.606
3.365AsnPhe: 3.365 ± 0.632
3.958AsnGly: 3.958 ± 1.504
0.792AsnHis: 0.792 ± 0.226
1.583AsnIle: 1.583 ± 0.552
3.068AsnLys: 3.068 ± 0.779
4.156AsnLeu: 4.156 ± 1.454
2.078AsnMet: 2.078 ± 0.458
2.672AsnAsn: 2.672 ± 0.902
1.88AsnPro: 1.88 ± 0.447
1.484AsnGln: 1.484 ± 0.554
1.583AsnArg: 1.583 ± 0.539
4.354AsnSer: 4.354 ± 0.718
2.672AsnThr: 2.672 ± 0.844
3.859AsnVal: 3.859 ± 1.249
0.396AsnTrp: 0.396 ± 0.164
3.068AsnTyr: 3.068 ± 0.826
0.0AsnXaa: 0.0 ± 0.0
Pro
3.761ProAla: 3.761 ± 1.2
0.99ProCys: 0.99 ± 0.377
2.573ProAsp: 2.573 ± 0.443
2.276ProGlu: 2.276 ± 0.593
1.484ProPhe: 1.484 ± 0.399
2.87ProGly: 2.87 ± 1.477
1.089ProHis: 1.089 ± 0.614
2.474ProIle: 2.474 ± 1.151
2.375ProLys: 2.375 ± 0.862
4.651ProLeu: 4.651 ± 1.213
0.99ProMet: 0.99 ± 0.627
2.375ProAsn: 2.375 ± 1.083
2.474ProPro: 2.474 ± 1.214
1.188ProGln: 1.188 ± 0.644
2.177ProArg: 2.177 ± 0.971
2.771ProSer: 2.771 ± 1.109
2.87ProThr: 2.87 ± 0.712
3.761ProVal: 3.761 ± 0.94
0.297ProTrp: 0.297 ± 0.154
1.88ProTyr: 1.88 ± 0.502
0.0ProXaa: 0.0 ± 0.0
Gln
2.672GlnAla: 2.672 ± 0.414
1.188GlnCys: 1.188 ± 0.576
2.177GlnAsp: 2.177 ± 0.724
1.979GlnGlu: 1.979 ± 1.126
1.484GlnPhe: 1.484 ± 0.795
2.969GlnGly: 2.969 ± 1.161
0.297GlnHis: 0.297 ± 0.423
1.682GlnIle: 1.682 ± 0.678
1.583GlnLys: 1.583 ± 0.568
4.651GlnLeu: 4.651 ± 0.741
0.891GlnMet: 0.891 ± 0.332
1.286GlnAsn: 1.286 ± 0.407
1.979GlnPro: 1.979 ± 0.489
1.979GlnGln: 1.979 ± 0.92
0.99GlnArg: 0.99 ± 0.598
2.474GlnSer: 2.474 ± 1.085
2.87GlnThr: 2.87 ± 0.738
2.771GlnVal: 2.771 ± 0.837
0.495GlnTrp: 0.495 ± 0.311
1.682GlnTyr: 1.682 ± 0.735
0.0GlnXaa: 0.0 ± 0.0
Arg
3.662ArgAla: 3.662 ± 1.318
1.188ArgCys: 1.188 ± 0.692
2.474ArgAsp: 2.474 ± 0.678
1.385ArgGlu: 1.385 ± 0.53
1.979ArgPhe: 1.979 ± 0.462
1.88ArgGly: 1.88 ± 0.944
1.188ArgHis: 1.188 ± 0.519
1.88ArgIle: 1.88 ± 0.478
1.781ArgLys: 1.781 ± 0.473
3.266ArgLeu: 3.266 ± 0.795
0.693ArgMet: 0.693 ± 0.494
1.88ArgAsn: 1.88 ± 0.335
1.979ArgPro: 1.979 ± 1.066
1.484ArgGln: 1.484 ± 0.486
1.385ArgArg: 1.385 ± 0.53
4.057ArgSer: 4.057 ± 2.139
1.88ArgThr: 1.88 ± 0.662
2.87ArgVal: 2.87 ± 0.958
0.297ArgTrp: 0.297 ± 0.565
1.385ArgTyr: 1.385 ± 0.624
0.0ArgXaa: 0.0 ± 0.0
Ser
5.74SerAla: 5.74 ± 1.12
1.88SerCys: 1.88 ± 0.629
4.057SerAsp: 4.057 ± 1.206
2.573SerGlu: 2.573 ± 0.522
4.453SerPhe: 4.453 ± 1.435
4.354SerGly: 4.354 ± 0.798
1.682SerHis: 1.682 ± 0.822
2.969SerIle: 2.969 ± 0.918
3.464SerLys: 3.464 ± 0.488
7.224SerLeu: 7.224 ± 1.97
1.484SerMet: 1.484 ± 0.693
2.771SerAsn: 2.771 ± 1.065
2.474SerPro: 2.474 ± 0.687
2.771SerGln: 2.771 ± 0.686
3.167SerArg: 3.167 ± 1.777
5.938SerSer: 5.938 ± 2.982
4.354SerThr: 4.354 ± 0.693
6.828SerVal: 6.828 ± 1.285
1.089SerTrp: 1.089 ± 0.517
3.365SerTyr: 3.365 ± 0.721
0.0SerXaa: 0.0 ± 0.0
Thr
4.75ThrAla: 4.75 ± 1.028
1.682ThrCys: 1.682 ± 0.564
2.177ThrAsp: 2.177 ± 0.349
2.375ThrGlu: 2.375 ± 0.499
3.563ThrPhe: 3.563 ± 1.097
5.344ThrGly: 5.344 ± 1.568
1.781ThrHis: 1.781 ± 0.496
3.068ThrIle: 3.068 ± 0.601
3.563ThrLys: 3.563 ± 0.611
5.839ThrLeu: 5.839 ± 1.725
1.286ThrMet: 1.286 ± 0.487
3.365ThrAsn: 3.365 ± 0.858
3.266ThrPro: 3.266 ± 0.556
2.474ThrGln: 2.474 ± 1.415
2.474ThrArg: 2.474 ± 1.101
4.651ThrSer: 4.651 ± 0.841
4.255ThrThr: 4.255 ± 0.743
5.542ThrVal: 5.542 ± 1.023
0.495ThrTrp: 0.495 ± 0.248
2.771ThrTyr: 2.771 ± 0.683
0.0ThrXaa: 0.0 ± 0.0
Val
6.63ValAla: 6.63 ± 1.27
3.068ValCys: 3.068 ± 0.975
5.74ValAsp: 5.74 ± 1.674
3.761ValGlu: 3.761 ± 1.415
4.057ValPhe: 4.057 ± 1.061
4.453ValGly: 4.453 ± 1.061
1.188ValHis: 1.188 ± 0.236
2.969ValIle: 2.969 ± 0.534
5.938ValLys: 5.938 ± 1.448
7.422ValLeu: 7.422 ± 1.849
1.484ValMet: 1.484 ± 0.603
4.354ValAsn: 4.354 ± 0.638
4.057ValPro: 4.057 ± 0.926
4.255ValGln: 4.255 ± 0.888
3.167ValArg: 3.167 ± 0.678
7.125ValSer: 7.125 ± 0.757
5.641ValThr: 5.641 ± 0.915
7.323ValVal: 7.323 ± 1.662
0.99ValTrp: 0.99 ± 0.461
3.365ValTyr: 3.365 ± 0.841
0.0ValXaa: 0.0 ± 0.0
Trp
0.792TrpAla: 0.792 ± 0.31
0.495TrpCys: 0.495 ± 0.269
0.693TrpAsp: 0.693 ± 0.265
0.297TrpGlu: 0.297 ± 0.161
0.891TrpPhe: 0.891 ± 0.251
0.198TrpGly: 0.198 ± 0.108
0.396TrpHis: 0.396 ± 0.267
0.198TrpIle: 0.198 ± 0.304
0.495TrpLys: 0.495 ± 0.269
1.385TrpLeu: 1.385 ± 0.74
0.297TrpMet: 0.297 ± 0.426
0.594TrpAsn: 0.594 ± 0.308
0.693TrpPro: 0.693 ± 0.445
0.297TrpGln: 0.297 ± 0.941
0.495TrpArg: 0.495 ± 0.333
0.792TrpSer: 0.792 ± 0.573
0.495TrpThr: 0.495 ± 0.188
0.99TrpVal: 0.99 ± 0.293
0.198TrpTrp: 0.198 ± 0.304
0.495TrpTyr: 0.495 ± 0.352
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.068TyrAla: 3.068 ± 1.139
1.385TyrCys: 1.385 ± 0.53
3.167TyrAsp: 3.167 ± 1.039
1.583TyrGlu: 1.583 ± 0.655
3.167TyrPhe: 3.167 ± 0.323
2.177TyrGly: 2.177 ± 0.888
1.089TyrHis: 1.089 ± 0.315
2.177TyrIle: 2.177 ± 1.027
3.365TyrLys: 3.365 ± 1.036
3.167TyrLeu: 3.167 ± 0.468
0.99TyrMet: 0.99 ± 0.254
2.573TyrAsn: 2.573 ± 0.532
2.078TyrPro: 2.078 ± 0.595
0.99TyrGln: 0.99 ± 0.616
1.583TyrArg: 1.583 ± 0.593
3.464TyrSer: 3.464 ± 0.927
3.563TyrThr: 3.563 ± 0.573
4.948TyrVal: 4.948 ± 0.825
0.198TyrTrp: 0.198 ± 0.2
2.573TyrTyr: 2.573 ± 0.801
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (10106 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
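
A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the standard deviation of the replicates is reported as the error. The exact procedure used by the database is not described on this page, and the function names and toy sequences are hypothetical.

```python
import random
import statistics


def frequency_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) over overlapping pairs."""
    pairs = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == dipeptide
    )
    return 1000.0 * hits / pairs if pairs else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteins = ["MALLV", "LLA", "AAAL"]  # hypothetical sequences
mean = frequency_per_mille(toy_proteins, "LL")
error = bootstrap_error(toy_proteins, "LL")
print(f"LeuLeu: {mean:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.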

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski