Amino acid dipeptide frequency for BtNv-AlphaCoV/SC2013

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
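The calculation described above can be sketched in Python. This is a minimal illustration and not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, and it assumes that dipeptides are counted as overlapping residue pairs within each protein and that the "expected" value is the uniform 1/400 baseline mentioned in the text.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are collapsed to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs are counted; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform baseline: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400

def classify(freq, expected=UNIFORM_EXPECTATION):
    """Label a per mille frequency relative to the uniform baseline."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"
```

Under these assumptions, a table value of 5.292 would be labelled overrepresented and a value of 0.676 underrepresented.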

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 5.292‰ = 0.005292).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.292 ± 0.958
AlaCys: 2.477 ± 1.127
AlaAsp: 2.928 ± 1.241
AlaGlu: 1.802 ± 0.601
AlaPhe: 4.729 ± 0.954
AlaGly: 3.828 ± 0.977
AlaHis: 0.676 ± 0.489
AlaIle: 4.391 ± 2.102
AlaLys: 2.702 ± 0.95
AlaLeu: 6.418 ± 0.746
AlaMet: 1.689 ± 0.389
AlaAsn: 3.265 ± 0.895
AlaPro: 2.365 ± 0.486
AlaGln: 1.351 ± 0.566
AlaArg: 3.04 ± 0.639
AlaSer: 4.391 ± 0.756
AlaThr: 4.391 ± 1.147
AlaVal: 7.319 ± 1.253
AlaTrp: 0.788 ± 1.611
AlaTyr: 3.265 ± 0.799
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.365 ± 0.699
CysCys: 1.239 ± 0.595
CysAsp: 2.928 ± 0.997
CysGlu: 0.676 ± 0.362
CysPhe: 2.027 ± 0.715
CysGly: 2.477 ± 0.874
CysHis: 0.676 ± 0.86
CysIle: 1.802 ± 0.957
CysLys: 1.689 ± 0.905
CysLeu: 2.59 ± 0.803
CysMet: 0.45 ± 0.241
CysAsn: 2.027 ± 0.763
CysPro: 0.901 ± 0.31
CysGln: 0.901 ± 0.31
CysArg: 1.689 ± 0.888
CysSer: 1.914 ± 0.632
CysThr: 2.139 ± 0.724
CysVal: 3.716 ± 1.124
CysTrp: 0.676 ± 0.362
CysTyr: 1.576 ± 0.845
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.828 ± 0.867
AspCys: 1.802 ± 0.59
AspAsp: 2.139 ± 0.461
AspGlu: 2.365 ± 0.959
AspPhe: 5.743 ± 1.317
AspGly: 5.292 ± 1.302
AspHis: 1.126 ± 0.292
AspIle: 2.928 ± 0.997
AspLys: 1.576 ± 0.336
AspLeu: 3.716 ± 0.865
AspMet: 0.901 ± 0.327
AspAsn: 3.04 ± 0.978
AspPro: 1.802 ± 0.813
AspGln: 1.239 ± 0.922
AspArg: 2.139 ± 0.881
AspSer: 3.265 ± 0.969
AspThr: 2.702 ± 0.568
AspVal: 5.968 ± 0.696
AspTrp: 0.788 ± 0.422
AspTyr: 3.265 ± 0.69
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.252 ± 0.512
GluCys: 1.126 ± 0.438
GluAsp: 2.027 ± 0.515
GluGlu: 1.464 ± 0.969
GluPhe: 2.477 ± 0.859
GluGly: 3.378 ± 0.662
GluHis: 1.126 ± 0.49
GluIle: 1.239 ± 0.555
GluLys: 1.689 ± 0.374
GluLeu: 3.378 ± 0.615
GluMet: 0.113 ± 0.06
GluAsn: 2.252 ± 0.506
GluPro: 1.802 ± 0.414
GluGln: 0.563 ± 0.314
GluArg: 1.802 ± 0.791
GluSer: 1.802 ± 0.654
GluThr: 2.252 ± 0.667
GluVal: 4.054 ± 1.31
GluTrp: 0.563 ± 0.184
GluTyr: 1.351 ± 0.768
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.603 ± 1.292
PheCys: 2.027 ± 0.907
PheAsp: 4.504 ± 1.164
PheGlu: 2.252 ± 1.027
PhePhe: 1.802 ± 0.535
PheGly: 3.603 ± 1.355
PheHis: 0.676 ± 0.371
PheIle: 2.59 ± 1.57
PheLys: 4.054 ± 1.063
PheLeu: 3.941 ± 0.99
PheMet: 1.351 ± 0.801
PheAsn: 3.941 ± 1.231
PhePro: 0.45 ± 0.155
PheGln: 0.788 ± 0.46
PheArg: 0.788 ± 0.348
PheSer: 5.18 ± 0.895
PheThr: 3.378 ± 1.037
PheVal: 7.319 ± 2.507
PheTrp: 1.126 ± 0.444
PheTyr: 3.265 ± 0.865
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.279 ± 0.825
GlyCys: 2.365 ± 0.774
GlyAsp: 4.391 ± 1.353
GlyGlu: 2.477 ± 0.745
GlyPhe: 4.617 ± 0.826
GlyGly: 4.391 ± 0.822
GlyHis: 0.901 ± 0.65
GlyIle: 2.59 ± 1.164
GlyLys: 3.491 ± 0.995
GlyLeu: 5.405 ± 0.6
GlyMet: 1.239 ± 0.503
GlyAsn: 3.603 ± 1.101
GlyPro: 1.802 ± 0.581
GlyGln: 1.802 ± 1.348
GlyArg: 2.365 ± 2.191
GlySer: 5.517 ± 0.957
GlyThr: 3.378 ± 0.877
GlyVal: 9.796 ± 2.378
GlyTrp: 0.676 ± 0.415
GlyTyr: 3.265 ± 0.375
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.464 ± 0.305
HisCys: 0.563 ± 0.302
HisAsp: 0.901 ± 0.327
HisGlu: 0.901 ± 0.378
HisPhe: 1.239 ± 1.202
HisGly: 0.788 ± 0.422
HisHis: 0.338 ± 0.147
HisIle: 0.45 ± 0.493
HisLys: 1.013 ± 0.415
HisLeu: 2.252 ± 0.596
HisMet: 0.338 ± 0.147
HisAsn: 1.239 ± 0.83
HisPro: 0.676 ± 0.86
HisGln: 0.788 ± 0.85
HisArg: 0.45 ± 0.241
HisSer: 1.351 ± 0.39
HisThr: 1.464 ± 0.587
HisVal: 2.59 ± 0.806
HisTrp: 0.225 ± 0.121
HisTyr: 0.563 ± 0.3
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.252 ± 1.227
IleCys: 0.788 ± 0.465
IleAsp: 2.477 ± 1.053
IleGlu: 1.464 ± 0.535
IlePhe: 2.477 ± 1.034
IleGly: 3.378 ± 0.778
IleHis: 0.45 ± 0.155
IleIle: 2.139 ± 1.95
IleLys: 2.928 ± 0.965
IleLeu: 3.265 ± 1.73
IleMet: 0.788 ± 0.869
IleAsn: 3.153 ± 1.563
IlePro: 2.252 ± 0.697
IleGln: 2.027 ± 1.364
IleArg: 1.013 ± 0.382
IleSer: 3.491 ± 1.425
IleThr: 4.166 ± 1.883
IleVal: 4.279 ± 0.938
IleTrp: 0.45 ± 0.241
IleTyr: 1.239 ± 1.31
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.603 ± 0.576
LysCys: 1.576 ± 0.67
LysAsp: 3.04 ± 0.919
LysGlu: 2.139 ± 0.732
LysPhe: 2.815 ± 1.079
LysGly: 3.941 ± 1.024
LysHis: 1.914 ± 1.026
LysIle: 1.914 ± 0.737
LysLys: 1.351 ± 0.724
LysLeu: 4.279 ± 1.044
LysMet: 1.239 ± 0.618
LysAsn: 1.464 ± 0.411
LysPro: 3.04 ± 1.477
LysGln: 1.914 ± 0.639
LysArg: 1.914 ± 0.403
LysSer: 4.166 ± 1.436
LysThr: 2.702 ± 0.744
LysVal: 3.828 ± 0.721
LysTrp: 0.676 ± 0.273
LysTyr: 2.139 ± 0.694
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.743 ± 0.764
LeuCys: 3.265 ± 1.08
LeuAsp: 4.166 ± 1.312
LeuGlu: 3.603 ± 1.181
LeuPhe: 4.279 ± 1.678
LeuGly: 5.405 ± 0.884
LeuHis: 2.365 ± 0.476
LeuIle: 2.702 ± 1.58
LeuLys: 4.279 ± 1.543
LeuLeu: 9.233 ± 2.534
LeuMet: 1.351 ± 0.829
LeuAsn: 5.292 ± 1.468
LeuPro: 3.378 ± 1.804
LeuGln: 4.054 ± 0.969
LeuArg: 3.716 ± 0.6
LeuSer: 6.531 ± 2.791
LeuThr: 5.517 ± 0.906
LeuVal: 5.743 ± 1.905
LeuTrp: 1.464 ± 1.9
LeuTyr: 5.517 ± 0.955
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.013 ± 0.461
MetCys: 1.126 ± 0.603
MetAsp: 0.788 ± 0.274
MetGlu: 0.676 ± 0.326
MetPhe: 1.239 ± 0.578
MetGly: 1.576 ± 0.51
MetHis: 0.113 ± 0.06
MetIle: 0.901 ± 0.483
MetLys: 0.225 ± 0.163
MetLeu: 2.027 ± 0.969
MetMet: 0.45 ± 0.241
MetAsn: 0.563 ± 0.302
MetPro: 1.464 ± 0.527
MetGln: 0.788 ± 0.296
MetArg: 1.013 ± 0.818
MetSer: 1.802 ± 0.429
MetThr: 1.464 ± 1.029
MetVal: 1.351 ± 0.367
MetTrp: 0.113 ± 0.06
MetTyr: 1.576 ± 0.45
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.716 ± 1.887
AsnCys: 2.365 ± 0.852
AsnAsp: 1.914 ± 0.883
AsnGlu: 2.027 ± 0.235
AsnPhe: 2.252 ± 0.994
AsnGly: 5.517 ± 1.439
AsnHis: 0.788 ± 0.296
AsnIle: 3.04 ± 1.245
AsnLys: 3.378 ± 0.698
AsnLeu: 3.265 ± 0.861
AsnMet: 1.239 ± 0.495
AsnAsn: 2.928 ± 1.054
AsnPro: 1.239 ± 0.42
AsnGln: 1.576 ± 1.878
AsnArg: 2.365 ± 1.553
AsnSer: 4.279 ± 1.317
AsnThr: 3.603 ± 0.383
AsnVal: 7.544 ± 1.131
AsnTrp: 0.563 ± 1.001
AsnTyr: 1.464 ± 0.499
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.928 ± 0.565
ProCys: 1.013 ± 0.382
ProAsp: 1.464 ± 0.363
ProGlu: 1.126 ± 0.332
ProPhe: 2.027 ± 0.696
ProGly: 2.59 ± 0.593
ProHis: 1.802 ± 1.916
ProIle: 2.477 ± 0.758
ProLys: 1.689 ± 1.329
ProLeu: 3.716 ± 1.091
ProMet: 0.338 ± 0.355
ProAsn: 1.013 ± 0.582
ProPro: 1.914 ± 0.516
ProGln: 1.013 ± 0.634
ProArg: 1.576 ± 0.515
ProSer: 2.477 ± 0.868
ProThr: 2.59 ± 1.341
ProVal: 3.265 ± 2.196
ProTrp: 0.338 ± 0.147
ProTyr: 1.013 ± 0.335
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.351 ± 0.284
GlnCys: 0.563 ± 0.304
GlnAsp: 1.351 ± 0.546
GlnGlu: 1.126 ± 0.431
GlnPhe: 0.788 ± 0.296
GlnGly: 1.914 ± 1.184
GlnHis: 0.788 ± 0.296
GlnIle: 1.351 ± 0.797
GlnLys: 1.576 ± 0.467
GlnLeu: 3.716 ± 1.509
GlnMet: 0.788 ± 0.284
GlnAsn: 1.689 ± 1.843
GlnPro: 2.027 ± 1.03
GlnGln: 1.126 ± 1.483
GlnArg: 1.914 ± 1.002
GlnSer: 2.139 ± 1.145
GlnThr: 1.802 ± 0.661
GlnVal: 2.027 ± 1.121
GlnTrp: 0.225 ± 0.121
GlnTyr: 1.576 ± 0.578
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.702 ± 0.767
ArgCys: 1.351 ± 0.588
ArgAsp: 1.576 ± 0.749
ArgGlu: 1.239 ± 0.42
ArgPhe: 2.702 ± 1.161
ArgGly: 2.252 ± 1.761
ArgHis: 0.563 ± 0.302
ArgIle: 1.464 ± 0.804
ArgLys: 1.576 ± 0.654
ArgLeu: 4.617 ± 0.953
ArgMet: 1.464 ± 0.412
ArgAsn: 3.153 ± 2.012
ArgPro: 0.901 ± 0.378
ArgGln: 1.351 ± 1.008
ArgArg: 1.126 ± 0.732
ArgSer: 2.702 ± 3.321
ArgThr: 2.59 ± 0.784
ArgVal: 3.04 ± 0.73
ArgTrp: 0.676 ± 0.273
ArgTyr: 1.351 ± 0.466
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.517 ± 1.027
SerCys: 2.59 ± 1.028
SerAsp: 5.18 ± 1.678
SerGlu: 2.477 ± 0.496
SerPhe: 4.054 ± 0.489
SerGly: 4.617 ± 1.335
SerHis: 1.126 ± 0.603
SerIle: 3.153 ± 1.661
SerLys: 4.842 ± 1.046
SerLeu: 4.842 ± 1.207
SerMet: 1.351 ± 0.451
SerAsn: 2.928 ± 1.809
SerPro: 1.689 ± 0.468
SerGln: 2.928 ± 2.987
SerArg: 2.365 ± 2.217
SerSer: 5.743 ± 0.904
SerThr: 5.18 ± 0.724
SerVal: 8.895 ± 2.246
SerTrp: 0.901 ± 0.327
SerTyr: 2.928 ± 0.713
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.941 ± 0.388
ThrCys: 1.126 ± 0.39
ThrAsp: 3.153 ± 0.963
ThrGlu: 2.477 ± 0.975
ThrPhe: 3.603 ± 0.563
ThrGly: 4.504 ± 2.24
ThrHis: 1.351 ± 0.466
ThrIle: 2.815 ± 0.602
ThrLys: 1.914 ± 0.403
ThrLeu: 6.643 ± 1.415
ThrMet: 1.576 ± 0.45
ThrAsn: 2.815 ± 0.833
ThrPro: 2.815 ± 0.935
ThrGln: 1.351 ± 0.377
ThrArg: 2.139 ± 0.871
ThrSer: 5.067 ± 0.919
ThrThr: 4.166 ± 0.83
ThrVal: 6.643 ± 1.456
ThrTrp: 0.225 ± 0.121
ThrTyr: 3.603 ± 0.927
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.418 ± 0.873
ValCys: 5.067 ± 0.9
ValAsp: 7.319 ± 1.716
ValGlu: 4.166 ± 1.059
ValPhe: 5.067 ± 0.841
ValGly: 5.517 ± 1.166
ValHis: 1.576 ± 0.88
ValIle: 3.828 ± 0.722
ValLys: 6.306 ± 1.523
ValLeu: 10.021 ± 3.491
ValMet: 2.139 ± 1.214
ValAsn: 6.531 ± 1.369
ValPro: 3.828 ± 1.423
ValGln: 3.04 ± 1.145
ValArg: 4.391 ± 1.904
ValSer: 7.995 ± 2.001
ValThr: 5.292 ± 0.93
ValVal: 11.147 ± 2.502
ValTrp: 1.464 ± 0.305
ValTyr: 3.828 ± 1.052
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.239 ± 1.268
TrpCys: 0.338 ± 0.147
TrpAsp: 0.788 ± 0.422
TrpGlu: 0.338 ± 0.181
TrpPhe: 0.45 ± 0.155
TrpGly: 0.45 ± 0.241
TrpHis: 0.338 ± 0.378
TrpIle: 0.563 ± 0.416
TrpLys: 0.45 ± 0.314
TrpLeu: 1.576 ± 0.548
TrpMet: 0.225 ± 0.121
TrpAsn: 0.901 ± 1.015
TrpPro: 0.563 ± 0.509
TrpGln: 0.113 ± 0.06
TrpArg: 0.676 ± 0.326
TrpSer: 0.788 ± 0.274
TrpThr: 0.676 ± 0.488
TrpVal: 1.351 ± 1.718
TrpTrp: 0.338 ± 0.147
TrpTyr: 1.013 ± 0.349
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.603 ± 0.672
TyrCys: 1.689 ± 0.6
TyrAsp: 2.59 ± 1.206
TyrGlu: 1.914 ± 0.848
TyrPhe: 2.252 ± 1.199
TyrGly: 2.928 ± 0.688
TyrHis: 0.901 ± 0.41
TyrIle: 2.139 ± 0.957
TyrLys: 3.04 ± 0.868
TyrLeu: 2.702 ± 0.44
TyrMet: 1.126 ± 0.457
TyrAsn: 3.265 ± 1.083
TyrPro: 1.464 ± 0.527
TyrGln: 1.126 ± 0.618
TyrArg: 1.914 ± 1.038
TyrSer: 2.702 ± 0.346
TyrThr: 2.477 ± 0.731
TyrVal: 5.067 ± 1.181
TyrTrp: 0.901 ± 0.378
TyrTyr: 2.702 ± 0.657
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (8882 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
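A protein-level bootstrap of this kind can be sketched as follows. The page does not describe the exact procedure, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation of the replicates is taken as the error. The function names `pair_permille` and `bootstrap_error`, the `seed` parameter, and the use of the sample standard deviation are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    # Assumption: the reported error is the spread (standard deviation) of the replicates.
    return statistics.stdev(estimates)
```

With only 6 proteins in the set, each replicate can differ considerably from the original, which is consistent with some errors in the table being larger than the values they accompany.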

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski