Amino acid dipepetide frequency for BtMf-AlphaCoV/HeN2013

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.544AlaAla: 6.544 ± 1.205
2.533AlaCys: 2.533 ± 0.69
2.955AlaAsp: 2.955 ± 0.905
2.533AlaGlu: 2.533 ± 0.48
4.644AlaPhe: 4.644 ± 0.798
3.483AlaGly: 3.483 ± 0.822
1.689AlaHis: 1.689 ± 0.333
4.538AlaIle: 4.538 ± 0.998
3.061AlaLys: 3.061 ± 0.87
6.966AlaLeu: 6.966 ± 0.619
1.478AlaMet: 1.478 ± 0.352
4.116AlaAsn: 4.116 ± 0.816
2.744AlaPro: 2.744 ± 1.288
1.372AlaGln: 1.372 ± 0.548
3.377AlaArg: 3.377 ± 0.726
5.172AlaSer: 5.172 ± 0.898
3.799AlaThr: 3.799 ± 1.001
7.282AlaVal: 7.282 ± 1.04
0.844AlaTrp: 0.844 ± 0.401
3.799AlaTyr: 3.799 ± 0.986
0.0AlaXaa: 0.0 ± 0.0
Cys
2.111CysAla: 2.111 ± 0.613
0.95CysCys: 0.95 ± 0.484
1.9CysAsp: 1.9 ± 1.029
0.633CysGlu: 0.633 ± 0.208
2.639CysPhe: 2.639 ± 0.616
2.639CysGly: 2.639 ± 0.818
0.528CysHis: 0.528 ± 0.393
0.844CysIle: 0.844 ± 0.752
2.639CysLys: 2.639 ± 0.874
2.322CysLeu: 2.322 ± 0.417
0.317CysMet: 0.317 ± 0.372
2.111CysAsn: 2.111 ± 0.676
0.844CysPro: 0.844 ± 0.263
1.161CysGln: 1.161 ± 0.308
1.372CysArg: 1.372 ± 0.395
2.322CysSer: 2.322 ± 0.68
2.533CysThr: 2.533 ± 1.035
3.272CysVal: 3.272 ± 0.685
0.633CysTrp: 0.633 ± 0.343
1.689CysTyr: 1.689 ± 0.914
0.0CysXaa: 0.0 ± 0.0
Asp
4.433AspAla: 4.433 ± 1.129
1.266AspCys: 1.266 ± 0.517
2.111AspAsp: 2.111 ± 0.793
1.9AspGlu: 1.9 ± 0.687
4.222AspPhe: 4.222 ± 0.919
6.121AspGly: 6.121 ± 1.759
0.95AspHis: 0.95 ± 0.362
2.322AspIle: 2.322 ± 0.757
2.744AspLys: 2.744 ± 0.574
5.383AspLeu: 5.383 ± 0.691
0.95AspMet: 0.95 ± 0.209
2.639AspAsn: 2.639 ± 0.467
2.111AspPro: 2.111 ± 0.524
1.055AspGln: 1.055 ± 0.357
1.055AspArg: 1.055 ± 0.338
3.061AspSer: 3.061 ± 0.284
2.005AspThr: 2.005 ± 0.428
6.544AspVal: 6.544 ± 1.373
0.844AspTrp: 0.844 ± 0.373
2.533AspTyr: 2.533 ± 0.57
0.0AspXaa: 0.0 ± 0.0
Glu
2.216GluAla: 2.216 ± 0.871
1.794GluCys: 1.794 ± 0.581
1.689GluAsp: 1.689 ± 0.259
1.478GluGlu: 1.478 ± 0.484
3.166GluPhe: 3.166 ± 1.09
2.955GluGly: 2.955 ± 0.824
1.583GluHis: 1.583 ± 1.046
1.689GluIle: 1.689 ± 0.638
1.689GluLys: 1.689 ± 0.464
3.483GluLeu: 3.483 ± 1.967
0.528GluMet: 0.528 ± 0.248
1.583GluAsn: 1.583 ± 0.325
2.533GluPro: 2.533 ± 0.36
1.372GluGln: 1.372 ± 0.59
2.005GluArg: 2.005 ± 0.504
1.583GluSer: 1.583 ± 0.486
2.322GluThr: 2.322 ± 0.359
3.166GluVal: 3.166 ± 0.681
0.528GluTrp: 0.528 ± 0.361
1.266GluTyr: 1.266 ± 0.415
0.0GluXaa: 0.0 ± 0.0
Phe
4.222PheAla: 4.222 ± 1.103
1.689PheCys: 1.689 ± 0.606
4.116PheAsp: 4.116 ± 0.688
2.955PheGlu: 2.955 ± 0.699
2.955PhePhe: 2.955 ± 0.714
4.116PheGly: 4.116 ± 0.967
0.633PheHis: 0.633 ± 0.285
2.955PheIle: 2.955 ± 0.882
3.905PheLys: 3.905 ± 1.259
3.799PheLeu: 3.799 ± 0.984
1.266PheMet: 1.266 ± 0.551
3.799PheAsn: 3.799 ± 1.153
0.739PhePro: 0.739 ± 0.446
1.266PheGln: 1.266 ± 0.636
1.478PheArg: 1.478 ± 0.732
3.905PheSer: 3.905 ± 1.433
2.744PheThr: 2.744 ± 0.647
7.177PheVal: 7.177 ± 1.616
1.161PheTrp: 1.161 ± 0.498
3.166PheTyr: 3.166 ± 0.422
0.0PheXaa: 0.0 ± 0.0
Gly
4.222GlyAla: 4.222 ± 0.805
2.216GlyCys: 2.216 ± 0.734
5.277GlyAsp: 5.277 ± 1.429
2.005GlyGlu: 2.005 ± 0.407
4.327GlyPhe: 4.327 ± 0.829
5.383GlyGly: 5.383 ± 0.991
0.528GlyHis: 0.528 ± 0.169
3.166GlyIle: 3.166 ± 0.736
3.905GlyLys: 3.905 ± 1.658
5.172GlyLeu: 5.172 ± 0.679
1.161GlyMet: 1.161 ± 0.577
3.799GlyAsn: 3.799 ± 2.102
2.111GlyPro: 2.111 ± 0.522
1.266GlyGln: 1.266 ± 0.497
1.583GlyArg: 1.583 ± 0.426
6.332GlySer: 6.332 ± 1.081
3.905GlyThr: 3.905 ± 0.586
8.76GlyVal: 8.76 ± 1.527
0.528GlyTrp: 0.528 ± 0.247
2.955GlyTyr: 2.955 ± 0.714
0.0GlyXaa: 0.0 ± 0.0
His
1.794HisAla: 1.794 ± 0.569
0.739HisCys: 0.739 ± 0.348
0.528HisAsp: 0.528 ± 0.327
1.055HisGlu: 1.055 ± 0.611
0.95HisPhe: 0.95 ± 0.609
0.95HisGly: 0.95 ± 0.417
0.106HisHis: 0.106 ± 0.057
0.633HisIle: 0.633 ± 0.208
0.95HisLys: 0.95 ± 0.383
1.583HisLeu: 1.583 ± 0.696
0.106HisMet: 0.106 ± 0.057
1.161HisAsn: 1.161 ± 0.431
0.528HisPro: 0.528 ± 0.633
0.528HisGln: 0.528 ± 0.288
0.633HisArg: 0.633 ± 0.527
1.055HisSer: 1.055 ± 0.311
1.266HisThr: 1.266 ± 0.517
2.322HisVal: 2.322 ± 0.39
0.211HisTrp: 0.211 ± 0.114
0.844HisTyr: 0.844 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
3.166IleAla: 3.166 ± 0.774
1.266IleCys: 1.266 ± 0.918
1.478IleAsp: 1.478 ± 1.343
1.9IleGlu: 1.9 ± 0.533
2.322IlePhe: 2.322 ± 0.655
2.744IleGly: 2.744 ± 1.012
0.317IleHis: 0.317 ± 0.171
2.005IleIle: 2.005 ± 1.675
3.166IleLys: 3.166 ± 0.984
3.166IleLeu: 3.166 ± 1.699
1.161IleMet: 1.161 ± 0.51
2.533IleAsn: 2.533 ± 0.463
2.533IlePro: 2.533 ± 1.539
1.794IleGln: 1.794 ± 0.822
2.005IleArg: 2.005 ± 0.939
3.377IleSer: 3.377 ± 1.145
2.955IleThr: 2.955 ± 1.331
5.277IleVal: 5.277 ± 0.638
0.739IleTrp: 0.739 ± 0.274
0.95IleTyr: 0.95 ± 0.238
0.0IleXaa: 0.0 ± 0.0
Lys
4.327LysAla: 4.327 ± 0.945
2.005LysCys: 2.005 ± 0.763
3.061LysAsp: 3.061 ± 1.056
2.111LysGlu: 2.111 ± 1.604
3.272LysPhe: 3.272 ± 1.14
3.166LysGly: 3.166 ± 1.048
1.689LysHis: 1.689 ± 0.914
1.794LysIle: 1.794 ± 0.295
1.583LysLys: 1.583 ± 1.62
4.749LysLeu: 4.749 ± 1.016
0.95LysMet: 0.95 ± 0.304
2.427LysAsn: 2.427 ± 0.587
3.377LysPro: 3.377 ± 0.646
2.005LysGln: 2.005 ± 0.443
2.427LysArg: 2.427 ± 0.888
3.377LysSer: 3.377 ± 1.215
3.483LysThr: 3.483 ± 1.135
4.116LysVal: 4.116 ± 0.771
0.317LysTrp: 0.317 ± 0.138
2.216LysTyr: 2.216 ± 0.305
0.0LysXaa: 0.0 ± 0.0
Leu
5.805LeuAla: 5.805 ± 1.271
3.483LeuCys: 3.483 ± 1.044
4.538LeuAsp: 4.538 ± 0.505
4.011LeuGlu: 4.011 ± 1.02
4.222LeuPhe: 4.222 ± 1.191
4.96LeuGly: 4.96 ± 0.474
1.794LeuHis: 1.794 ± 0.524
2.85LeuIle: 2.85 ± 2.035
4.538LeuLys: 4.538 ± 1.646
8.443LeuLeu: 8.443 ± 2.886
1.372LeuMet: 1.372 ± 0.596
4.749LeuAsn: 4.749 ± 0.808
4.327LeuPro: 4.327 ± 2.504
3.799LeuGln: 3.799 ± 1.187
3.272LeuArg: 3.272 ± 0.766
6.755LeuSer: 6.755 ± 1.738
5.594LeuThr: 5.594 ± 0.651
6.121LeuVal: 6.121 ± 1.608
1.055LeuTrp: 1.055 ± 1.079
4.749LeuTyr: 4.749 ± 0.748
0.0LeuXaa: 0.0 ± 0.0
Met
1.372MetAla: 1.372 ± 0.637
1.055MetCys: 1.055 ± 0.572
1.266MetAsp: 1.266 ± 0.517
0.528MetGlu: 0.528 ± 0.286
1.372MetPhe: 1.372 ± 0.293
1.478MetGly: 1.478 ± 0.34
0.211MetHis: 0.211 ± 0.114
0.844MetIle: 0.844 ± 0.303
0.317MetLys: 0.317 ± 0.344
2.639MetLeu: 2.639 ± 1.001
0.422MetMet: 0.422 ± 0.229
0.844MetAsn: 0.844 ± 0.403
0.739MetPro: 0.739 ± 0.302
0.739MetGln: 0.739 ± 0.398
1.266MetArg: 1.266 ± 0.416
1.266MetSer: 1.266 ± 0.728
1.055MetThr: 1.055 ± 0.338
1.372MetVal: 1.372 ± 0.392
0.106MetTrp: 0.106 ± 0.057
0.95MetTyr: 0.95 ± 0.238
0.0MetXaa: 0.0 ± 0.0
Asn
3.799AsnAla: 3.799 ± 2.183
2.111AsnCys: 2.111 ± 0.713
2.005AsnAsp: 2.005 ± 0.568
2.005AsnGlu: 2.005 ± 0.355
2.111AsnPhe: 2.111 ± 0.685
5.594AsnGly: 5.594 ± 1.321
0.633AsnHis: 0.633 ± 0.343
3.694AsnIle: 3.694 ± 0.718
2.427AsnLys: 2.427 ± 0.614
3.694AsnLeu: 3.694 ± 0.324
1.266AsnMet: 1.266 ± 0.342
2.639AsnAsn: 2.639 ± 0.981
1.9AsnPro: 1.9 ± 0.436
1.266AsnGln: 1.266 ± 0.784
1.794AsnArg: 1.794 ± 0.499
4.96AsnSer: 4.96 ± 1.599
2.955AsnThr: 2.955 ± 0.64
6.755AsnVal: 6.755 ± 1.709
0.95AsnTrp: 0.95 ± 0.679
2.427AsnTyr: 2.427 ± 0.505
0.0AsnXaa: 0.0 ± 0.0
Pro
2.639ProAla: 2.639 ± 0.319
0.844ProCys: 0.844 ± 0.303
2.216ProAsp: 2.216 ± 0.499
1.794ProGlu: 1.794 ± 0.512
1.9ProPhe: 1.9 ± 0.36
2.85ProGly: 2.85 ± 0.508
0.844ProHis: 0.844 ± 0.445
1.372ProIle: 1.372 ± 0.306
2.111ProLys: 2.111 ± 1.368
4.011ProLeu: 4.011 ± 0.648
0.528ProMet: 0.528 ± 0.169
2.111ProAsn: 2.111 ± 0.725
2.322ProPro: 2.322 ± 0.237
1.266ProGln: 1.266 ± 1.584
1.266ProArg: 1.266 ± 0.497
2.744ProSer: 2.744 ± 1.094
2.744ProThr: 2.744 ± 1.457
4.855ProVal: 4.855 ± 1.4
0.317ProTrp: 0.317 ± 0.138
0.844ProTyr: 0.844 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
2.216GlnAla: 2.216 ± 0.507
0.528GlnCys: 0.528 ± 0.305
2.005GlnAsp: 2.005 ± 0.351
0.95GlnGlu: 0.95 ± 0.382
1.266GlnPhe: 1.266 ± 0.577
1.794GlnGly: 1.794 ± 0.561
0.422GlnHis: 0.422 ± 0.302
1.055GlnIle: 1.055 ± 0.357
1.478GlnLys: 1.478 ± 0.651
3.483GlnLeu: 3.483 ± 1.359
0.739GlnMet: 0.739 ± 0.27
1.055GlnAsn: 1.055 ± 0.198
1.372GlnPro: 1.372 ± 1.05
1.478GlnGln: 1.478 ± 1.131
1.9GlnArg: 1.9 ± 0.777
2.427GlnSer: 2.427 ± 0.587
1.583GlnThr: 1.583 ± 0.364
2.216GlnVal: 2.216 ± 1.208
0.211GlnTrp: 0.211 ± 0.114
1.372GlnTyr: 1.372 ± 0.701
0.0GlnXaa: 0.0 ± 0.0
Arg
2.744ArgAla: 2.744 ± 0.384
1.689ArgCys: 1.689 ± 0.369
1.478ArgAsp: 1.478 ± 0.774
0.95ArgGlu: 0.95 ± 0.706
2.639ArgPhe: 2.639 ± 0.581
2.005ArgGly: 2.005 ± 0.827
1.161ArgHis: 1.161 ± 1.249
1.9ArgIle: 1.9 ± 0.624
2.111ArgLys: 2.111 ± 0.959
3.694ArgLeu: 3.694 ± 1.685
0.844ArgMet: 0.844 ± 0.179
2.322ArgAsn: 2.322 ± 0.377
1.055ArgPro: 1.055 ± 0.493
1.161ArgGln: 1.161 ± 0.201
1.266ArgArg: 1.266 ± 0.644
2.427ArgSer: 2.427 ± 1.546
2.533ArgThr: 2.533 ± 1.39
3.377ArgVal: 3.377 ± 0.401
0.739ArgTrp: 0.739 ± 0.498
2.005ArgTyr: 2.005 ± 0.395
0.0ArgXaa: 0.0 ± 0.0
Ser
4.749SerAla: 4.749 ± 0.988
1.794SerCys: 1.794 ± 0.577
4.433SerAsp: 4.433 ± 0.918
2.955SerGlu: 2.955 ± 0.536
4.538SerPhe: 4.538 ± 1.573
4.749SerGly: 4.749 ± 0.637
1.372SerHis: 1.372 ± 0.32
3.272SerIle: 3.272 ± 1.185
4.433SerLys: 4.433 ± 1.013
4.855SerLeu: 4.855 ± 1.639
1.583SerMet: 1.583 ± 0.556
3.483SerAsn: 3.483 ± 1.2
1.689SerPro: 1.689 ± 0.796
2.533SerGln: 2.533 ± 1.972
2.639SerArg: 2.639 ± 2.015
5.172SerSer: 5.172 ± 1.314
5.699SerThr: 5.699 ± 1.072
6.649SerVal: 6.649 ± 1.486
0.633SerTrp: 0.633 ± 0.224
2.85SerTyr: 2.85 ± 0.879
0.0SerXaa: 0.0 ± 0.0
Thr
4.116ThrAla: 4.116 ± 0.634
1.583ThrCys: 1.583 ± 0.684
3.483ThrAsp: 3.483 ± 0.549
2.111ThrGlu: 2.111 ± 0.558
4.011ThrPhe: 4.011 ± 0.866
3.588ThrGly: 3.588 ± 1.627
0.528ThrHis: 0.528 ± 0.449
3.588ThrIle: 3.588 ± 0.448
2.639ThrLys: 2.639 ± 0.547
5.699ThrLeu: 5.699 ± 1.248
2.005ThrMet: 2.005 ± 0.909
3.483ThrAsn: 3.483 ± 0.838
2.955ThrPro: 2.955 ± 0.75
1.689ThrGln: 1.689 ± 0.626
2.639ThrArg: 2.639 ± 0.547
4.116ThrSer: 4.116 ± 0.47
4.538ThrThr: 4.538 ± 0.986
5.383ThrVal: 5.383 ± 1.099
0.422ThrTrp: 0.422 ± 0.143
2.533ThrTyr: 2.533 ± 0.463
0.0ThrXaa: 0.0 ± 0.0
Val
8.549ValAla: 8.549 ± 1.09
3.905ValCys: 3.905 ± 1.122
6.121ValAsp: 6.121 ± 1.24
4.538ValGlu: 4.538 ± 0.707
4.749ValPhe: 4.749 ± 0.63
6.332ValGly: 6.332 ± 1.011
1.689ValHis: 1.689 ± 0.542
4.644ValIle: 4.644 ± 0.839
6.438ValLys: 6.438 ± 2.824
7.704ValLeu: 7.704 ± 1.015
1.794ValMet: 1.794 ± 0.537
6.121ValAsn: 6.121 ± 0.981
4.116ValPro: 4.116 ± 0.512
2.744ValGln: 2.744 ± 0.529
3.694ValArg: 3.694 ± 1.649
7.071ValSer: 7.071 ± 1.291
6.332ValThr: 6.332 ± 1.111
12.243ValVal: 12.243 ± 1.519
1.055ValTrp: 1.055 ± 0.309
3.272ValTyr: 3.272 ± 0.9
0.0ValXaa: 0.0 ± 0.0
Trp
0.844TrpAla: 0.844 ± 0.746
0.317TrpCys: 0.317 ± 0.138
0.95TrpAsp: 0.95 ± 0.355
0.211TrpGlu: 0.211 ± 0.114
0.739TrpPhe: 0.739 ± 0.274
0.211TrpGly: 0.211 ± 0.155
0.317TrpHis: 0.317 ± 0.242
0.317TrpIle: 0.317 ± 0.244
0.317TrpLys: 0.317 ± 0.242
1.794TrpLeu: 1.794 ± 0.772
0.211TrpMet: 0.211 ± 0.114
0.739TrpAsn: 0.739 ± 0.629
0.633TrpPro: 0.633 ± 0.37
0.106TrpGln: 0.106 ± 0.057
0.739TrpArg: 0.739 ± 0.17
0.844TrpSer: 0.844 ± 0.423
0.844TrpThr: 0.844 ± 0.632
0.95TrpVal: 0.95 ± 0.204
0.106TrpTrp: 0.106 ± 0.057
0.844TrpTyr: 0.844 ± 0.263
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.166TyrAla: 3.166 ± 0.469
1.689TyrCys: 1.689 ± 0.525
2.639TyrAsp: 2.639 ± 1.09
2.005TyrGlu: 2.005 ± 0.672
2.005TyrPhe: 2.005 ± 0.445
3.483TyrGly: 3.483 ± 0.736
0.95TyrHis: 0.95 ± 0.582
1.372TyrIle: 1.372 ± 0.276
2.005TyrLys: 2.005 ± 0.427
3.799TyrLeu: 3.799 ± 0.804
1.055TyrMet: 1.055 ± 0.697
3.272TyrAsn: 3.272 ± 1.279
0.95TyrPro: 0.95 ± 0.514
0.95TyrGln: 0.95 ± 0.522
1.689TyrArg: 1.689 ± 1.0
2.111TyrSer: 2.111 ± 0.824
2.216TyrThr: 2.216 ± 0.559
5.277TyrVal: 5.277 ± 1.065
0.633TyrTrp: 0.633 ± 0.27
2.744TyrTyr: 2.744 ± 0.519
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (9476 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski