Amino acid dipepetide frequency for Sripur virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.288AlaAla: 2.288 ± 0.977
1.271AlaCys: 1.271 ± 0.414
2.034AlaAsp: 2.034 ± 0.538
3.05AlaGlu: 3.05 ± 1.084
2.034AlaPhe: 2.034 ± 1.094
1.271AlaGly: 1.271 ± 0.729
1.017AlaHis: 1.017 ± 0.66
2.288AlaIle: 2.288 ± 0.6
2.542AlaLys: 2.542 ± 0.945
3.813AlaLeu: 3.813 ± 0.859
2.034AlaMet: 2.034 ± 1.087
1.779AlaAsn: 1.779 ± 1.169
0.508AlaPro: 0.508 ± 0.527
0.763AlaGln: 0.763 ± 0.451
2.034AlaArg: 2.034 ± 0.607
1.271AlaSer: 1.271 ± 0.747
2.288AlaThr: 2.288 ± 1.284
1.525AlaVal: 1.525 ± 0.712
0.763AlaTrp: 0.763 ± 0.362
1.017AlaTyr: 1.017 ± 0.708
0.0AlaXaa: 0.0 ± 0.0
Cys
0.763CysAla: 0.763 ± 0.451
0.763CysCys: 0.763 ± 0.44
1.017CysAsp: 1.017 ± 0.746
1.017CysGlu: 1.017 ± 0.435
1.271CysPhe: 1.271 ± 0.549
2.034CysGly: 2.034 ± 1.063
0.254CysHis: 0.254 ± 0.321
1.017CysIle: 1.017 ± 0.783
2.796CysLys: 2.796 ± 0.587
3.05CysLeu: 3.05 ± 0.895
0.254CysMet: 0.254 ± 0.15
0.763CysAsn: 0.763 ± 0.451
1.271CysPro: 1.271 ± 0.974
2.034CysGln: 2.034 ± 0.996
2.034CysArg: 2.034 ± 1.285
2.542CysSer: 2.542 ± 0.902
0.763CysThr: 0.763 ± 0.688
1.017CysVal: 1.017 ± 0.391
0.508CysTrp: 0.508 ± 0.3
1.017CysTyr: 1.017 ± 0.601
0.0CysXaa: 0.0 ± 0.0
Asp
1.271AspAla: 1.271 ± 1.018
1.017AspCys: 1.017 ± 0.413
4.321AspAsp: 4.321 ± 1.238
3.305AspGlu: 3.305 ± 2.741
2.542AspPhe: 2.542 ± 1.135
2.542AspGly: 2.542 ± 0.856
1.525AspHis: 1.525 ± 0.633
4.067AspIle: 4.067 ± 1.47
5.846AspLys: 5.846 ± 0.836
6.863AspLeu: 6.863 ± 1.406
2.288AspMet: 2.288 ± 0.4
3.05AspAsn: 3.05 ± 1.097
3.813AspPro: 3.813 ± 0.947
1.271AspGln: 1.271 ± 0.752
1.525AspArg: 1.525 ± 0.641
4.321AspSer: 4.321 ± 1.228
2.288AspThr: 2.288 ± 0.531
2.542AspVal: 2.542 ± 1.046
1.525AspTrp: 1.525 ± 0.536
2.034AspTyr: 2.034 ± 0.922
0.0AspXaa: 0.0 ± 0.0
Glu
2.542GluAla: 2.542 ± 1.521
2.288GluCys: 2.288 ± 1.141
4.83GluAsp: 4.83 ± 2.56
5.846GluGlu: 5.846 ± 3.544
2.034GluPhe: 2.034 ± 1.202
2.542GluGly: 2.542 ± 1.439
1.525GluHis: 1.525 ± 0.455
5.084GluIle: 5.084 ± 1.741
7.117GluLys: 7.117 ± 1.278
5.592GluLeu: 5.592 ± 1.377
1.779GluMet: 1.779 ± 0.881
3.05GluAsn: 3.05 ± 0.594
1.017GluPro: 1.017 ± 0.601
1.271GluGln: 1.271 ± 0.756
3.05GluArg: 3.05 ± 1.508
5.338GluSer: 5.338 ± 3.238
2.542GluThr: 2.542 ± 0.624
3.305GluVal: 3.305 ± 1.081
1.271GluTrp: 1.271 ± 0.805
2.034GluTyr: 2.034 ± 0.584
0.0GluXaa: 0.0 ± 0.0
Phe
2.288PheAla: 2.288 ± 1.012
1.779PheCys: 1.779 ± 0.816
1.271PheAsp: 1.271 ± 0.582
2.796PheGlu: 2.796 ± 1.115
2.034PhePhe: 2.034 ± 0.64
1.779PheGly: 1.779 ± 0.883
0.508PheHis: 0.508 ± 0.642
3.05PheIle: 3.05 ± 0.831
3.559PheLys: 3.559 ± 0.772
4.575PheLeu: 4.575 ± 1.254
0.508PheMet: 0.508 ± 0.642
2.796PheAsn: 2.796 ± 0.717
4.067PhePro: 4.067 ± 0.713
2.288PheGln: 2.288 ± 0.648
2.542PheArg: 2.542 ± 0.639
1.779PheSer: 1.779 ± 0.604
2.542PheThr: 2.542 ± 0.681
2.288PheVal: 2.288 ± 0.785
1.271PheTrp: 1.271 ± 0.996
3.05PheTyr: 3.05 ± 1.181
0.0PheXaa: 0.0 ± 0.0
Gly
1.017GlyAla: 1.017 ± 0.44
1.525GlyCys: 1.525 ± 0.55
3.05GlyAsp: 3.05 ± 0.476
3.305GlyGlu: 3.305 ± 0.762
2.796GlyPhe: 2.796 ± 0.725
3.559GlyGly: 3.559 ± 0.819
1.017GlyHis: 1.017 ± 0.611
5.084GlyIle: 5.084 ± 1.623
1.779GlyLys: 1.779 ± 1.062
5.338GlyLeu: 5.338 ± 0.964
1.017GlyMet: 1.017 ± 0.384
2.288GlyAsn: 2.288 ± 1.09
0.763GlyPro: 0.763 ± 0.451
2.288GlyGln: 2.288 ± 0.735
1.017GlyArg: 1.017 ± 0.398
4.575GlySer: 4.575 ± 1.706
2.288GlyThr: 2.288 ± 0.819
4.321GlyVal: 4.321 ± 2.143
1.017GlyTrp: 1.017 ± 0.601
2.288GlyTyr: 2.288 ± 0.743
0.0GlyXaa: 0.0 ± 0.0
His
0.254HisAla: 0.254 ± 0.321
0.763HisCys: 0.763 ± 0.531
0.763HisAsp: 0.763 ± 0.301
1.525HisGlu: 1.525 ± 0.603
1.271HisPhe: 1.271 ± 0.636
0.508HisGly: 0.508 ± 0.79
0.763HisHis: 0.763 ± 0.301
1.017HisIle: 1.017 ± 0.546
1.525HisLys: 1.525 ± 0.43
2.034HisLeu: 2.034 ± 0.782
0.508HisMet: 0.508 ± 0.348
1.271HisAsn: 1.271 ± 0.407
1.271HisPro: 1.271 ± 0.509
0.763HisGln: 0.763 ± 0.688
1.271HisArg: 1.271 ± 0.447
1.017HisSer: 1.017 ± 0.687
0.508HisThr: 0.508 ± 0.3
1.017HisVal: 1.017 ± 1.054
0.254HisTrp: 0.254 ± 0.321
1.017HisTyr: 1.017 ± 0.583
0.0HisXaa: 0.0 ± 0.0
Ile
1.017IleAla: 1.017 ± 0.391
2.288IleCys: 2.288 ± 1.003
4.575IleAsp: 4.575 ± 1.222
4.067IleGlu: 4.067 ± 1.463
4.321IlePhe: 4.321 ± 1.071
4.067IleGly: 4.067 ± 0.876
1.525IleHis: 1.525 ± 0.938
6.609IleIle: 6.609 ± 2.29
7.372IleLys: 7.372 ± 1.038
7.88IleLeu: 7.88 ± 2.1
2.542IleMet: 2.542 ± 1.262
4.321IleAsn: 4.321 ± 2.043
3.05IlePro: 3.05 ± 0.931
4.575IleGln: 4.575 ± 0.701
4.83IleArg: 4.83 ± 1.629
4.575IleSer: 4.575 ± 1.092
6.355IleThr: 6.355 ± 1.411
3.813IleVal: 3.813 ± 0.803
1.271IleTrp: 1.271 ± 0.515
3.05IleTyr: 3.05 ± 0.974
0.0IleXaa: 0.0 ± 0.0
Lys
1.779LysAla: 1.779 ± 1.392
2.034LysCys: 2.034 ± 0.839
4.83LysAsp: 4.83 ± 0.876
7.372LysGlu: 7.372 ± 1.333
3.559LysPhe: 3.559 ± 0.746
3.559LysGly: 3.559 ± 0.658
2.542LysHis: 2.542 ± 1.509
8.643LysIle: 8.643 ± 1.736
5.846LysLys: 5.846 ± 2.848
8.388LysLeu: 8.388 ± 1.266
2.796LysMet: 2.796 ± 1.863
4.067LysAsn: 4.067 ± 0.687
2.288LysPro: 2.288 ± 0.854
1.017LysGln: 1.017 ± 0.486
5.084LysArg: 5.084 ± 1.341
4.575LysSer: 4.575 ± 1.07
4.067LysThr: 4.067 ± 0.701
3.05LysVal: 3.05 ± 1.038
1.271LysTrp: 1.271 ± 0.703
2.542LysTyr: 2.542 ± 1.018
0.0LysXaa: 0.0 ± 0.0
Leu
3.05LeuAla: 3.05 ± 0.957
2.034LeuCys: 2.034 ± 0.864
4.575LeuAsp: 4.575 ± 1.099
6.101LeuGlu: 6.101 ± 1.046
4.83LeuPhe: 4.83 ± 1.187
4.83LeuGly: 4.83 ± 1.917
1.525LeuHis: 1.525 ± 0.706
7.88LeuIle: 7.88 ± 1.755
8.897LeuLys: 8.897 ± 2.253
8.134LeuLeu: 8.134 ± 1.753
3.05LeuMet: 3.05 ± 1.068
6.355LeuAsn: 6.355 ± 1.176
3.559LeuPro: 3.559 ± 1.214
2.796LeuGln: 2.796 ± 1.085
4.83LeuArg: 4.83 ± 1.005
10.422LeuSer: 10.422 ± 2.092
5.592LeuThr: 5.592 ± 0.587
5.084LeuVal: 5.084 ± 1.187
0.508LeuTrp: 0.508 ± 0.41
4.321LeuTyr: 4.321 ± 1.251
0.0LeuXaa: 0.0 ± 0.0
Met
0.763MetAla: 0.763 ± 0.362
0.763MetCys: 0.763 ± 0.473
1.525MetAsp: 1.525 ± 0.862
2.034MetGlu: 2.034 ± 0.805
0.508MetPhe: 0.508 ± 0.306
1.525MetGly: 1.525 ± 0.545
0.254MetHis: 0.254 ± 0.434
3.05MetIle: 3.05 ± 1.267
3.05MetLys: 3.05 ± 2.08
3.305MetLeu: 3.305 ± 1.015
1.525MetMet: 1.525 ± 0.721
1.525MetAsn: 1.525 ± 1.237
0.508MetPro: 0.508 ± 0.642
1.017MetGln: 1.017 ± 0.475
1.779MetArg: 1.779 ± 1.188
2.542MetSer: 2.542 ± 0.597
1.271MetThr: 1.271 ± 0.411
1.271MetVal: 1.271 ± 0.951
0.254MetTrp: 0.254 ± 0.15
1.271MetTyr: 1.271 ± 0.823
0.0MetXaa: 0.0 ± 0.0
Asn
1.525AsnAla: 1.525 ± 0.621
1.271AsnCys: 1.271 ± 0.552
2.796AsnAsp: 2.796 ± 1.402
2.542AsnGlu: 2.542 ± 1.102
0.763AsnPhe: 0.763 ± 0.545
2.288AsnGly: 2.288 ± 1.012
1.271AsnHis: 1.271 ± 0.555
5.846AsnIle: 5.846 ± 0.8
3.305AsnLys: 3.305 ± 1.014
6.101AsnLeu: 6.101 ± 0.815
1.271AsnMet: 1.271 ± 0.371
4.575AsnAsn: 4.575 ± 1.436
3.305AsnPro: 3.305 ± 0.659
2.542AsnGln: 2.542 ± 0.966
2.542AsnArg: 2.542 ± 1.175
3.813AsnSer: 3.813 ± 0.971
3.305AsnThr: 3.305 ± 0.989
1.779AsnVal: 1.779 ± 0.855
1.017AsnTrp: 1.017 ± 0.448
2.288AsnTyr: 2.288 ± 0.626
0.0AsnXaa: 0.0 ± 0.0
Pro
2.542ProAla: 2.542 ± 1.128
0.0ProCys: 0.0 ± 0.0
2.542ProAsp: 2.542 ± 0.711
1.779ProGlu: 1.779 ± 0.608
1.525ProPhe: 1.525 ± 0.495
2.034ProGly: 2.034 ± 1.251
0.254ProHis: 0.254 ± 0.15
2.288ProIle: 2.288 ± 0.676
3.05ProLys: 3.05 ± 0.945
3.559ProLeu: 3.559 ± 0.921
0.508ProMet: 0.508 ± 0.735
0.763ProAsn: 0.763 ± 0.301
1.017ProPro: 1.017 ± 0.448
1.779ProGln: 1.779 ± 0.78
1.017ProArg: 1.017 ± 0.448
4.83ProSer: 4.83 ± 0.517
3.305ProThr: 3.305 ± 0.847
2.034ProVal: 2.034 ± 0.799
0.508ProTrp: 0.508 ± 0.3
1.271ProTyr: 1.271 ± 0.549
0.0ProXaa: 0.0 ± 0.0
Gln
1.525GlnAla: 1.525 ± 0.86
0.508GlnCys: 0.508 ± 0.273
2.034GlnAsp: 2.034 ± 1.162
2.288GlnGlu: 2.288 ± 1.74
2.288GlnPhe: 2.288 ± 1.148
1.017GlnGly: 1.017 ± 0.601
0.763GlnHis: 0.763 ± 0.451
2.034GlnIle: 2.034 ± 0.729
1.525GlnLys: 1.525 ± 0.662
4.321GlnLeu: 4.321 ± 1.395
1.525GlnMet: 1.525 ± 1.379
3.05GlnAsn: 3.05 ± 0.704
1.017GlnPro: 1.017 ± 0.462
2.034GlnGln: 2.034 ± 0.619
2.288GlnArg: 2.288 ± 0.704
3.813GlnSer: 3.813 ± 1.058
1.017GlnThr: 1.017 ± 0.919
1.525GlnVal: 1.525 ± 0.493
0.763GlnTrp: 0.763 ± 0.33
0.763GlnTyr: 0.763 ± 0.451
0.0GlnXaa: 0.0 ± 0.0
Arg
2.542ArgAla: 2.542 ± 0.624
2.542ArgCys: 2.542 ± 1.002
2.288ArgAsp: 2.288 ± 0.839
2.288ArgGlu: 2.288 ± 1.853
1.525ArgPhe: 1.525 ± 0.723
3.305ArgGly: 3.305 ± 0.751
0.508ArgHis: 0.508 ± 0.642
4.067ArgIle: 4.067 ± 1.195
4.575ArgLys: 4.575 ± 2.363
3.305ArgLeu: 3.305 ± 1.062
1.779ArgMet: 1.779 ± 0.699
3.05ArgAsn: 3.05 ± 0.534
1.271ArgPro: 1.271 ± 0.725
0.763ArgGln: 0.763 ± 0.362
1.271ArgArg: 1.271 ± 0.949
3.813ArgSer: 3.813 ± 0.696
2.542ArgThr: 2.542 ± 1.054
3.559ArgVal: 3.559 ± 0.92
0.763ArgTrp: 0.763 ± 0.546
1.017ArgTyr: 1.017 ± 0.462
0.0ArgXaa: 0.0 ± 0.0
Ser
4.83SerAla: 4.83 ± 1.105
2.034SerCys: 2.034 ± 1.009
5.846SerAsp: 5.846 ± 1.19
5.592SerGlu: 5.592 ± 1.718
5.084SerPhe: 5.084 ± 1.617
3.559SerGly: 3.559 ± 1.199
2.034SerHis: 2.034 ± 0.801
6.609SerIle: 6.609 ± 2.259
5.592SerLys: 5.592 ± 1.887
6.101SerLeu: 6.101 ± 0.984
2.288SerMet: 2.288 ± 0.755
4.321SerAsn: 4.321 ± 0.885
1.525SerPro: 1.525 ± 0.603
2.542SerGln: 2.542 ± 0.683
2.796SerArg: 2.796 ± 1.653
6.609SerSer: 6.609 ± 1.149
3.559SerThr: 3.559 ± 1.003
4.83SerVal: 4.83 ± 1.367
1.017SerTrp: 1.017 ± 0.524
2.542SerTyr: 2.542 ± 0.567
0.0SerXaa: 0.0 ± 0.0
Thr
2.288ThrAla: 2.288 ± 0.768
0.508ThrCys: 0.508 ± 0.551
2.034ThrAsp: 2.034 ± 1.153
4.321ThrGlu: 4.321 ± 1.25
3.05ThrPhe: 3.05 ± 0.872
4.321ThrGly: 4.321 ± 1.491
1.271ThrHis: 1.271 ± 0.532
3.813ThrIle: 3.813 ± 1.173
3.05ThrLys: 3.05 ± 0.824
4.575ThrLeu: 4.575 ± 0.809
1.271ThrMet: 1.271 ± 0.532
1.525ThrAsn: 1.525 ± 0.539
2.034ThrPro: 2.034 ± 0.624
1.779ThrGln: 1.779 ± 0.462
1.271ThrArg: 1.271 ± 0.437
5.592ThrSer: 5.592 ± 1.227
2.796ThrThr: 2.796 ± 0.774
3.05ThrVal: 3.05 ± 0.625
1.017ThrTrp: 1.017 ± 0.308
1.525ThrTyr: 1.525 ± 0.882
0.0ThrXaa: 0.0 ± 0.0
Val
2.542ValAla: 2.542 ± 2.09
1.017ValCys: 1.017 ± 0.514
3.559ValAsp: 3.559 ± 0.767
2.796ValGlu: 2.796 ± 0.807
2.288ValPhe: 2.288 ± 1.117
1.271ValGly: 1.271 ± 0.549
0.763ValHis: 0.763 ± 0.576
5.846ValIle: 5.846 ± 1.159
3.559ValLys: 3.559 ± 1.374
5.084ValLeu: 5.084 ± 1.568
1.525ValMet: 1.525 ± 0.674
2.034ValAsn: 2.034 ± 0.579
1.779ValPro: 1.779 ± 0.681
2.288ValGln: 2.288 ± 2.203
3.05ValArg: 3.05 ± 0.94
3.05ValSer: 3.05 ± 1.032
3.05ValThr: 3.05 ± 0.776
3.813ValVal: 3.813 ± 1.62
0.763ValTrp: 0.763 ± 0.362
2.034ValTyr: 2.034 ± 0.781
0.0ValXaa: 0.0 ± 0.0
Trp
0.254TrpAla: 0.254 ± 0.499
0.0TrpCys: 0.0 ± 0.0
0.763TrpAsp: 0.763 ± 0.451
0.763TrpGlu: 0.763 ± 0.451
0.508TrpPhe: 0.508 ± 0.273
0.763TrpGly: 0.763 ± 0.301
0.0TrpHis: 0.0 ± 0.0
1.525TrpIle: 1.525 ± 0.541
1.525TrpLys: 1.525 ± 0.934
2.542TrpLeu: 2.542 ± 1.13
0.508TrpMet: 0.508 ± 0.597
1.017TrpAsn: 1.017 ± 0.601
0.254TrpPro: 0.254 ± 0.15
0.508TrpGln: 0.508 ± 0.273
0.763TrpArg: 0.763 ± 0.637
1.525TrpSer: 1.525 ± 0.691
0.763TrpThr: 0.763 ± 0.362
1.017TrpVal: 1.017 ± 0.801
0.0TrpTrp: 0.0 ± 0.0
1.271TrpTyr: 1.271 ± 0.555
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.508TyrAla: 0.508 ± 0.306
1.779TyrCys: 1.779 ± 0.68
3.305TyrAsp: 3.305 ± 0.729
1.271TyrGlu: 1.271 ± 0.541
2.542TyrPhe: 2.542 ± 0.845
3.559TyrGly: 3.559 ± 0.821
0.0TyrHis: 0.0 ± 0.0
2.288TyrIle: 2.288 ± 0.92
2.796TyrLys: 2.796 ± 0.854
3.813TyrLeu: 3.813 ± 1.523
0.508TyrMet: 0.508 ± 0.306
2.542TyrAsn: 2.542 ± 0.685
2.542TyrPro: 2.542 ± 0.479
1.779TyrGln: 1.779 ± 0.458
1.779TyrArg: 1.779 ± 0.662
3.305TyrSer: 3.305 ± 1.067
0.254TyrThr: 0.254 ± 0.15
1.271TyrVal: 1.271 ± 0.356
0.508TyrTrp: 0.508 ± 0.674
1.525TyrTyr: 1.525 ± 0.541
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (3935 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski