Amino acid dipepetide frequency for Beihai anemone virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.163AlaAla: 5.163 ± 1.776
0.543AlaCys: 0.543 ± 0.354
2.174AlaAsp: 2.174 ± 0.781
2.446AlaGlu: 2.446 ± 0.913
3.261AlaPhe: 3.261 ± 1.843
2.446AlaGly: 2.446 ± 1.528
1.087AlaHis: 1.087 ± 0.376
2.717AlaIle: 2.717 ± 0.745
2.174AlaLys: 2.174 ± 0.729
5.707AlaLeu: 5.707 ± 1.834
1.359AlaMet: 1.359 ± 0.515
3.261AlaAsn: 3.261 ± 1.639
2.717AlaPro: 2.717 ± 1.029
0.815AlaGln: 0.815 ± 0.63
4.62AlaArg: 4.62 ± 0.725
2.174AlaSer: 2.174 ± 0.616
3.533AlaThr: 3.533 ± 0.547
5.707AlaVal: 5.707 ± 0.964
0.272AlaTrp: 0.272 ± 0.137
2.717AlaTyr: 2.717 ± 1.041
0.0AlaXaa: 0.0 ± 0.0
Cys
1.63CysAla: 1.63 ± 0.492
0.815CysCys: 0.815 ± 1.458
1.359CysAsp: 1.359 ± 0.683
1.63CysGlu: 1.63 ± 0.82
1.087CysPhe: 1.087 ± 2.229
2.446CysGly: 2.446 ± 0.54
0.543CysHis: 0.543 ± 0.354
1.359CysIle: 1.359 ± 0.679
0.815CysLys: 0.815 ± 0.63
1.359CysLeu: 1.359 ± 0.59
0.272CysMet: 0.272 ± 0.137
0.543CysAsn: 0.543 ± 0.273
1.087CysPro: 1.087 ± 0.546
0.543CysGln: 0.543 ± 0.894
0.815CysArg: 0.815 ± 0.63
1.359CysSer: 1.359 ± 0.544
0.815CysThr: 0.815 ± 1.076
2.989CysVal: 2.989 ± 0.899
0.272CysTrp: 0.272 ± 0.137
1.087CysTyr: 1.087 ± 1.515
0.0CysXaa: 0.0 ± 0.0
Asp
3.804AspAla: 3.804 ± 1.221
1.902AspCys: 1.902 ± 1.029
2.717AspAsp: 2.717 ± 1.366
2.989AspGlu: 2.989 ± 1.063
4.076AspPhe: 4.076 ± 2.878
2.717AspGly: 2.717 ± 1.013
1.63AspHis: 1.63 ± 0.534
5.163AspIle: 5.163 ± 0.675
2.989AspLys: 2.989 ± 1.184
7.337AspLeu: 7.337 ± 1.486
0.815AspMet: 0.815 ± 0.41
3.261AspAsn: 3.261 ± 1.639
4.891AspPro: 4.891 ± 1.014
1.087AspGln: 1.087 ± 0.805
1.902AspArg: 1.902 ± 0.677
4.348AspSer: 4.348 ± 1.324
2.174AspThr: 2.174 ± 1.725
6.793AspVal: 6.793 ± 1.897
0.815AspTrp: 0.815 ± 0.47
3.804AspTyr: 3.804 ± 0.774
0.0AspXaa: 0.0 ± 0.0
Glu
2.174GluAla: 2.174 ± 0.781
0.272GluCys: 0.272 ± 0.417
2.174GluAsp: 2.174 ± 1.093
2.174GluGlu: 2.174 ± 1.43
3.804GluPhe: 3.804 ± 0.887
2.717GluGly: 2.717 ± 1.088
1.359GluHis: 1.359 ± 0.52
6.522GluIle: 6.522 ± 2.364
2.989GluLys: 2.989 ± 0.857
7.609GluLeu: 7.609 ± 1.211
0.272GluMet: 0.272 ± 0.137
3.533GluAsn: 3.533 ± 0.737
1.359GluPro: 1.359 ± 0.465
1.087GluGln: 1.087 ± 0.546
2.989GluArg: 2.989 ± 0.778
2.717GluSer: 2.717 ± 2.459
3.261GluThr: 3.261 ± 0.828
1.359GluVal: 1.359 ± 0.453
0.272GluTrp: 0.272 ± 0.137
1.902GluTyr: 1.902 ± 0.755
0.0GluXaa: 0.0 ± 0.0
Phe
3.261PheAla: 3.261 ± 0.845
1.359PheCys: 1.359 ± 0.683
4.076PheAsp: 4.076 ± 1.65
1.63PheGlu: 1.63 ± 0.616
1.359PhePhe: 1.359 ± 1.046
3.533PheGly: 3.533 ± 1.091
1.087PheHis: 1.087 ± 0.916
3.533PheIle: 3.533 ± 2.171
5.163PheLys: 5.163 ± 0.658
4.076PheLeu: 4.076 ± 2.159
0.815PheMet: 0.815 ± 0.41
2.174PheAsn: 2.174 ± 0.684
4.348PhePro: 4.348 ± 1.301
1.902PheGln: 1.902 ± 2.138
2.446PheArg: 2.446 ± 1.412
5.435PheSer: 5.435 ± 2.455
4.891PheThr: 4.891 ± 0.78
5.707PheVal: 5.707 ± 0.967
0.272PheTrp: 0.272 ± 0.417
1.63PheTyr: 1.63 ± 1.329
0.0PheXaa: 0.0 ± 0.0
Gly
2.174GlyAla: 2.174 ± 0.814
1.087GlyCys: 1.087 ± 0.546
3.533GlyAsp: 3.533 ± 0.921
3.261GlyGlu: 3.261 ± 1.035
2.174GlyPhe: 2.174 ± 0.744
3.533GlyGly: 3.533 ± 0.855
1.087GlyHis: 1.087 ± 0.979
2.717GlyIle: 2.717 ± 1.029
3.804GlyLys: 3.804 ± 0.872
1.902GlyLeu: 1.902 ± 0.746
1.087GlyMet: 1.087 ± 0.449
3.261GlyAsn: 3.261 ± 0.718
0.815GlyPro: 0.815 ± 0.339
0.543GlyGln: 0.543 ± 0.354
1.902GlyArg: 1.902 ± 1.051
2.446GlySer: 2.446 ± 0.833
2.717GlyThr: 2.717 ± 0.745
3.261GlyVal: 3.261 ± 0.845
0.272GlyTrp: 0.272 ± 0.137
1.63GlyTyr: 1.63 ± 0.534
0.0GlyXaa: 0.0 ± 0.0
His
1.359HisAla: 1.359 ± 0.683
1.087HisCys: 1.087 ± 1.515
1.359HisAsp: 1.359 ± 0.453
0.815HisGlu: 0.815 ± 0.585
3.533HisPhe: 3.533 ± 0.547
1.359HisGly: 1.359 ± 0.59
1.359HisHis: 1.359 ± 0.453
1.63HisIle: 1.63 ± 0.534
1.359HisLys: 1.359 ± 0.683
2.446HisLeu: 2.446 ± 1.349
0.272HisMet: 0.272 ± 0.137
1.359HisAsn: 1.359 ± 0.683
1.902HisPro: 1.902 ± 0.5
0.0HisGln: 0.0 ± 0.0
0.272HisArg: 0.272 ± 0.137
1.63HisSer: 1.63 ± 0.554
2.174HisThr: 2.174 ± 0.723
1.087HisVal: 1.087 ± 0.546
0.0HisTrp: 0.0 ± 0.0
0.272HisTyr: 0.272 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
4.348IleAla: 4.348 ± 1.562
2.446IleCys: 2.446 ± 1.118
6.522IleAsp: 6.522 ± 1.817
6.793IleGlu: 6.793 ± 1.414
2.446IlePhe: 2.446 ± 0.825
2.446IleGly: 2.446 ± 0.642
1.087IleHis: 1.087 ± 0.546
3.261IleIle: 3.261 ± 2.346
4.076IleLys: 4.076 ± 0.789
7.065IleLeu: 7.065 ± 2.919
0.543IleMet: 0.543 ± 0.273
2.989IleAsn: 2.989 ± 0.992
3.804IlePro: 3.804 ± 1.448
1.63IleGln: 1.63 ± 0.616
1.902IleArg: 1.902 ± 1.136
6.522IleSer: 6.522 ± 1.239
2.717IleThr: 2.717 ± 0.541
7.065IleVal: 7.065 ± 1.633
0.272IleTrp: 0.272 ± 0.137
4.891IleTyr: 4.891 ± 1.461
0.0IleXaa: 0.0 ± 0.0
Lys
1.902LysAla: 1.902 ± 0.956
1.359LysCys: 1.359 ± 0.453
4.348LysAsp: 4.348 ± 1.402
5.163LysGlu: 5.163 ± 1.185
6.25LysPhe: 6.25 ± 0.917
1.902LysGly: 1.902 ± 0.677
1.087LysHis: 1.087 ± 0.546
6.793LysIle: 6.793 ± 1.699
6.793LysLys: 6.793 ± 2.371
4.891LysLeu: 4.891 ± 1.004
1.087LysMet: 1.087 ± 0.377
2.989LysAsn: 2.989 ± 0.619
3.804LysPro: 3.804 ± 1.283
1.087LysGln: 1.087 ± 1.339
3.533LysArg: 3.533 ± 1.714
5.435LysSer: 5.435 ± 2.374
3.533LysThr: 3.533 ± 0.595
4.076LysVal: 4.076 ± 1.364
0.543LysTrp: 0.543 ± 0.273
3.261LysTyr: 3.261 ± 1.639
0.0LysXaa: 0.0 ± 0.0
Leu
4.891LeuAla: 4.891 ± 1.567
2.174LeuCys: 2.174 ± 0.495
7.065LeuAsp: 7.065 ± 1.916
4.348LeuGlu: 4.348 ± 1.234
4.348LeuPhe: 4.348 ± 2.279
2.446LeuGly: 2.446 ± 0.601
2.174LeuHis: 2.174 ± 0.744
7.337LeuIle: 7.337 ± 3.41
4.62LeuLys: 4.62 ± 1.523
10.598LeuLeu: 10.598 ± 3.783
1.902LeuMet: 1.902 ± 0.677
5.435LeuAsn: 5.435 ± 1.307
4.62LeuPro: 4.62 ± 0.929
3.261LeuGln: 3.261 ± 0.667
3.804LeuArg: 3.804 ± 1.011
7.609LeuSer: 7.609 ± 2.192
4.62LeuThr: 4.62 ± 1.819
5.435LeuVal: 5.435 ± 0.997
0.815LeuTrp: 0.815 ± 0.41
4.348LeuTyr: 4.348 ± 3.294
0.0LeuXaa: 0.0 ± 0.0
Met
0.543MetAla: 0.543 ± 0.505
0.543MetCys: 0.543 ± 0.273
0.543MetAsp: 0.543 ± 0.711
0.815MetGlu: 0.815 ± 0.41
1.63MetPhe: 1.63 ± 0.751
0.272MetGly: 0.272 ± 0.137
0.0MetHis: 0.0 ± 0.0
1.63MetIle: 1.63 ± 0.597
1.63MetLys: 1.63 ± 0.597
1.63MetLeu: 1.63 ± 0.82
0.543MetMet: 0.543 ± 0.505
0.543MetAsn: 0.543 ± 0.505
0.543MetPro: 0.543 ± 0.679
0.0MetGln: 0.0 ± 0.0
0.815MetArg: 0.815 ± 0.41
2.446MetSer: 2.446 ± 1.018
0.543MetThr: 0.543 ± 0.273
0.543MetVal: 0.543 ± 0.273
0.0MetTrp: 0.0 ± 0.0
1.359MetTyr: 1.359 ± 0.59
0.0MetXaa: 0.0 ± 0.0
Asn
4.076AsnAla: 4.076 ± 1.565
1.63AsnCys: 1.63 ± 1.26
1.087AsnAsp: 1.087 ± 0.376
1.087AsnGlu: 1.087 ± 0.447
4.076AsnPhe: 4.076 ± 1.123
2.717AsnGly: 2.717 ± 1.013
2.174AsnHis: 2.174 ± 0.752
3.804AsnIle: 3.804 ± 0.8
4.076AsnLys: 4.076 ± 1.398
4.076AsnLeu: 4.076 ± 1.172
0.543AsnMet: 0.543 ± 0.438
3.261AsnAsn: 3.261 ± 0.532
1.902AsnPro: 1.902 ± 0.677
2.174AsnGln: 2.174 ± 0.495
2.174AsnArg: 2.174 ± 1.093
4.62AsnSer: 4.62 ± 0.688
3.804AsnThr: 3.804 ± 1.11
4.891AsnVal: 4.891 ± 1.185
0.272AsnTrp: 0.272 ± 0.137
1.087AsnTyr: 1.087 ± 0.546
0.0AsnXaa: 0.0 ± 0.0
Pro
2.174ProAla: 2.174 ± 0.947
1.63ProCys: 1.63 ± 0.616
4.076ProAsp: 4.076 ± 0.91
2.717ProGlu: 2.717 ± 0.78
2.717ProPhe: 2.717 ± 1.815
1.902ProGly: 1.902 ± 0.701
1.902ProHis: 1.902 ± 0.701
3.261ProIle: 3.261 ± 0.896
3.533ProLys: 3.533 ± 1.258
3.533ProLeu: 3.533 ± 1.624
0.543ProMet: 0.543 ± 0.273
2.989ProAsn: 2.989 ± 1.136
2.174ProPro: 2.174 ± 3.279
2.717ProGln: 2.717 ± 1.933
2.174ProArg: 2.174 ± 1.474
2.446ProSer: 2.446 ± 0.934
1.087ProThr: 1.087 ± 0.447
5.978ProVal: 5.978 ± 2.573
0.0ProTrp: 0.0 ± 0.0
1.087ProTyr: 1.087 ± 0.447
0.0ProXaa: 0.0 ± 0.0
Gln
0.543GlnAla: 0.543 ± 0.826
0.543GlnCys: 0.543 ± 0.354
3.261GlnAsp: 3.261 ± 0.845
0.543GlnGlu: 0.543 ± 0.273
0.815GlnPhe: 0.815 ± 1.163
1.902GlnGly: 1.902 ± 0.474
0.815GlnHis: 0.815 ± 0.718
3.261GlnIle: 3.261 ± 1.072
1.359GlnLys: 1.359 ± 2.149
1.359GlnLeu: 1.359 ± 0.453
0.543GlnMet: 0.543 ± 0.438
2.174GlnAsn: 2.174 ± 0.992
0.815GlnPro: 0.815 ± 0.47
1.087GlnGln: 1.087 ± 0.918
0.815GlnArg: 0.815 ± 1.07
2.446GlnSer: 2.446 ± 0.987
2.446GlnThr: 2.446 ± 0.54
0.543GlnVal: 0.543 ± 0.273
0.0GlnTrp: 0.0 ± 0.0
1.359GlnTyr: 1.359 ± 0.453
0.0GlnXaa: 0.0 ± 0.0
Arg
1.087ArgAla: 1.087 ± 0.474
0.815ArgCys: 0.815 ± 0.531
1.359ArgAsp: 1.359 ± 0.679
2.446ArgGlu: 2.446 ± 1.052
2.717ArgPhe: 2.717 ± 1.492
1.63ArgGly: 1.63 ± 0.677
1.359ArgHis: 1.359 ± 0.683
2.989ArgIle: 2.989 ± 0.64
5.163ArgLys: 5.163 ± 1.604
3.533ArgLeu: 3.533 ± 0.83
0.543ArgMet: 0.543 ± 0.993
2.174ArgAsn: 2.174 ± 0.662
2.717ArgPro: 2.717 ± 0.494
1.902ArgGln: 1.902 ± 0.474
2.174ArgArg: 2.174 ± 1.75
2.717ArgSer: 2.717 ± 1.721
1.63ArgThr: 1.63 ± 0.534
2.717ArgVal: 2.717 ± 0.794
1.087ArgTrp: 1.087 ± 0.474
2.446ArgTyr: 2.446 ± 0.821
0.0ArgXaa: 0.0 ± 0.0
Ser
4.891SerAla: 4.891 ± 0.78
1.63SerCys: 1.63 ± 1.329
8.424SerAsp: 8.424 ± 1.277
1.902SerGlu: 1.902 ± 0.749
5.163SerPhe: 5.163 ± 1.513
2.446SerGly: 2.446 ± 0.725
1.902SerHis: 1.902 ± 0.745
3.533SerIle: 3.533 ± 1.22
7.337SerLys: 7.337 ± 2.81
8.967SerLeu: 8.967 ± 2.194
0.815SerMet: 0.815 ± 0.514
4.076SerAsn: 4.076 ± 1.23
2.446SerPro: 2.446 ± 0.601
2.174SerGln: 2.174 ± 0.555
2.717SerArg: 2.717 ± 0.822
6.25SerSer: 6.25 ± 1.412
4.076SerThr: 4.076 ± 1.87
4.348SerVal: 4.348 ± 1.576
0.0SerTrp: 0.0 ± 0.0
3.533SerTyr: 3.533 ± 1.279
0.0SerXaa: 0.0 ± 0.0
Thr
1.087ThrAla: 1.087 ± 1.515
1.087ThrCys: 1.087 ± 0.595
1.63ThrAsp: 1.63 ± 0.821
1.902ThrGlu: 1.902 ± 0.658
2.174ThrPhe: 2.174 ± 0.684
4.076ThrGly: 4.076 ± 1.352
0.815ThrHis: 0.815 ± 0.785
4.076ThrIle: 4.076 ± 1.298
2.446ThrLys: 2.446 ± 0.987
5.163ThrLeu: 5.163 ± 3.959
1.902ThrMet: 1.902 ± 0.687
2.717ThrAsn: 2.717 ± 0.934
1.902ThrPro: 1.902 ± 0.433
1.902ThrGln: 1.902 ± 0.895
2.446ThrArg: 2.446 ± 0.635
5.707ThrSer: 5.707 ± 1.525
4.348ThrThr: 4.348 ± 1.576
5.163ThrVal: 5.163 ± 0.899
0.543ThrTrp: 0.543 ± 0.273
2.717ThrTyr: 2.717 ± 0.906
0.0ThrXaa: 0.0 ± 0.0
Val
5.978ValAla: 5.978 ± 1.109
1.359ValCys: 1.359 ± 0.59
4.62ValAsp: 4.62 ± 0.624
4.076ValGlu: 4.076 ± 0.764
3.261ValPhe: 3.261 ± 0.896
1.902ValGly: 1.902 ± 0.474
2.174ValHis: 2.174 ± 0.502
5.163ValIle: 5.163 ± 1.384
6.25ValLys: 6.25 ± 1.398
5.435ValLeu: 5.435 ± 0.906
1.087ValMet: 1.087 ± 0.546
5.163ValAsn: 5.163 ± 1.654
4.891ValPro: 4.891 ± 1.271
1.63ValGln: 1.63 ± 0.82
3.804ValArg: 3.804 ± 0.524
7.065ValSer: 7.065 ± 1.603
3.804ValThr: 3.804 ± 0.938
5.435ValVal: 5.435 ± 0.696
0.272ValTrp: 0.272 ± 0.417
2.989ValTyr: 2.989 ± 2.358
0.0ValXaa: 0.0 ± 0.0
Trp
0.543TrpAla: 0.543 ± 0.273
0.0TrpCys: 0.0 ± 0.0
0.543TrpAsp: 0.543 ± 0.273
0.543TrpGlu: 0.543 ± 0.273
0.543TrpPhe: 0.543 ± 0.273
0.0TrpGly: 0.0 ± 0.0
0.272TrpHis: 0.272 ± 0.137
1.087TrpIle: 1.087 ± 1.123
0.543TrpLys: 0.543 ± 0.273
0.543TrpLeu: 0.543 ± 0.273
0.0TrpMet: 0.0 ± 0.0
0.272TrpAsn: 0.272 ± 0.137
0.0TrpPro: 0.0 ± 0.0
0.543TrpGln: 0.543 ± 0.711
0.543TrpArg: 0.543 ± 0.273
0.272TrpSer: 0.272 ± 0.137
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.261TyrAla: 3.261 ± 0.658
0.543TyrCys: 0.543 ± 1.18
4.348TyrAsp: 4.348 ± 0.368
3.261TyrGlu: 3.261 ± 1.107
3.261TyrPhe: 3.261 ± 0.905
0.815TyrGly: 0.815 ± 0.785
1.359TyrHis: 1.359 ± 1.315
2.989TyrIle: 2.989 ± 0.833
2.717TyrLys: 2.717 ± 0.617
4.62TyrLeu: 4.62 ± 1.416
1.359TyrMet: 1.359 ± 0.598
1.359TyrAsn: 1.359 ± 0.59
1.902TyrPro: 1.902 ± 0.433
0.543TyrGln: 0.543 ± 0.354
1.359TyrArg: 1.359 ± 0.941
3.261TyrSer: 3.261 ± 0.718
1.63TyrThr: 1.63 ± 1.329
3.261TyrVal: 3.261 ± 0.976
0.272TyrTrp: 0.272 ± 0.572
1.087TyrTyr: 1.087 ± 0.376
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3681 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski