Amino acid dipepetide frequency for Halogeometricum pleomorphic virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.529AlaAla: 11.529 ± 2.894
1.356AlaCys: 1.356 ± 0.485
6.443AlaAsp: 6.443 ± 1.29
6.104AlaGlu: 6.104 ± 1.224
6.104AlaPhe: 6.104 ± 1.422
6.782AlaGly: 6.782 ± 1.043
2.374AlaHis: 2.374 ± 0.857
4.069AlaIle: 4.069 ± 1.199
4.408AlaLys: 4.408 ± 1.368
9.156AlaLeu: 9.156 ± 1.327
1.356AlaMet: 1.356 ± 0.612
3.052AlaAsn: 3.052 ± 0.904
3.391AlaPro: 3.391 ± 1.647
2.035AlaGln: 2.035 ± 0.741
7.799AlaArg: 7.799 ± 1.791
7.46AlaSer: 7.46 ± 1.176
4.408AlaThr: 4.408 ± 0.965
9.156AlaVal: 9.156 ± 0.957
1.695AlaTrp: 1.695 ± 0.618
2.374AlaTyr: 2.374 ± 0.819
0.0AlaXaa: 0.0 ± 0.0
Cys
0.339CysAla: 0.339 ± 0.254
0.0CysCys: 0.0 ± 0.0
0.678CysAsp: 0.678 ± 0.35
0.339CysGlu: 0.339 ± 0.29
0.0CysPhe: 0.0 ± 0.0
2.035CysGly: 2.035 ± 1.091
0.339CysHis: 0.339 ± 0.303
0.0CysIle: 0.0 ± 0.0
0.339CysLys: 0.339 ± 0.358
0.339CysLeu: 0.339 ± 0.254
0.339CysMet: 0.339 ± 0.33
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.339CysGln: 0.339 ± 0.358
0.339CysArg: 0.339 ± 0.29
0.678CysSer: 0.678 ± 0.43
0.339CysThr: 0.339 ± 0.42
0.0CysVal: 0.0 ± 0.0
0.339CysTrp: 0.339 ± 0.367
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.121AspAla: 7.121 ± 2.045
0.339AspCys: 0.339 ± 0.303
6.782AspAsp: 6.782 ± 1.179
5.426AspGlu: 5.426 ± 1.144
1.356AspPhe: 1.356 ± 0.599
9.156AspGly: 9.156 ± 2.139
0.678AspHis: 0.678 ± 0.372
2.035AspIle: 2.035 ± 0.897
1.695AspLys: 1.695 ± 1.003
7.121AspLeu: 7.121 ± 0.913
1.356AspMet: 1.356 ± 0.546
3.052AspAsn: 3.052 ± 1.333
3.391AspPro: 3.391 ± 0.834
2.035AspGln: 2.035 ± 0.868
5.086AspArg: 5.086 ± 0.956
4.069AspSer: 4.069 ± 0.907
5.426AspThr: 5.426 ± 1.233
7.46AspVal: 7.46 ± 1.56
1.356AspTrp: 1.356 ± 0.864
2.374AspTyr: 2.374 ± 0.488
0.0AspXaa: 0.0 ± 0.0
Glu
5.086GluAla: 5.086 ± 0.71
1.356GluCys: 1.356 ± 1.14
8.477GluAsp: 8.477 ± 1.399
4.069GluGlu: 4.069 ± 1.268
3.391GluPhe: 3.391 ± 1.108
3.73GluGly: 3.73 ± 1.268
0.339GluHis: 0.339 ± 0.254
3.391GluIle: 3.391 ± 0.701
3.052GluLys: 3.052 ± 0.954
6.443GluLeu: 6.443 ± 0.781
2.713GluMet: 2.713 ± 1.186
2.035GluAsn: 2.035 ± 0.851
3.73GluPro: 3.73 ± 1.745
2.374GluGln: 2.374 ± 0.851
4.408GluArg: 4.408 ± 0.971
4.408GluSer: 4.408 ± 1.117
6.104GluThr: 6.104 ± 1.34
3.73GluVal: 3.73 ± 1.163
1.695GluTrp: 1.695 ± 0.661
2.035GluTyr: 2.035 ± 0.578
0.0GluXaa: 0.0 ± 0.0
Phe
4.069PheAla: 4.069 ± 0.694
0.339PheCys: 0.339 ± 0.37
3.391PheAsp: 3.391 ± 1.753
3.052PheGlu: 3.052 ± 0.755
0.339PhePhe: 0.339 ± 0.29
2.374PheGly: 2.374 ± 1.001
0.339PheHis: 0.339 ± 0.336
1.017PheIle: 1.017 ± 0.687
0.339PheLys: 0.339 ± 0.331
3.052PheLeu: 3.052 ± 1.099
0.0PheMet: 0.0 ± 0.0
1.017PheAsn: 1.017 ± 0.909
0.339PhePro: 0.339 ± 0.37
0.339PheGln: 0.339 ± 0.288
4.069PheArg: 4.069 ± 1.167
2.035PheSer: 2.035 ± 0.6
1.356PheThr: 1.356 ± 0.528
4.408PheVal: 4.408 ± 0.825
0.339PheTrp: 0.339 ± 0.336
0.678PheTyr: 0.678 ± 0.422
0.0PheXaa: 0.0 ± 0.0
Gly
10.173GlyAla: 10.173 ± 1.728
0.678GlyCys: 0.678 ± 0.528
5.086GlyAsp: 5.086 ± 0.911
6.443GlyGlu: 6.443 ± 1.093
2.374GlyPhe: 2.374 ± 0.764
8.817GlyGly: 8.817 ± 1.962
2.035GlyHis: 2.035 ± 0.871
2.374GlyIle: 2.374 ± 0.738
1.017GlyLys: 1.017 ± 0.536
5.086GlyLeu: 5.086 ± 1.266
2.374GlyMet: 2.374 ± 0.941
3.391GlyAsn: 3.391 ± 1.026
3.73GlyPro: 3.73 ± 1.049
2.713GlyGln: 2.713 ± 0.896
6.782GlyArg: 6.782 ± 1.204
7.121GlySer: 7.121 ± 1.236
5.086GlyThr: 5.086 ± 2.404
5.765GlyVal: 5.765 ± 1.1
1.695GlyTrp: 1.695 ± 0.763
3.391GlyTyr: 3.391 ± 0.874
0.0GlyXaa: 0.0 ± 0.0
His
2.035HisAla: 2.035 ± 0.812
0.0HisCys: 0.0 ± 0.0
1.356HisAsp: 1.356 ± 0.647
1.695HisGlu: 1.695 ± 0.548
0.339HisPhe: 0.339 ± 0.288
1.695HisGly: 1.695 ± 0.502
0.678HisHis: 0.678 ± 0.528
0.678HisIle: 0.678 ± 0.508
0.0HisLys: 0.0 ± 0.0
1.356HisLeu: 1.356 ± 0.649
0.678HisMet: 0.678 ± 0.379
0.0HisAsn: 0.0 ± 0.0
1.356HisPro: 1.356 ± 0.718
0.339HisGln: 0.339 ± 0.29
0.678HisArg: 0.678 ± 0.506
1.017HisSer: 1.017 ± 0.433
2.035HisThr: 2.035 ± 0.476
2.035HisVal: 2.035 ± 1.013
0.678HisTrp: 0.678 ± 0.46
1.695HisTyr: 1.695 ± 0.475
0.0HisXaa: 0.0 ± 0.0
Ile
5.086IleAla: 5.086 ± 1.368
0.339IleCys: 0.339 ± 0.358
3.73IleAsp: 3.73 ± 1.018
3.052IleGlu: 3.052 ± 0.882
0.678IlePhe: 0.678 ± 0.457
3.391IleGly: 3.391 ± 1.35
0.0IleHis: 0.0 ± 0.0
1.017IleIle: 1.017 ± 0.732
0.678IleLys: 0.678 ± 0.398
2.713IleLeu: 2.713 ± 1.259
0.678IleMet: 0.678 ± 0.606
1.017IleAsn: 1.017 ± 0.909
1.356IlePro: 1.356 ± 0.725
1.695IleGln: 1.695 ± 0.746
1.017IleArg: 1.017 ± 0.473
2.035IleSer: 2.035 ± 1.102
2.035IleThr: 2.035 ± 0.78
4.747IleVal: 4.747 ± 0.956
0.339IleTrp: 0.339 ± 0.37
0.339IleTyr: 0.339 ± 0.254
0.0IleXaa: 0.0 ± 0.0
Lys
4.408LysAla: 4.408 ± 0.93
0.0LysCys: 0.0 ± 0.0
3.052LysAsp: 3.052 ± 0.785
1.017LysGlu: 1.017 ± 0.588
0.339LysPhe: 0.339 ± 0.336
2.035LysGly: 2.035 ± 0.727
0.339LysHis: 0.339 ± 0.29
2.035LysIle: 2.035 ± 0.865
2.374LysLys: 2.374 ± 1.0
1.356LysLeu: 1.356 ± 0.677
1.695LysMet: 1.695 ± 0.77
2.035LysAsn: 2.035 ± 0.602
1.017LysPro: 1.017 ± 0.396
1.695LysGln: 1.695 ± 0.718
2.374LysArg: 2.374 ± 0.875
2.035LysSer: 2.035 ± 0.708
3.391LysThr: 3.391 ± 0.73
1.356LysVal: 1.356 ± 0.635
0.678LysTrp: 0.678 ± 0.372
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
9.495LeuAla: 9.495 ± 1.748
0.339LeuCys: 0.339 ± 0.254
4.069LeuAsp: 4.069 ± 0.975
4.747LeuGlu: 4.747 ± 1.769
1.017LeuPhe: 1.017 ± 0.544
5.765LeuGly: 5.765 ± 1.007
1.356LeuHis: 1.356 ± 0.653
1.695LeuIle: 1.695 ± 0.652
2.713LeuLys: 2.713 ± 0.921
6.104LeuLeu: 6.104 ± 1.036
1.356LeuMet: 1.356 ± 0.429
2.035LeuAsn: 2.035 ± 0.843
2.374LeuPro: 2.374 ± 0.971
3.052LeuGln: 3.052 ± 0.649
4.747LeuArg: 4.747 ± 0.942
6.443LeuSer: 6.443 ± 1.11
5.765LeuThr: 5.765 ± 1.004
7.121LeuVal: 7.121 ± 1.987
0.678LeuTrp: 0.678 ± 0.432
2.713LeuTyr: 2.713 ± 0.913
0.0LeuXaa: 0.0 ± 0.0
Met
1.356MetAla: 1.356 ± 0.544
0.0MetCys: 0.0 ± 0.0
2.374MetAsp: 2.374 ± 0.924
1.695MetGlu: 1.695 ± 0.714
0.678MetPhe: 0.678 ± 0.545
1.695MetGly: 1.695 ± 0.845
0.339MetHis: 0.339 ± 0.303
0.0MetIle: 0.0 ± 0.0
2.374MetLys: 2.374 ± 1.237
1.356MetLeu: 1.356 ± 0.58
0.678MetMet: 0.678 ± 0.386
1.017MetAsn: 1.017 ± 0.522
0.339MetPro: 0.339 ± 0.336
1.017MetGln: 1.017 ± 0.376
1.356MetArg: 1.356 ± 0.807
2.713MetSer: 2.713 ± 1.026
1.695MetThr: 1.695 ± 0.665
1.017MetVal: 1.017 ± 0.435
0.0MetTrp: 0.0 ± 0.0
0.339MetTyr: 0.339 ± 0.303
0.0MetXaa: 0.0 ± 0.0
Asn
2.374AsnAla: 2.374 ± 0.909
0.339AsnCys: 0.339 ± 0.303
3.052AsnAsp: 3.052 ± 0.954
2.713AsnGlu: 2.713 ± 0.887
0.678AsnPhe: 0.678 ± 0.606
4.408AsnGly: 4.408 ± 1.144
2.035AsnHis: 2.035 ± 0.845
2.035AsnIle: 2.035 ± 0.932
1.017AsnLys: 1.017 ± 0.508
1.356AsnLeu: 1.356 ± 0.568
0.678AsnMet: 0.678 ± 0.602
1.017AsnAsn: 1.017 ± 0.718
0.678AsnPro: 0.678 ± 0.417
1.695AsnGln: 1.695 ± 0.99
1.356AsnArg: 1.356 ± 0.661
3.052AsnSer: 3.052 ± 1.506
2.713AsnThr: 2.713 ± 1.486
2.713AsnVal: 2.713 ± 0.798
0.339AsnTrp: 0.339 ± 0.303
2.035AsnTyr: 2.035 ± 0.638
0.0AsnXaa: 0.0 ± 0.0
Pro
3.73ProAla: 3.73 ± 1.315
0.0ProCys: 0.0 ± 0.0
2.713ProAsp: 2.713 ± 1.087
2.713ProGlu: 2.713 ± 0.949
1.017ProPhe: 1.017 ± 0.59
5.426ProGly: 5.426 ± 0.939
1.017ProHis: 1.017 ± 0.348
1.017ProIle: 1.017 ± 0.548
1.695ProLys: 1.695 ± 0.813
1.695ProLeu: 1.695 ± 1.028
0.339ProMet: 0.339 ± 0.336
0.678ProAsn: 0.678 ± 0.589
1.017ProPro: 1.017 ± 0.51
1.356ProGln: 1.356 ± 0.62
1.695ProArg: 1.695 ± 0.85
3.052ProSer: 3.052 ± 1.02
3.052ProThr: 3.052 ± 1.24
2.374ProVal: 2.374 ± 0.652
0.678ProTrp: 0.678 ± 0.5
1.695ProTyr: 1.695 ± 0.574
0.0ProXaa: 0.0 ± 0.0
Gln
2.035GlnAla: 2.035 ± 0.645
0.0GlnCys: 0.0 ± 0.0
1.356GlnAsp: 1.356 ± 0.841
1.017GlnGlu: 1.017 ± 0.51
1.695GlnPhe: 1.695 ± 0.73
2.713GlnGly: 2.713 ± 1.171
0.678GlnHis: 0.678 ± 0.49
0.678GlnIle: 0.678 ± 0.534
0.678GlnLys: 0.678 ± 0.46
3.052GlnLeu: 3.052 ± 0.807
0.678GlnMet: 0.678 ± 0.446
1.017GlnAsn: 1.017 ± 0.646
1.017GlnPro: 1.017 ± 0.837
1.017GlnGln: 1.017 ± 0.376
1.356GlnArg: 1.356 ± 0.622
4.747GlnSer: 4.747 ± 1.055
2.374GlnThr: 2.374 ± 0.638
3.052GlnVal: 3.052 ± 0.576
0.678GlnTrp: 0.678 ± 0.397
0.678GlnTyr: 0.678 ± 0.432
0.0GlnXaa: 0.0 ± 0.0
Arg
5.086ArgAla: 5.086 ± 1.435
0.339ArgCys: 0.339 ± 0.358
7.121ArgAsp: 7.121 ± 2.398
5.426ArgGlu: 5.426 ± 1.249
2.374ArgPhe: 2.374 ± 0.797
4.069ArgGly: 4.069 ± 1.108
2.713ArgHis: 2.713 ± 1.085
3.391ArgIle: 3.391 ± 1.015
1.017ArgLys: 1.017 ± 0.45
5.086ArgLeu: 5.086 ± 1.048
1.017ArgMet: 1.017 ± 0.54
0.678ArgAsn: 0.678 ± 0.386
2.035ArgPro: 2.035 ± 1.249
1.356ArgGln: 1.356 ± 0.564
3.391ArgArg: 3.391 ± 1.31
6.104ArgSer: 6.104 ± 1.434
3.391ArgThr: 3.391 ± 0.964
2.374ArgVal: 2.374 ± 0.76
1.356ArgTrp: 1.356 ± 0.893
2.713ArgTyr: 2.713 ± 0.697
0.0ArgXaa: 0.0 ± 0.0
Ser
5.765SerAla: 5.765 ± 1.048
0.0SerCys: 0.0 ± 0.0
4.069SerAsp: 4.069 ± 1.184
8.138SerGlu: 8.138 ± 1.7
2.374SerPhe: 2.374 ± 0.584
4.747SerGly: 4.747 ± 1.748
1.356SerHis: 1.356 ± 0.725
3.052SerIle: 3.052 ± 1.16
4.747SerLys: 4.747 ± 0.801
5.086SerLeu: 5.086 ± 0.974
1.017SerMet: 1.017 ± 0.733
4.408SerAsn: 4.408 ± 1.237
4.069SerPro: 4.069 ± 0.816
3.052SerGln: 3.052 ± 1.315
3.391SerArg: 3.391 ± 0.983
5.765SerSer: 5.765 ± 1.757
4.408SerThr: 4.408 ± 2.215
6.104SerVal: 6.104 ± 1.716
2.035SerTrp: 2.035 ± 0.691
2.374SerTyr: 2.374 ± 0.764
0.0SerXaa: 0.0 ± 0.0
Thr
7.46ThrAla: 7.46 ± 1.68
0.0ThrCys: 0.0 ± 0.0
5.765ThrAsp: 5.765 ± 1.083
5.765ThrGlu: 5.765 ± 1.217
3.391ThrPhe: 3.391 ± 0.881
4.408ThrGly: 4.408 ± 1.988
1.356ThrHis: 1.356 ± 0.97
2.035ThrIle: 2.035 ± 0.822
1.695ThrLys: 1.695 ± 0.627
5.086ThrLeu: 5.086 ± 1.24
2.374ThrMet: 2.374 ± 0.88
4.069ThrAsn: 4.069 ± 1.532
2.713ThrPro: 2.713 ± 0.784
1.356ThrGln: 1.356 ± 0.979
3.052ThrArg: 3.052 ± 0.724
3.73ThrSer: 3.73 ± 0.94
6.443ThrThr: 6.443 ± 1.956
5.765ThrVal: 5.765 ± 0.997
0.678ThrTrp: 0.678 ± 0.379
2.374ThrTyr: 2.374 ± 1.269
0.0ThrXaa: 0.0 ± 0.0
Val
9.156ValAla: 9.156 ± 1.977
0.678ValCys: 0.678 ± 0.451
5.426ValAsp: 5.426 ± 1.699
5.765ValGlu: 5.765 ± 1.441
2.713ValPhe: 2.713 ± 1.233
8.817ValGly: 8.817 ± 1.508
1.356ValHis: 1.356 ± 0.907
3.73ValIle: 3.73 ± 1.205
2.374ValLys: 2.374 ± 0.741
5.086ValLeu: 5.086 ± 1.618
1.017ValMet: 1.017 ± 0.631
3.391ValAsn: 3.391 ± 0.84
2.035ValPro: 2.035 ± 0.708
1.695ValGln: 1.695 ± 0.718
5.765ValArg: 5.765 ± 1.741
6.782ValSer: 6.782 ± 1.294
5.765ValThr: 5.765 ± 1.285
4.408ValVal: 4.408 ± 1.613
0.678ValTrp: 0.678 ± 0.372
1.695ValTyr: 1.695 ± 0.594
0.0ValXaa: 0.0 ± 0.0
Trp
1.356TrpAla: 1.356 ± 0.575
0.339TrpCys: 0.339 ± 0.29
1.356TrpAsp: 1.356 ± 0.776
1.356TrpGlu: 1.356 ± 0.403
0.0TrpPhe: 0.0 ± 0.0
2.374TrpGly: 2.374 ± 1.134
0.339TrpHis: 0.339 ± 0.33
0.678TrpIle: 0.678 ± 0.406
0.678TrpLys: 0.678 ± 0.416
1.017TrpLeu: 1.017 ± 0.522
0.678TrpMet: 0.678 ± 0.473
1.017TrpAsn: 1.017 ± 0.544
0.678TrpPro: 0.678 ± 0.58
0.0TrpGln: 0.0 ± 0.0
0.678TrpArg: 0.678 ± 0.484
1.017TrpSer: 1.017 ± 0.421
1.017TrpThr: 1.017 ± 0.487
2.035TrpVal: 2.035 ± 0.624
0.339TrpTrp: 0.339 ± 0.336
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.391TyrAla: 3.391 ± 0.91
0.339TyrCys: 0.339 ± 0.254
1.017TyrAsp: 1.017 ± 0.579
2.035TyrGlu: 2.035 ± 0.641
2.035TyrPhe: 2.035 ± 0.68
2.035TyrGly: 2.035 ± 0.653
0.678TyrHis: 0.678 ± 0.58
1.017TyrIle: 1.017 ± 0.501
0.339TyrLys: 0.339 ± 0.254
2.035TyrLeu: 2.035 ± 0.704
0.678TyrMet: 0.678 ± 0.606
1.356TyrAsn: 1.356 ± 0.6
1.695TyrPro: 1.695 ± 0.917
1.356TyrGln: 1.356 ± 0.564
1.695TyrArg: 1.695 ± 0.677
1.695TyrSer: 1.695 ± 1.216
2.713TyrThr: 2.713 ± 1.181
2.713TyrVal: 2.713 ± 0.799
0.678TyrTrp: 0.678 ± 0.517
3.052TyrTyr: 3.052 ± 1.264
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (2950 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski