Amino acid dipepetide frequency for Halorubrum pleomorphic virus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.71AlaAla: 12.71 ± 2.854
0.374AlaCys: 0.374 ± 0.414
6.729AlaAsp: 6.729 ± 1.317
4.86AlaGlu: 4.86 ± 1.564
2.991AlaPhe: 2.991 ± 1.02
8.972AlaGly: 8.972 ± 1.418
1.121AlaHis: 1.121 ± 0.627
5.981AlaIle: 5.981 ± 2.157
2.991AlaLys: 2.991 ± 0.94
5.234AlaLeu: 5.234 ± 1.208
1.869AlaMet: 1.869 ± 0.633
1.869AlaAsn: 1.869 ± 0.628
2.243AlaPro: 2.243 ± 0.576
2.617AlaGln: 2.617 ± 0.583
6.355AlaArg: 6.355 ± 1.104
7.103AlaSer: 7.103 ± 2.038
6.355AlaThr: 6.355 ± 0.809
8.598AlaVal: 8.598 ± 2.162
1.121AlaTrp: 1.121 ± 0.589
1.121AlaTyr: 1.121 ± 0.553
0.0AlaXaa: 0.0 ± 0.0
Cys
0.374CysAla: 0.374 ± 0.414
0.0CysCys: 0.0 ± 0.0
0.374CysAsp: 0.374 ± 0.414
0.0CysGlu: 0.0 ± 0.0
0.374CysPhe: 0.374 ± 0.414
0.374CysGly: 0.374 ± 0.27
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.374CysLys: 0.374 ± 0.421
0.374CysLeu: 0.374 ± 0.414
0.0CysMet: 0.0 ± 0.0
0.374CysAsn: 0.374 ± 0.27
0.374CysPro: 0.374 ± 0.421
0.374CysGln: 0.374 ± 0.27
0.748CysArg: 0.748 ± 0.508
1.121CysSer: 1.121 ± 0.675
1.121CysThr: 1.121 ± 0.811
0.374CysVal: 0.374 ± 0.414
0.374CysTrp: 0.374 ± 0.27
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.85AspAla: 7.85 ± 0.994
0.0AspCys: 0.0 ± 0.0
8.224AspAsp: 8.224 ± 2.873
4.486AspGlu: 4.486 ± 0.977
2.617AspPhe: 2.617 ± 1.006
10.467AspGly: 10.467 ± 2.027
1.121AspHis: 1.121 ± 0.811
0.748AspIle: 0.748 ± 0.476
1.495AspLys: 1.495 ± 0.72
11.215AspLeu: 11.215 ± 2.083
1.869AspMet: 1.869 ± 0.727
1.121AspAsn: 1.121 ± 0.553
5.607AspPro: 5.607 ± 0.794
0.748AspGln: 0.748 ± 0.389
6.355AspArg: 6.355 ± 1.546
7.85AspSer: 7.85 ± 1.028
5.234AspThr: 5.234 ± 1.375
8.598AspVal: 8.598 ± 2.013
1.869AspTrp: 1.869 ± 0.44
2.243AspTyr: 2.243 ± 0.807
0.0AspXaa: 0.0 ± 0.0
Glu
4.86GluAla: 4.86 ± 1.113
0.748GluCys: 0.748 ± 0.524
3.738GluAsp: 3.738 ± 1.213
4.86GluGlu: 4.86 ± 1.609
1.869GluPhe: 1.869 ± 0.566
5.234GluGly: 5.234 ± 0.998
1.869GluHis: 1.869 ± 0.723
3.364GluIle: 3.364 ± 0.564
2.617GluLys: 2.617 ± 0.954
5.607GluLeu: 5.607 ± 1.244
2.243GluMet: 2.243 ± 0.946
1.495GluAsn: 1.495 ± 0.766
2.617GluPro: 2.617 ± 0.553
4.112GluGln: 4.112 ± 1.541
3.738GluArg: 3.738 ± 0.976
2.617GluSer: 2.617 ± 0.924
7.85GluThr: 7.85 ± 1.209
4.486GluVal: 4.486 ± 1.625
1.121GluTrp: 1.121 ± 0.553
0.748GluTyr: 0.748 ± 0.389
0.0GluXaa: 0.0 ± 0.0
Phe
2.991PheAla: 2.991 ± 0.848
0.374PheCys: 0.374 ± 0.27
2.991PheAsp: 2.991 ± 1.05
1.121PheGlu: 1.121 ± 0.426
1.121PhePhe: 1.121 ± 0.583
3.364PheGly: 3.364 ± 0.866
1.495PheHis: 1.495 ± 0.605
1.495PheIle: 1.495 ± 0.814
0.374PheLys: 0.374 ± 0.385
2.617PheLeu: 2.617 ± 0.651
0.374PheMet: 0.374 ± 0.46
0.748PheAsn: 0.748 ± 0.683
1.121PhePro: 1.121 ± 0.464
0.374PheGln: 0.374 ± 0.313
3.364PheArg: 3.364 ± 1.552
0.748PheSer: 0.748 ± 0.625
2.243PheThr: 2.243 ± 1.129
2.617PheVal: 2.617 ± 0.74
0.748PheTrp: 0.748 ± 0.377
0.374PheTyr: 0.374 ± 0.313
0.0PheXaa: 0.0 ± 0.0
Gly
4.86GlyAla: 4.86 ± 2.198
0.374GlyCys: 0.374 ± 0.414
8.224GlyAsp: 8.224 ± 2.186
3.364GlyGlu: 3.364 ± 0.96
2.617GlyPhe: 2.617 ± 0.624
8.224GlyGly: 8.224 ± 1.717
1.495GlyHis: 1.495 ± 0.524
2.991GlyIle: 2.991 ± 0.703
2.617GlyLys: 2.617 ± 0.914
8.224GlyLeu: 8.224 ± 1.679
1.121GlyMet: 1.121 ± 0.325
3.364GlyAsn: 3.364 ± 0.649
3.738GlyPro: 3.738 ± 1.31
1.869GlyGln: 1.869 ± 0.45
4.486GlyArg: 4.486 ± 1.456
5.234GlySer: 5.234 ± 1.273
7.477GlyThr: 7.477 ± 1.532
4.486GlyVal: 4.486 ± 1.155
3.738GlyTrp: 3.738 ± 0.982
1.869GlyTyr: 1.869 ± 0.642
0.0GlyXaa: 0.0 ± 0.0
His
1.869HisAla: 1.869 ± 0.483
0.0HisCys: 0.0 ± 0.0
4.112HisAsp: 4.112 ± 1.757
0.748HisGlu: 0.748 ± 0.377
0.748HisPhe: 0.748 ± 0.372
1.121HisGly: 1.121 ± 0.811
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.869HisLys: 1.869 ± 0.756
0.748HisLeu: 0.748 ± 0.377
0.0HisMet: 0.0 ± 0.0
0.374HisAsn: 0.374 ± 0.313
0.748HisPro: 0.748 ± 0.429
0.374HisGln: 0.374 ± 0.27
1.121HisArg: 1.121 ± 0.611
0.374HisSer: 0.374 ± 0.414
0.748HisThr: 0.748 ± 0.389
0.748HisVal: 0.748 ± 0.446
0.0HisTrp: 0.0 ± 0.0
1.121HisTyr: 1.121 ± 0.609
0.0HisXaa: 0.0 ± 0.0
Ile
4.86IleAla: 4.86 ± 1.223
0.0IleCys: 0.0 ± 0.0
2.991IleAsp: 2.991 ± 0.766
4.486IleGlu: 4.486 ± 1.192
0.748IlePhe: 0.748 ± 0.443
3.364IleGly: 3.364 ± 0.685
0.374IleHis: 0.374 ± 0.324
0.374IleIle: 0.374 ± 0.341
1.121IleLys: 1.121 ± 0.464
1.495IleLeu: 1.495 ± 0.458
0.0IleMet: 0.0 ± 0.0
1.121IleAsn: 1.121 ± 0.358
1.495IlePro: 1.495 ± 0.731
0.748IleGln: 0.748 ± 0.426
2.991IleArg: 2.991 ± 1.427
3.364IleSer: 3.364 ± 0.831
4.486IleThr: 4.486 ± 1.647
2.243IleVal: 2.243 ± 0.87
0.374IleTrp: 0.374 ± 0.385
0.374IleTyr: 0.374 ± 0.324
0.0IleXaa: 0.0 ± 0.0
Lys
1.869LysAla: 1.869 ± 0.745
0.0LysCys: 0.0 ± 0.0
3.364LysAsp: 3.364 ± 0.867
1.495LysGlu: 1.495 ± 0.717
1.121LysPhe: 1.121 ± 0.358
1.495LysGly: 1.495 ± 0.613
0.0LysHis: 0.0 ± 0.0
0.374LysIle: 0.374 ± 0.313
1.869LysLys: 1.869 ± 1.047
2.617LysLeu: 2.617 ± 0.976
0.748LysMet: 0.748 ± 0.541
0.748LysAsn: 0.748 ± 0.429
1.869LysPro: 1.869 ± 0.806
1.121LysGln: 1.121 ± 0.358
2.991LysArg: 2.991 ± 1.07
2.243LysSer: 2.243 ± 0.965
2.991LysThr: 2.991 ± 1.004
1.869LysVal: 1.869 ± 1.02
0.374LysTrp: 0.374 ± 0.27
0.748LysTyr: 0.748 ± 0.303
0.0LysXaa: 0.0 ± 0.0
Leu
6.729LeuAla: 6.729 ± 1.88
0.748LeuCys: 0.748 ± 0.433
10.841LeuAsp: 10.841 ± 2.692
7.85LeuGlu: 7.85 ± 1.692
1.869LeuPhe: 1.869 ± 0.769
4.86LeuGly: 4.86 ± 1.178
1.121LeuHis: 1.121 ± 0.811
4.86LeuIle: 4.86 ± 1.098
2.243LeuLys: 2.243 ± 0.716
7.103LeuLeu: 7.103 ± 2.319
1.121LeuMet: 1.121 ± 0.436
4.112LeuAsn: 4.112 ± 0.775
2.617LeuPro: 2.617 ± 0.802
2.243LeuGln: 2.243 ± 0.628
6.729LeuArg: 6.729 ± 1.833
5.234LeuSer: 5.234 ± 0.828
4.486LeuThr: 4.486 ± 1.178
9.346LeuVal: 9.346 ± 1.694
0.0LeuTrp: 0.0 ± 0.0
4.112LeuTyr: 4.112 ± 1.431
0.0LeuXaa: 0.0 ± 0.0
Met
2.617MetAla: 2.617 ± 0.605
0.0MetCys: 0.0 ± 0.0
0.374MetAsp: 0.374 ± 0.313
0.374MetGlu: 0.374 ± 0.27
0.374MetPhe: 0.374 ± 0.313
1.121MetGly: 1.121 ± 0.517
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.748MetLys: 0.748 ± 0.492
1.869MetLeu: 1.869 ± 0.576
0.374MetMet: 0.374 ± 0.313
0.374MetAsn: 0.374 ± 0.424
1.121MetPro: 1.121 ± 0.482
0.0MetGln: 0.0 ± 0.0
0.748MetArg: 0.748 ± 0.508
3.364MetSer: 3.364 ± 0.816
1.869MetThr: 1.869 ± 0.48
0.748MetVal: 0.748 ± 0.625
0.0MetTrp: 0.0 ± 0.0
0.374MetTyr: 0.374 ± 0.27
0.0MetXaa: 0.0 ± 0.0
Asn
2.991AsnAla: 2.991 ± 0.988
0.374AsnCys: 0.374 ± 0.27
0.748AsnAsp: 0.748 ± 0.429
1.495AsnGlu: 1.495 ± 0.866
2.617AsnPhe: 2.617 ± 0.604
2.617AsnGly: 2.617 ± 0.603
0.0AsnHis: 0.0 ± 0.0
0.748AsnIle: 0.748 ± 0.426
0.748AsnLys: 0.748 ± 0.625
2.617AsnLeu: 2.617 ± 0.886
0.374AsnMet: 0.374 ± 0.298
1.495AsnAsn: 1.495 ± 0.539
1.121AsnPro: 1.121 ± 0.554
2.617AsnGln: 2.617 ± 0.826
1.869AsnArg: 1.869 ± 0.953
2.617AsnSer: 2.617 ± 0.831
4.86AsnThr: 4.86 ± 1.107
2.243AsnVal: 2.243 ± 1.295
0.374AsnTrp: 0.374 ± 0.27
1.121AsnTyr: 1.121 ± 0.35
0.0AsnXaa: 0.0 ± 0.0
Pro
3.738ProAla: 3.738 ± 1.147
0.0ProCys: 0.0 ± 0.0
4.112ProAsp: 4.112 ± 1.04
5.234ProGlu: 5.234 ± 1.582
0.748ProPhe: 0.748 ± 0.546
4.486ProGly: 4.486 ± 2.36
1.121ProHis: 1.121 ± 0.56
1.121ProIle: 1.121 ± 0.635
1.495ProLys: 1.495 ± 0.374
3.738ProLeu: 3.738 ± 0.676
0.374ProMet: 0.374 ± 0.324
0.748ProAsn: 0.748 ± 0.389
2.243ProPro: 2.243 ± 0.772
1.121ProGln: 1.121 ± 0.35
4.486ProArg: 4.486 ± 1.069
3.364ProSer: 3.364 ± 0.905
2.991ProThr: 2.991 ± 1.613
1.121ProVal: 1.121 ± 0.534
1.121ProTrp: 1.121 ± 0.325
0.748ProTyr: 0.748 ± 0.303
0.0ProXaa: 0.0 ± 0.0
Gln
1.121GlnAla: 1.121 ± 0.408
0.0GlnCys: 0.0 ± 0.0
1.869GlnAsp: 1.869 ± 0.568
2.243GlnGlu: 2.243 ± 0.504
1.495GlnPhe: 1.495 ± 0.75
0.748GlnGly: 0.748 ± 0.429
0.748GlnHis: 0.748 ± 0.347
0.748GlnIle: 0.748 ± 0.347
0.748GlnLys: 0.748 ± 0.541
5.234GlnLeu: 5.234 ± 0.864
0.374GlnMet: 0.374 ± 0.424
1.121GlnAsn: 1.121 ± 0.938
1.121GlnPro: 1.121 ± 0.675
1.495GlnGln: 1.495 ± 0.509
2.243GlnArg: 2.243 ± 1.337
3.738GlnSer: 3.738 ± 1.245
2.243GlnThr: 2.243 ± 0.842
1.495GlnVal: 1.495 ± 0.895
0.748GlnTrp: 0.748 ± 0.487
0.374GlnTyr: 0.374 ± 0.313
0.0GlnXaa: 0.0 ± 0.0
Arg
5.607ArgAla: 5.607 ± 2.089
0.748ArgCys: 0.748 ± 0.827
5.607ArgAsp: 5.607 ± 2.33
6.729ArgGlu: 6.729 ± 1.452
1.869ArgPhe: 1.869 ± 0.839
5.234ArgGly: 5.234 ± 1.598
1.869ArgHis: 1.869 ± 0.625
2.617ArgIle: 2.617 ± 1.083
0.748ArgLys: 0.748 ± 0.492
5.607ArgLeu: 5.607 ± 2.019
1.495ArgMet: 1.495 ± 0.49
3.364ArgAsn: 3.364 ± 0.861
3.364ArgPro: 3.364 ± 1.545
1.121ArgGln: 1.121 ± 0.641
5.234ArgArg: 5.234 ± 1.896
7.477ArgSer: 7.477 ± 3.723
5.234ArgThr: 5.234 ± 1.818
3.364ArgVal: 3.364 ± 1.388
0.748ArgTrp: 0.748 ± 0.446
1.869ArgTyr: 1.869 ± 0.789
0.0ArgXaa: 0.0 ± 0.0
Ser
7.85SerAla: 7.85 ± 1.328
1.121SerCys: 1.121 ± 0.953
7.85SerAsp: 7.85 ± 1.294
5.234SerGlu: 5.234 ± 0.947
1.869SerPhe: 1.869 ± 0.613
8.224SerGly: 8.224 ± 1.728
0.748SerHis: 0.748 ± 0.541
2.243SerIle: 2.243 ± 0.886
1.495SerLys: 1.495 ± 0.709
7.477SerLeu: 7.477 ± 2.149
1.121SerMet: 1.121 ± 0.517
2.991SerAsn: 2.991 ± 0.783
3.738SerPro: 3.738 ± 1.405
1.869SerGln: 1.869 ± 0.833
2.991SerArg: 2.991 ± 1.124
6.729SerSer: 6.729 ± 3.076
2.991SerThr: 2.991 ± 1.85
3.364SerVal: 3.364 ± 1.033
1.869SerTrp: 1.869 ± 0.78
3.738SerTyr: 3.738 ± 0.762
0.0SerXaa: 0.0 ± 0.0
Thr
7.85ThrAla: 7.85 ± 1.42
1.121ThrCys: 1.121 ± 0.521
6.355ThrAsp: 6.355 ± 1.497
4.112ThrGlu: 4.112 ± 1.337
2.243ThrPhe: 2.243 ± 1.284
3.738ThrGly: 3.738 ± 0.722
0.748ThrHis: 0.748 ± 0.47
4.112ThrIle: 4.112 ± 1.569
2.243ThrLys: 2.243 ± 0.82
7.103ThrLeu: 7.103 ± 0.931
1.495ThrMet: 1.495 ± 0.607
4.486ThrAsn: 4.486 ± 1.429
2.243ThrPro: 2.243 ± 0.755
3.738ThrGln: 3.738 ± 1.117
5.234ThrArg: 5.234 ± 1.99
4.112ThrSer: 4.112 ± 1.145
5.607ThrThr: 5.607 ± 1.191
7.85ThrVal: 7.85 ± 2.288
1.495ThrTrp: 1.495 ± 0.758
1.495ThrTyr: 1.495 ± 0.982
0.0ThrXaa: 0.0 ± 0.0
Val
7.85ValAla: 7.85 ± 2.447
0.374ValCys: 0.374 ± 0.27
7.103ValAsp: 7.103 ± 1.258
3.364ValGlu: 3.364 ± 1.346
2.991ValPhe: 2.991 ± 0.754
4.486ValGly: 4.486 ± 1.836
1.869ValHis: 1.869 ± 0.917
2.617ValIle: 2.617 ± 0.941
1.495ValLys: 1.495 ± 0.777
6.729ValLeu: 6.729 ± 1.104
0.374ValMet: 0.374 ± 0.27
0.748ValAsn: 0.748 ± 0.444
5.234ValPro: 5.234 ± 0.845
2.243ValGln: 2.243 ± 0.92
5.981ValArg: 5.981 ± 1.129
5.607ValSer: 5.607 ± 1.267
5.234ValThr: 5.234 ± 1.455
7.103ValVal: 7.103 ± 1.806
0.374ValTrp: 0.374 ± 0.341
1.869ValTyr: 1.869 ± 0.741
0.0ValXaa: 0.0 ± 0.0
Trp
0.748TrpAla: 0.748 ± 0.538
0.748TrpCys: 0.748 ± 0.541
1.121TrpAsp: 1.121 ± 0.938
0.374TrpGlu: 0.374 ± 0.34
0.374TrpPhe: 0.374 ± 0.414
0.374TrpGly: 0.374 ± 0.34
0.374TrpHis: 0.374 ± 0.27
0.374TrpIle: 0.374 ± 0.385
1.495TrpLys: 1.495 ± 0.581
2.243TrpLeu: 2.243 ± 1.12
0.374TrpMet: 0.374 ± 0.27
1.121TrpAsn: 1.121 ± 0.627
0.748TrpPro: 0.748 ± 0.526
0.0TrpGln: 0.0 ± 0.0
0.748TrpArg: 0.748 ± 0.377
1.869TrpSer: 1.869 ± 0.435
1.121TrpThr: 1.121 ± 0.627
1.869TrpVal: 1.869 ± 0.723
0.374TrpTrp: 0.374 ± 0.27
0.748TrpTyr: 0.748 ± 0.429
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.869TyrAla: 1.869 ± 0.523
0.0TyrCys: 0.0 ± 0.0
2.991TyrAsp: 2.991 ± 0.73
3.364TyrGlu: 3.364 ± 0.866
0.374TyrPhe: 0.374 ± 0.313
1.869TyrGly: 1.869 ± 0.782
0.748TyrHis: 0.748 ± 0.429
2.243TyrIle: 2.243 ± 0.964
1.121TyrLys: 1.121 ± 0.325
0.374TyrLeu: 0.374 ± 0.27
0.374TyrMet: 0.374 ± 0.313
1.869TyrAsn: 1.869 ± 0.568
0.748TyrPro: 0.748 ± 0.541
1.121TyrGln: 1.121 ± 0.632
1.495TyrArg: 1.495 ± 0.628
0.748TyrSer: 0.748 ± 0.389
2.243TyrThr: 2.243 ± 0.692
1.121TyrVal: 1.121 ± 0.458
0.374TyrTrp: 0.374 ± 0.27
0.374TyrTyr: 0.374 ± 0.34
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (2676 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski