Amino acid dipeptide frequency for Haloarcula hispanica pleomorphic virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
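The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions, since the page does not state the exact counting convention.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns (Ala, Cys, Asp, ...).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: overlapping windows of length 2 are counted within each
    protein; any residue outside the 20 standard letters becomes 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_per_mille(["AAGA", "GAX"])

# Uniform expectation for the 400 standard dipeptides: 1000 / 400 = 2.5 per mille.
uniform = 1000.0 / 400
over = sorted(d for d, f in freqs.items() if f > uniform)
```

The returned dictionary always has 441 keys, so dipeptides that never occur show up as 0.0, as in the table.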

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
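As a small worked example of the unit conversion, the sketch below turns two values from the table into fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and the use of 2.5‰ as the cut-off between over- and underrepresented dipeptides is an assumption based on the text above; the page does not state the exact rule behind the colouring.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=2.5):
    """Label a per mille frequency relative to an assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


ala_ala = 14.659  # AlaAla, taken from the table
trp_trp = 0.0     # TrpTrp, taken from the table

print(per_mille_to_fraction(ala_ala))
print(classify(ala_ala), classify(trp_trp))
```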

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.659 ± 2.431
AlaCys: 0.792 ± 0.351
AlaAsp: 8.716 ± 1.589
AlaGlu: 5.547 ± 1.371
AlaPhe: 3.17 ± 1.11
AlaGly: 13.074 ± 2.017
AlaHis: 2.377 ± 0.737
AlaIle: 3.566 ± 1.057
AlaLys: 2.377 ± 0.904
AlaLeu: 7.528 ± 1.872
AlaMet: 2.377 ± 0.677
AlaAsn: 3.17 ± 0.76
AlaPro: 4.754 ± 2.984
AlaGln: 1.585 ± 0.932
AlaArg: 5.151 ± 1.567
AlaSer: 6.339 ± 1.548
AlaThr: 7.924 ± 1.289
AlaVal: 5.151 ± 1.294
AlaTrp: 0.396 ± 0.302
AlaTyr: 2.377 ± 0.629
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.792 ± 0.457
CysCys: 0.792 ± 0.603
CysAsp: 2.377 ± 1.809
CysGlu: 0.792 ± 0.351
CysPhe: 0.0 ± 0.0
CysGly: 1.189 ± 0.556
CysHis: 0.396 ± 0.344
CysIle: 0.0 ± 0.0
CysLys: 1.189 ± 0.905
CysLeu: 0.396 ± 0.383
CysMet: 0.0 ± 0.0
CysAsn: 0.396 ± 0.302
CysPro: 0.792 ± 0.457
CysGln: 0.396 ± 0.302
CysArg: 0.792 ± 0.458
CysSer: 0.792 ± 0.457
CysThr: 0.396 ± 0.302
CysVal: 1.189 ± 0.43
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.32 ± 2.745
AspCys: 0.792 ± 0.603
AspAsp: 9.113 ± 2.326
AspGlu: 7.132 ± 1.325
AspPhe: 3.17 ± 1.482
AspGly: 8.716 ± 2.156
AspHis: 1.981 ± 0.784
AspIle: 0.792 ± 0.656
AspLys: 1.585 ± 0.661
AspLeu: 6.339 ± 1.831
AspMet: 1.189 ± 0.404
AspAsn: 1.585 ± 0.6
AspPro: 6.339 ± 1.277
AspGln: 1.981 ± 0.563
AspArg: 5.151 ± 1.46
AspSer: 6.735 ± 1.965
AspThr: 3.962 ± 0.805
AspVal: 6.339 ± 1.164
AspTrp: 0.792 ± 0.516
AspTyr: 2.377 ± 0.728
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.754 ± 2.42
GluCys: 0.792 ± 0.458
GluAsp: 4.754 ± 1.393
GluGlu: 7.528 ± 3.116
GluPhe: 2.377 ± 0.809
GluGly: 3.17 ± 1.223
GluHis: 2.773 ± 1.332
GluIle: 2.773 ± 1.177
GluLys: 3.566 ± 0.643
GluLeu: 7.132 ± 1.821
GluMet: 2.773 ± 0.928
GluAsn: 2.377 ± 0.636
GluPro: 2.377 ± 0.753
GluGln: 4.358 ± 0.814
GluArg: 7.924 ± 2.792
GluSer: 6.735 ± 0.979
GluThr: 6.339 ± 1.289
GluVal: 8.32 ± 2.799
GluTrp: 0.0 ± 0.0
GluTyr: 3.17 ± 1.322
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.358 ± 1.745
PheCys: 0.0 ± 0.0
PheAsp: 3.962 ± 1.258
PheGlu: 1.189 ± 0.697
PhePhe: 1.189 ± 0.828
PheGly: 1.585 ± 0.721
PheHis: 0.0 ± 0.0
PheIle: 1.585 ± 0.739
PheLys: 1.981 ± 0.944
PheLeu: 3.17 ± 1.673
PheMet: 0.396 ± 0.383
PheAsn: 0.792 ± 0.33
PhePro: 1.189 ± 0.604
PheGln: 0.792 ± 0.656
PheArg: 2.377 ± 1.081
PheSer: 1.585 ± 0.65
PheThr: 2.377 ± 0.972
PheVal: 3.17 ± 0.74
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.132 ± 1.431
GlyCys: 1.189 ± 0.556
GlyAsp: 8.32 ± 1.947
GlyGlu: 9.113 ± 1.794
GlyPhe: 3.17 ± 0.906
GlyGly: 9.905 ± 1.341
GlyHis: 0.0 ± 0.0
GlyIle: 2.773 ± 0.831
GlyLys: 2.377 ± 0.738
GlyLeu: 3.566 ± 0.99
GlyMet: 1.981 ± 0.882
GlyAsn: 4.358 ± 1.084
GlyPro: 3.17 ± 1.382
GlyGln: 1.189 ± 0.669
GlyArg: 3.17 ± 0.876
GlySer: 5.943 ± 1.436
GlyThr: 3.962 ± 1.38
GlyVal: 5.151 ± 1.761
GlyTrp: 0.396 ± 0.446
GlyTyr: 2.377 ± 0.763
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.17 ± 0.979
HisCys: 0.396 ± 0.302
HisAsp: 0.792 ± 0.603
HisGlu: 0.792 ± 0.351
HisPhe: 0.396 ± 0.344
HisGly: 1.981 ± 1.178
HisHis: 0.792 ± 0.33
HisIle: 0.396 ± 0.383
HisLys: 0.396 ± 0.302
HisLeu: 1.189 ± 0.626
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.396 ± 0.344
HisGln: 1.585 ± 0.823
HisArg: 1.189 ± 1.148
HisSer: 0.396 ± 0.328
HisThr: 1.585 ± 0.636
HisVal: 1.585 ± 0.811
HisTrp: 0.396 ± 0.302
HisTyr: 0.396 ± 0.328
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.735 ± 1.87
IleCys: 0.396 ± 0.344
IleAsp: 1.981 ± 0.736
IleGlu: 2.377 ± 1.093
IlePhe: 0.792 ± 0.33
IleGly: 1.981 ± 1.414
IleHis: 0.396 ± 0.383
IleIle: 1.189 ± 0.406
IleLys: 1.585 ± 0.575
IleLeu: 1.585 ± 0.71
IleMet: 0.0 ± 0.0
IleAsn: 1.189 ± 0.67
IlePro: 3.17 ± 1.231
IleGln: 1.585 ± 1.051
IleArg: 2.377 ± 0.686
IleSer: 1.189 ± 0.577
IleThr: 3.566 ± 1.443
IleVal: 1.585 ± 0.897
IleTrp: 0.0 ± 0.0
IleTyr: 0.396 ± 0.383
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.585 ± 0.514
LysCys: 0.396 ± 0.383
LysAsp: 0.792 ± 0.351
LysGlu: 2.377 ± 0.952
LysPhe: 0.396 ± 0.344
LysGly: 1.585 ± 0.661
LysHis: 1.585 ± 0.811
LysIle: 1.189 ± 0.579
LysLys: 0.792 ± 0.33
LysLeu: 3.17 ± 0.985
LysMet: 1.189 ± 0.466
LysAsn: 1.189 ± 0.619
LysPro: 0.792 ± 0.457
LysGln: 0.396 ± 0.446
LysArg: 1.981 ± 0.736
LysSer: 1.585 ± 0.47
LysThr: 1.189 ± 0.77
LysVal: 2.773 ± 1.148
LysTrp: 0.0 ± 0.0
LysTyr: 1.189 ± 0.984
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.547 ± 1.523
LeuCys: 1.585 ± 0.594
LeuAsp: 6.735 ± 1.375
LeuGlu: 8.32 ± 1.064
LeuPhe: 2.377 ± 0.888
LeuGly: 5.547 ± 1.651
LeuHis: 1.189 ± 0.556
LeuIle: 4.358 ± 1.506
LeuLys: 1.189 ± 0.406
LeuLeu: 7.132 ± 1.875
LeuMet: 1.981 ± 0.653
LeuAsn: 1.981 ± 1.029
LeuPro: 3.962 ± 1.167
LeuGln: 2.377 ± 1.122
LeuArg: 5.547 ± 1.013
LeuSer: 6.339 ± 1.507
LeuThr: 5.547 ± 1.36
LeuVal: 4.754 ± 1.737
LeuTrp: 0.396 ± 0.383
LeuTyr: 1.189 ± 0.477
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.377 ± 0.703
MetCys: 0.0 ± 0.0
MetAsp: 0.792 ± 0.351
MetGlu: 0.0 ± 0.0
MetPhe: 0.792 ± 0.533
MetGly: 1.585 ± 0.636
MetHis: 0.0 ± 0.0
MetIle: 0.396 ± 0.456
MetLys: 0.396 ± 0.328
MetLeu: 3.566 ± 1.131
MetMet: 0.396 ± 0.344
MetAsn: 0.792 ± 0.475
MetPro: 0.792 ± 0.607
MetGln: 1.981 ± 0.809
MetArg: 0.792 ± 0.351
MetSer: 3.566 ± 0.824
MetThr: 1.981 ± 0.606
MetVal: 0.792 ± 0.457
MetTrp: 0.792 ± 0.603
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.566 ± 1.922
AsnCys: 0.0 ± 0.0
AsnAsp: 1.585 ± 0.556
AsnGlu: 2.377 ± 0.528
AsnPhe: 0.792 ± 0.521
AsnGly: 1.189 ± 0.697
AsnHis: 0.396 ± 0.328
AsnIle: 1.189 ± 0.749
AsnLys: 1.189 ± 0.477
AsnLeu: 2.773 ± 0.825
AsnMet: 1.189 ± 0.667
AsnAsn: 2.377 ± 0.636
AsnPro: 1.981 ± 1.146
AsnGln: 0.396 ± 0.328
AsnArg: 1.585 ± 0.723
AsnSer: 1.189 ± 0.585
AsnThr: 3.17 ± 1.147
AsnVal: 2.773 ± 0.987
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.792 ± 0.552
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.358 ± 1.304
ProCys: 0.0 ± 0.0
ProAsp: 7.132 ± 2.138
ProGlu: 5.943 ± 2.963
ProPhe: 1.189 ± 0.541
ProGly: 1.189 ± 0.541
ProHis: 1.981 ± 1.522
ProIle: 1.189 ± 0.386
ProLys: 0.396 ± 0.446
ProLeu: 3.566 ± 0.893
ProMet: 0.792 ± 0.481
ProAsn: 0.792 ± 0.492
ProPro: 1.585 ± 1.206
ProGln: 0.396 ± 0.302
ProArg: 3.566 ± 1.239
ProSer: 3.566 ± 1.563
ProThr: 2.377 ± 1.364
ProVal: 3.17 ± 1.113
ProTrp: 0.792 ± 0.569
ProTyr: 1.585 ± 0.717
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.189 ± 0.84
GlnCys: 0.396 ± 0.446
GlnAsp: 2.773 ± 0.771
GlnGlu: 4.358 ± 0.989
GlnPhe: 1.585 ± 0.689
GlnGly: 0.792 ± 0.33
GlnHis: 0.396 ± 0.456
GlnIle: 1.189 ± 0.61
GlnLys: 1.189 ± 0.384
GlnLeu: 1.585 ± 0.712
GlnMet: 0.0 ± 0.0
GlnAsn: 1.585 ± 0.965
GlnPro: 0.396 ± 0.328
GlnGln: 1.585 ± 0.986
GlnArg: 1.189 ± 0.542
GlnSer: 4.358 ± 1.186
GlnThr: 1.585 ± 1.312
GlnVal: 1.189 ± 0.585
GlnTrp: 1.189 ± 0.673
GlnTyr: 1.585 ± 0.749
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.547 ± 1.989
ArgCys: 1.189 ± 0.905
ArgAsp: 2.773 ± 1.346
ArgGlu: 5.151 ± 2.295
ArgPhe: 1.585 ± 1.012
ArgGly: 3.566 ± 2.071
ArgHis: 0.0 ± 0.0
ArgIle: 1.585 ± 1.53
ArgLys: 1.189 ± 0.695
ArgLeu: 3.962 ± 1.473
ArgMet: 1.585 ± 0.53
ArgAsn: 1.189 ± 0.561
ArgPro: 3.17 ± 1.293
ArgGln: 1.189 ± 0.424
ArgArg: 3.962 ± 0.556
ArgSer: 6.339 ± 1.319
ArgThr: 6.735 ± 1.367
ArgVal: 3.962 ± 1.563
ArgTrp: 1.189 ± 0.905
ArgTyr: 2.377 ± 0.985
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.32 ± 1.441
SerCys: 1.189 ± 0.545
SerAsp: 6.339 ± 2.092
SerGlu: 6.339 ± 1.131
SerPhe: 3.17 ± 1.591
SerGly: 8.716 ± 1.757
SerHis: 0.396 ± 0.344
SerIle: 2.377 ± 0.567
SerLys: 1.585 ± 0.535
SerLeu: 5.943 ± 1.277
SerMet: 1.189 ± 0.477
SerAsn: 1.981 ± 0.661
SerPro: 2.773 ± 1.346
SerGln: 2.773 ± 1.488
SerArg: 3.962 ± 1.035
SerSer: 6.339 ± 1.25
SerThr: 3.17 ± 0.886
SerVal: 7.528 ± 2.76
SerTrp: 2.773 ± 1.087
SerTyr: 2.377 ± 0.914
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.339 ± 1.211
ThrCys: 1.189 ± 0.905
ThrAsp: 4.754 ± 1.069
ThrGlu: 7.528 ± 2.07
ThrPhe: 2.773 ± 0.516
ThrGly: 5.943 ± 0.735
ThrHis: 1.981 ± 0.771
ThrIle: 3.17 ± 0.691
ThrLys: 0.792 ± 0.577
ThrLeu: 6.339 ± 1.652
ThrMet: 1.189 ± 0.497
ThrAsn: 2.377 ± 1.223
ThrPro: 1.585 ± 0.739
ThrGln: 3.566 ± 0.981
ThrArg: 1.585 ± 0.755
ThrSer: 4.358 ± 1.295
ThrThr: 4.754 ± 2.708
ThrVal: 7.924 ± 1.365
ThrTrp: 1.189 ± 0.598
ThrTyr: 1.585 ± 1.078
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.905 ± 1.364
ValCys: 1.189 ± 0.905
ValAsp: 7.924 ± 1.423
ValGlu: 5.151 ± 1.624
ValPhe: 1.981 ± 0.661
ValGly: 5.151 ± 1.729
ValHis: 0.396 ± 0.302
ValIle: 3.566 ± 1.109
ValLys: 1.981 ± 0.561
ValLeu: 5.151 ± 1.519
ValMet: 1.981 ± 0.647
ValAsn: 0.792 ± 0.33
ValPro: 5.943 ± 1.784
ValGln: 0.396 ± 0.344
ValArg: 4.358 ± 1.442
ValSer: 5.547 ± 1.454
ValThr: 6.735 ± 3.391
ValVal: 9.113 ± 2.276
ValTrp: 0.792 ± 0.475
ValTyr: 0.792 ± 0.656
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.189 ± 0.541
TrpCys: 0.396 ± 0.302
TrpAsp: 0.396 ± 0.328
TrpGlu: 0.396 ± 0.456
TrpPhe: 0.0 ± 0.0
TrpGly: 0.792 ± 0.516
TrpHis: 0.396 ± 0.302
TrpIle: 0.396 ± 0.446
TrpLys: 0.396 ± 0.302
TrpLeu: 1.189 ± 0.605
TrpMet: 0.396 ± 0.302
TrpAsn: 0.792 ± 0.688
TrpPro: 0.396 ± 0.302
TrpGln: 0.396 ± 0.383
TrpArg: 0.396 ± 0.426
TrpSer: 1.585 ± 0.723
TrpThr: 0.792 ± 0.603
TrpVal: 0.792 ± 0.457
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.585 ± 0.615
TyrCys: 0.0 ± 0.0
TyrAsp: 2.377 ± 1.156
TyrGlu: 1.585 ± 0.514
TyrPhe: 0.792 ± 0.656
TyrGly: 1.585 ± 0.709
TyrHis: 0.396 ± 0.302
TyrIle: 0.396 ± 0.328
TyrLys: 0.0 ± 0.0
TyrLeu: 2.773 ± 1.212
TyrMet: 0.792 ± 0.477
TyrAsn: 0.792 ± 0.552
TyrPro: 0.0 ± 0.0
TyrGln: 1.189 ± 0.477
TyrArg: 1.189 ± 0.673
TyrSer: 4.754 ± 1.978
TyrThr: 2.773 ± 0.925
TyrVal: 1.585 ± 0.535
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2525 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
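One way to read the note is that whole proteins are resampled with replacement and the frequency is recomputed for each resample. The sketch below follows that reading. It assumes that "x100" means 100 replicates and that the reported error is the standard deviation across replicates; neither detail is confirmed by the page. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes) over sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the viral proteome).
proteins = ["AAGA", "GAAV", "MKAA", "VVGA"]
error = bootstrap_error(proteins, "AA")
```

With only 8 proteins in this proteome, resampling at the protein level gives fairly coarse estimates, which may explain why some errors in the table are large relative to the frequency itself.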

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski