Amino acid dipepetide frequency for Hart Park virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.658AlaAla: 1.658 ± 0.667
0.0AlaCys: 0.0 ± 0.0
3.078AlaAsp: 3.078 ± 1.163
1.421AlaGlu: 1.421 ± 0.605
1.894AlaPhe: 1.894 ± 1.005
1.894AlaGly: 1.894 ± 0.96
0.71AlaHis: 0.71 ± 0.393
2.605AlaIle: 2.605 ± 1.107
2.605AlaLys: 2.605 ± 0.599
4.262AlaLeu: 4.262 ± 1.153
0.474AlaMet: 0.474 ± 0.482
2.842AlaAsn: 2.842 ± 0.544
1.184AlaPro: 1.184 ± 0.99
2.131AlaGln: 2.131 ± 0.869
2.368AlaArg: 2.368 ± 0.902
2.842AlaSer: 2.842 ± 1.471
0.947AlaThr: 0.947 ± 0.589
2.368AlaVal: 2.368 ± 0.524
1.184AlaTrp: 1.184 ± 0.362
2.131AlaTyr: 2.131 ± 1.041
0.0AlaXaa: 0.0 ± 0.0
Cys
1.421CysAla: 1.421 ± 0.433
0.0CysCys: 0.0 ± 0.0
0.947CysAsp: 0.947 ± 0.376
0.474CysGlu: 0.474 ± 0.507
0.947CysPhe: 0.947 ± 0.449
1.184CysGly: 1.184 ± 0.955
0.947CysHis: 0.947 ± 0.501
0.947CysIle: 0.947 ± 0.346
2.605CysLys: 2.605 ± 1.245
0.947CysLeu: 0.947 ± 0.49
0.0CysMet: 0.0 ± 0.0
1.184CysAsn: 1.184 ± 0.677
0.71CysPro: 0.71 ± 0.323
0.71CysGln: 0.71 ± 0.42
0.71CysArg: 0.71 ± 0.277
1.421CysSer: 1.421 ± 0.474
0.71CysThr: 0.71 ± 0.398
1.184CysVal: 1.184 ± 0.464
0.71CysTrp: 0.71 ± 0.337
0.71CysTyr: 0.71 ± 0.323
0.0CysXaa: 0.0 ± 0.0
Asp
1.658AspAla: 1.658 ± 1.367
0.947AspCys: 0.947 ± 0.657
3.552AspAsp: 3.552 ± 1.544
2.842AspGlu: 2.842 ± 0.742
1.184AspPhe: 1.184 ± 0.507
3.315AspGly: 3.315 ± 1.332
0.71AspHis: 0.71 ± 0.412
3.315AspIle: 3.315 ± 1.394
4.262AspLys: 4.262 ± 0.881
6.867AspLeu: 6.867 ± 1.202
1.184AspMet: 1.184 ± 0.899
3.552AspAsn: 3.552 ± 0.726
4.026AspPro: 4.026 ± 0.636
3.552AspGln: 3.552 ± 0.599
1.894AspArg: 1.894 ± 0.614
2.605AspSer: 2.605 ± 0.818
1.894AspThr: 1.894 ± 0.962
3.789AspVal: 3.789 ± 1.155
1.894AspTrp: 1.894 ± 0.798
3.078AspTyr: 3.078 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
2.131GluAla: 2.131 ± 1.059
0.237GluCys: 0.237 ± 0.14
4.736GluAsp: 4.736 ± 0.619
5.21GluGlu: 5.21 ± 1.027
3.552GluPhe: 3.552 ± 1.488
2.131GluGly: 2.131 ± 0.622
0.71GluHis: 0.71 ± 0.34
3.315GluIle: 3.315 ± 1.528
4.499GluLys: 4.499 ± 1.384
5.683GluLeu: 5.683 ± 0.746
2.131GluMet: 2.131 ± 0.633
3.315GluAsn: 3.315 ± 1.11
1.658GluPro: 1.658 ± 0.411
1.184GluGln: 1.184 ± 0.395
1.658GluArg: 1.658 ± 0.536
4.262GluSer: 4.262 ± 0.932
2.842GluThr: 2.842 ± 0.763
3.789GluVal: 3.789 ± 0.366
1.184GluTrp: 1.184 ± 0.325
2.605GluTyr: 2.605 ± 0.78
0.0GluXaa: 0.0 ± 0.0
Phe
1.421PheAla: 1.421 ± 0.498
0.947PheCys: 0.947 ± 0.496
3.078PheAsp: 3.078 ± 1.507
2.368PheGlu: 2.368 ± 0.642
2.605PhePhe: 2.605 ± 0.921
3.315PheGly: 3.315 ± 0.799
1.421PheHis: 1.421 ± 0.735
2.605PheIle: 2.605 ± 0.941
3.078PheLys: 3.078 ± 0.912
4.973PheLeu: 4.973 ± 0.985
0.237PheMet: 0.237 ± 0.14
1.894PheAsn: 1.894 ± 0.559
2.368PhePro: 2.368 ± 0.409
2.842PheGln: 2.842 ± 0.56
2.605PheArg: 2.605 ± 0.486
2.842PheSer: 2.842 ± 0.38
1.421PheThr: 1.421 ± 0.519
3.078PheVal: 3.078 ± 0.819
0.71PheTrp: 0.71 ± 0.42
1.184PheTyr: 1.184 ± 0.499
0.0PheXaa: 0.0 ± 0.0
Gly
0.947GlyAla: 0.947 ± 0.418
0.71GlyCys: 0.71 ± 0.277
3.315GlyAsp: 3.315 ± 0.76
1.421GlyGlu: 1.421 ± 0.667
2.605GlyPhe: 2.605 ± 0.772
4.262GlyGly: 4.262 ± 0.738
0.947GlyHis: 0.947 ± 0.381
6.394GlyIle: 6.394 ± 1.567
5.683GlyLys: 5.683 ± 1.946
6.867GlyLeu: 6.867 ± 1.59
0.71GlyMet: 0.71 ± 0.277
2.368GlyAsn: 2.368 ± 0.577
0.947GlyPro: 0.947 ± 0.376
1.894GlyGln: 1.894 ± 0.685
2.605GlyArg: 2.605 ± 1.156
4.973GlySer: 4.973 ± 0.911
3.789GlyThr: 3.789 ± 1.661
2.605GlyVal: 2.605 ± 1.057
0.71GlyTrp: 0.71 ± 0.323
2.368GlyTyr: 2.368 ± 0.543
0.0GlyXaa: 0.0 ± 0.0
His
0.474HisAla: 0.474 ± 0.336
0.0HisCys: 0.0 ± 0.0
1.421HisAsp: 1.421 ± 0.878
0.71HisGlu: 0.71 ± 0.364
0.71HisPhe: 0.71 ± 0.322
1.658HisGly: 1.658 ± 0.846
0.947HisHis: 0.947 ± 0.376
2.605HisIle: 2.605 ± 0.665
0.474HisLys: 0.474 ± 0.225
3.552HisLeu: 3.552 ± 1.02
0.71HisMet: 0.71 ± 0.516
0.947HisAsn: 0.947 ± 0.376
3.552HisPro: 3.552 ± 0.964
1.184HisGln: 1.184 ± 0.485
0.947HisArg: 0.947 ± 0.56
1.894HisSer: 1.894 ± 0.561
0.237HisThr: 0.237 ± 0.14
1.658HisVal: 1.658 ± 0.501
0.237HisTrp: 0.237 ± 0.14
0.474HisTyr: 0.474 ± 0.284
0.0HisXaa: 0.0 ± 0.0
Ile
3.789IleAla: 3.789 ± 0.887
2.842IleCys: 2.842 ± 0.839
3.552IleAsp: 3.552 ± 0.573
5.21IleGlu: 5.21 ± 1.079
3.078IlePhe: 3.078 ± 0.842
6.867IleGly: 6.867 ± 1.068
1.894IleHis: 1.894 ± 0.935
3.552IleIle: 3.552 ± 1.1
3.078IleLys: 3.078 ± 0.86
5.683IleLeu: 5.683 ± 1.317
1.184IleMet: 1.184 ± 0.431
4.499IleAsn: 4.499 ± 1.572
3.078IlePro: 3.078 ± 1.195
3.552IleGln: 3.552 ± 0.658
5.683IleArg: 5.683 ± 1.017
7.104IleSer: 7.104 ± 1.67
3.789IleThr: 3.789 ± 1.031
4.026IleVal: 4.026 ± 1.279
0.71IleTrp: 0.71 ± 0.398
5.683IleTyr: 5.683 ± 1.327
0.0IleXaa: 0.0 ± 0.0
Lys
2.368LysAla: 2.368 ± 1.148
1.184LysCys: 1.184 ± 1.086
4.026LysAsp: 4.026 ± 1.899
2.842LysGlu: 2.842 ± 0.651
1.184LysPhe: 1.184 ± 0.717
4.262LysGly: 4.262 ± 0.832
1.894LysHis: 1.894 ± 0.648
7.104LysIle: 7.104 ± 0.922
6.157LysLys: 6.157 ± 1.57
4.736LysLeu: 4.736 ± 1.22
2.131LysMet: 2.131 ± 1.151
3.789LysAsn: 3.789 ± 0.912
1.184LysPro: 1.184 ± 0.473
1.894LysGln: 1.894 ± 0.753
5.92LysArg: 5.92 ± 1.553
5.92LysSer: 5.92 ± 1.255
4.026LysThr: 4.026 ± 0.697
4.736LysVal: 4.736 ± 1.454
1.184LysTrp: 1.184 ± 0.581
1.658LysTyr: 1.658 ± 0.528
0.0LysXaa: 0.0 ± 0.0
Leu
2.842LeuAla: 2.842 ± 0.724
1.658LeuCys: 1.658 ± 0.745
6.157LeuAsp: 6.157 ± 0.778
8.525LeuGlu: 8.525 ± 1.822
4.736LeuPhe: 4.736 ± 1.151
4.262LeuGly: 4.262 ± 1.191
2.131LeuHis: 2.131 ± 0.64
9.946LeuIle: 9.946 ± 2.221
6.867LeuLys: 6.867 ± 1.341
8.525LeuLeu: 8.525 ± 1.523
3.078LeuMet: 3.078 ± 0.67
7.341LeuAsn: 7.341 ± 1.101
3.552LeuPro: 3.552 ± 0.845
3.789LeuGln: 3.789 ± 0.721
4.736LeuArg: 4.736 ± 1.232
7.578LeuSer: 7.578 ± 0.944
4.973LeuThr: 4.973 ± 1.074
5.21LeuVal: 5.21 ± 1.014
1.184LeuTrp: 1.184 ± 0.569
3.315LeuTyr: 3.315 ± 0.967
0.0LeuXaa: 0.0 ± 0.0
Met
0.947MetAla: 0.947 ± 0.411
0.474MetCys: 0.474 ± 0.225
1.658MetAsp: 1.658 ± 0.491
1.894MetGlu: 1.894 ± 1.036
2.842MetPhe: 2.842 ± 0.902
1.421MetGly: 1.421 ± 0.552
0.0MetHis: 0.0 ± 0.0
1.658MetIle: 1.658 ± 0.464
1.894MetLys: 1.894 ± 0.97
3.078MetLeu: 3.078 ± 0.845
0.71MetMet: 0.71 ± 0.322
2.131MetAsn: 2.131 ± 0.759
0.237MetPro: 0.237 ± 0.39
0.237MetGln: 0.237 ± 0.14
0.947MetArg: 0.947 ± 0.749
1.184MetSer: 1.184 ± 0.499
1.184MetThr: 1.184 ± 1.212
1.658MetVal: 1.658 ± 0.606
0.237MetTrp: 0.237 ± 0.36
0.237MetTyr: 0.237 ± 0.365
0.0MetXaa: 0.0 ± 0.0
Asn
2.131AsnAla: 2.131 ± 0.691
0.947AsnCys: 0.947 ± 0.46
2.368AsnAsp: 2.368 ± 0.857
1.184AsnGlu: 1.184 ± 0.507
2.131AsnPhe: 2.131 ± 0.542
2.842AsnGly: 2.842 ± 1.373
2.131AsnHis: 2.131 ± 0.61
4.499AsnIle: 4.499 ± 0.838
4.499AsnLys: 4.499 ± 1.162
6.394AsnLeu: 6.394 ± 1.95
2.131AsnMet: 2.131 ± 0.402
2.605AsnAsn: 2.605 ± 0.836
3.315AsnPro: 3.315 ± 0.453
3.552AsnGln: 3.552 ± 1.225
1.658AsnArg: 1.658 ± 0.47
5.683AsnSer: 5.683 ± 1.668
1.894AsnThr: 1.894 ± 0.699
3.315AsnVal: 3.315 ± 1.417
1.894AsnTrp: 1.894 ± 0.532
2.842AsnTyr: 2.842 ± 0.66
0.0AsnXaa: 0.0 ± 0.0
Pro
2.131ProAla: 2.131 ± 0.685
0.474ProCys: 0.474 ± 0.225
3.315ProAsp: 3.315 ± 0.805
1.658ProGlu: 1.658 ± 0.5
1.421ProPhe: 1.421 ± 0.644
2.605ProGly: 2.605 ± 1.605
1.421ProHis: 1.421 ± 0.676
4.973ProIle: 4.973 ± 0.784
3.789ProLys: 3.789 ± 1.516
3.789ProLeu: 3.789 ± 0.724
0.71ProMet: 0.71 ± 0.763
2.605ProAsn: 2.605 ± 0.706
2.842ProPro: 2.842 ± 1.822
1.421ProGln: 1.421 ± 0.438
2.368ProArg: 2.368 ± 0.513
3.552ProSer: 3.552 ± 0.758
1.421ProThr: 1.421 ± 0.701
1.421ProVal: 1.421 ± 0.589
0.947ProTrp: 0.947 ± 0.516
1.421ProTyr: 1.421 ± 0.712
0.0ProXaa: 0.0 ± 0.0
Gln
0.947GlnAla: 0.947 ± 0.34
1.184GlnCys: 1.184 ± 0.489
1.421GlnAsp: 1.421 ± 0.768
3.078GlnGlu: 3.078 ± 1.079
1.184GlnPhe: 1.184 ± 0.359
1.421GlnGly: 1.421 ± 0.674
0.947GlnHis: 0.947 ± 0.659
2.368GlnIle: 2.368 ± 0.731
2.605GlnLys: 2.605 ± 0.617
3.078GlnLeu: 3.078 ± 0.898
0.71GlnMet: 0.71 ± 0.595
2.368GlnAsn: 2.368 ± 0.466
1.184GlnPro: 1.184 ± 0.687
0.474GlnGln: 0.474 ± 0.294
1.658GlnArg: 1.658 ± 0.494
3.315GlnSer: 3.315 ± 0.791
3.315GlnThr: 3.315 ± 0.867
2.131GlnVal: 2.131 ± 0.708
1.658GlnTrp: 1.658 ± 0.646
2.131GlnTyr: 2.131 ± 0.882
0.0GlnXaa: 0.0 ± 0.0
Arg
1.658ArgAla: 1.658 ± 0.398
0.237ArgCys: 0.237 ± 0.253
1.894ArgAsp: 1.894 ± 1.081
4.499ArgGlu: 4.499 ± 1.149
3.789ArgPhe: 3.789 ± 1.462
3.315ArgGly: 3.315 ± 0.782
0.71ArgHis: 0.71 ± 0.322
2.842ArgIle: 2.842 ± 0.903
2.131ArgLys: 2.131 ± 1.174
3.789ArgLeu: 3.789 ± 0.79
1.421ArgMet: 1.421 ± 0.429
3.552ArgAsn: 3.552 ± 0.828
2.131ArgPro: 2.131 ± 0.386
1.421ArgGln: 1.421 ± 0.671
2.368ArgArg: 2.368 ± 1.381
4.262ArgSer: 4.262 ± 0.599
2.605ArgThr: 2.605 ± 0.704
1.658ArgVal: 1.658 ± 0.463
0.237ArgTrp: 0.237 ± 0.14
1.894ArgTyr: 1.894 ± 0.538
0.0ArgXaa: 0.0 ± 0.0
Ser
3.315SerAla: 3.315 ± 1.197
1.184SerCys: 1.184 ± 0.411
3.078SerAsp: 3.078 ± 0.76
4.262SerGlu: 4.262 ± 0.826
2.842SerPhe: 2.842 ± 1.369
3.315SerGly: 3.315 ± 1.101
3.078SerHis: 3.078 ± 0.753
8.051SerIle: 8.051 ± 1.194
4.736SerLys: 4.736 ± 1.189
9.472SerLeu: 9.472 ± 1.812
1.658SerMet: 1.658 ± 0.624
3.789SerAsn: 3.789 ± 1.283
3.078SerPro: 3.078 ± 0.988
2.605SerGln: 2.605 ± 0.877
2.842SerArg: 2.842 ± 0.937
5.446SerSer: 5.446 ± 1.508
5.446SerThr: 5.446 ± 1.634
4.736SerVal: 4.736 ± 1.07
2.842SerTrp: 2.842 ± 0.558
4.262SerTyr: 4.262 ± 1.005
0.0SerXaa: 0.0 ± 0.0
Thr
1.658ThrAla: 1.658 ± 0.88
1.184ThrCys: 1.184 ± 0.559
1.184ThrAsp: 1.184 ± 0.505
2.842ThrGlu: 2.842 ± 0.546
3.552ThrPhe: 3.552 ± 1.746
3.078ThrGly: 3.078 ± 0.898
1.421ThrHis: 1.421 ± 0.443
4.026ThrIle: 4.026 ± 0.576
1.658ThrLys: 1.658 ± 1.116
4.499ThrLeu: 4.499 ± 0.662
1.658ThrMet: 1.658 ± 0.541
2.842ThrAsn: 2.842 ± 0.884
2.605ThrPro: 2.605 ± 1.205
1.658ThrGln: 1.658 ± 0.765
1.421ThrArg: 1.421 ± 0.78
4.499ThrSer: 4.499 ± 1.305
2.605ThrThr: 2.605 ± 0.633
2.605ThrVal: 2.605 ± 0.96
2.131ThrTrp: 2.131 ± 0.772
1.421ThrTyr: 1.421 ± 0.554
0.0ThrXaa: 0.0 ± 0.0
Val
3.315ValAla: 3.315 ± 1.089
2.131ValCys: 2.131 ± 0.394
3.789ValAsp: 3.789 ± 0.953
2.605ValGlu: 2.605 ± 0.685
1.421ValPhe: 1.421 ± 0.72
1.658ValGly: 1.658 ± 0.684
0.71ValHis: 0.71 ± 0.42
4.026ValIle: 4.026 ± 1.1
3.789ValLys: 3.789 ± 1.355
5.683ValLeu: 5.683 ± 0.837
0.947ValMet: 0.947 ± 0.399
3.315ValAsn: 3.315 ± 1.07
4.262ValPro: 4.262 ± 0.601
1.184ValGln: 1.184 ± 0.564
2.368ValArg: 2.368 ± 0.974
5.683ValSer: 5.683 ± 1.028
2.131ValThr: 2.131 ± 0.63
1.894ValVal: 1.894 ± 0.615
0.947ValTrp: 0.947 ± 0.435
2.842ValTyr: 2.842 ± 0.847
0.0ValXaa: 0.0 ± 0.0
Trp
1.658TrpAla: 1.658 ± 0.846
0.474TrpCys: 0.474 ± 0.49
1.421TrpAsp: 1.421 ± 0.498
1.658TrpGlu: 1.658 ± 0.834
1.421TrpPhe: 1.421 ± 0.554
2.131TrpGly: 2.131 ± 0.777
0.474TrpHis: 0.474 ± 0.284
1.894TrpIle: 1.894 ± 0.946
0.71TrpLys: 0.71 ± 0.42
1.894TrpLeu: 1.894 ± 0.658
1.184TrpMet: 1.184 ± 0.833
0.71TrpAsn: 0.71 ± 0.42
0.237TrpPro: 0.237 ± 0.14
0.474TrpGln: 0.474 ± 0.4
0.474TrpArg: 0.474 ± 0.28
0.947TrpSer: 0.947 ± 0.56
0.947TrpThr: 0.947 ± 0.376
0.947TrpVal: 0.947 ± 0.451
0.474TrpTrp: 0.474 ± 0.507
0.947TrpTyr: 0.947 ± 0.663
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.131TyrAla: 2.131 ± 0.747
1.658TyrCys: 1.658 ± 0.683
2.131TyrAsp: 2.131 ± 0.468
2.131TyrGlu: 2.131 ± 0.571
1.894TyrPhe: 1.894 ± 0.902
1.184TyrGly: 1.184 ± 0.485
1.184TyrHis: 1.184 ± 0.629
2.131TyrIle: 2.131 ± 0.663
3.078TyrLys: 3.078 ± 0.735
6.63TyrLeu: 6.63 ± 1.371
1.421TyrMet: 1.421 ± 0.971
2.368TyrAsn: 2.368 ± 0.872
2.368TyrPro: 2.368 ± 0.996
1.421TyrGln: 1.421 ± 0.958
1.184TyrArg: 1.184 ± 0.714
4.026TyrSer: 4.026 ± 0.704
2.368TyrThr: 2.368 ± 1.164
1.894TyrVal: 1.894 ± 1.001
0.0TyrTrp: 0.0 ± 0.0
0.947TyrTyr: 0.947 ± 0.437
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4224 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski