Amino acid dipepetide frequency for Potato virus H

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.835AlaAla: 5.835 ± 1.506
1.094AlaCys: 1.094 ± 0.929
4.012AlaAsp: 4.012 ± 1.312
4.376AlaGlu: 4.376 ± 1.269
4.741AlaPhe: 4.741 ± 1.536
2.553AlaGly: 2.553 ± 1.333
1.823AlaHis: 1.823 ± 0.695
4.376AlaIle: 4.376 ± 2.529
6.929AlaLys: 6.929 ± 2.859
6.929AlaLeu: 6.929 ± 2.292
2.918AlaMet: 2.918 ± 0.675
5.106AlaAsn: 5.106 ± 2.194
2.553AlaPro: 2.553 ± 1.082
2.918AlaGln: 2.918 ± 1.166
4.741AlaArg: 4.741 ± 1.536
6.565AlaSer: 6.565 ± 2.496
4.741AlaThr: 4.741 ± 1.225
4.741AlaVal: 4.741 ± 1.245
0.729AlaTrp: 0.729 ± 0.404
1.823AlaTyr: 1.823 ± 1.869
0.0AlaXaa: 0.0 ± 0.0
Cys
2.188CysAla: 2.188 ± 1.858
1.094CysCys: 1.094 ± 0.606
1.094CysAsp: 1.094 ± 1.221
1.823CysGlu: 1.823 ± 2.082
1.459CysPhe: 1.459 ± 0.808
1.823CysGly: 1.823 ± 1.01
0.365CysHis: 0.365 ± 0.202
1.459CysIle: 1.459 ± 0.808
1.459CysLys: 1.459 ± 0.805
1.823CysLeu: 1.823 ± 0.717
0.365CysMet: 0.365 ± 0.858
0.365CysAsn: 0.365 ± 0.858
0.729CysPro: 0.729 ± 1.274
0.365CysGln: 0.365 ± 0.202
2.188CysArg: 2.188 ± 0.826
2.188CysSer: 2.188 ± 1.858
1.823CysThr: 1.823 ± 1.056
3.282CysVal: 3.282 ± 1.48
0.0CysTrp: 0.0 ± 0.0
1.823CysTyr: 1.823 ± 1.214
0.0CysXaa: 0.0 ± 0.0
Asp
4.376AspAla: 4.376 ± 1.269
0.365AspCys: 0.365 ± 0.202
2.553AspAsp: 2.553 ± 1.593
6.2AspGlu: 6.2 ± 2.701
0.729AspPhe: 0.729 ± 0.551
3.282AspGly: 3.282 ± 1.547
1.094AspHis: 1.094 ± 0.525
2.553AspIle: 2.553 ± 1.116
0.729AspLys: 0.729 ± 0.551
4.376AspLeu: 4.376 ± 1.807
2.188AspMet: 2.188 ± 1.943
2.918AspAsn: 2.918 ± 0.95
2.918AspPro: 2.918 ± 2.192
1.094AspGln: 1.094 ± 0.606
1.823AspArg: 1.823 ± 1.01
2.553AspSer: 2.553 ± 1.022
0.729AspThr: 0.729 ± 0.551
3.647AspVal: 3.647 ± 1.178
0.729AspTrp: 0.729 ± 0.551
1.823AspTyr: 1.823 ± 1.334
0.0AspXaa: 0.0 ± 0.0
Glu
5.835GluAla: 5.835 ± 2.576
0.729GluCys: 0.729 ± 0.747
3.282GluAsp: 3.282 ± 0.943
4.012GluGlu: 4.012 ± 2.222
2.188GluPhe: 2.188 ± 1.101
5.47GluGly: 5.47 ± 1.306
1.823GluHis: 1.823 ± 1.01
3.647GluIle: 3.647 ± 0.948
3.647GluLys: 3.647 ± 0.745
8.388GluLeu: 8.388 ± 2.401
0.0GluMet: 0.0 ± 0.0
2.188GluAsn: 2.188 ± 0.826
1.823GluPro: 1.823 ± 1.284
4.012GluGln: 4.012 ± 1.303
4.376GluArg: 4.376 ± 1.651
5.106GluSer: 5.106 ± 2.225
3.282GluThr: 3.282 ± 1.576
9.847GluVal: 9.847 ± 3.457
0.729GluTrp: 0.729 ± 0.404
4.012GluTyr: 4.012 ± 1.258
0.0GluXaa: 0.0 ± 0.0
Phe
3.647PheAla: 3.647 ± 1.054
1.094PheCys: 1.094 ± 0.929
4.376PheAsp: 4.376 ± 1.08
5.47PheGlu: 5.47 ± 1.796
0.365PhePhe: 0.365 ± 0.643
2.553PheGly: 2.553 ± 1.333
0.365PheHis: 0.365 ± 1.089
1.094PheIle: 1.094 ± 1.024
2.553PheLys: 2.553 ± 2.413
5.835PheLeu: 5.835 ± 1.091
1.094PheMet: 1.094 ± 0.525
1.823PheAsn: 1.823 ± 1.01
0.365PhePro: 0.365 ± 0.202
1.823PheGln: 1.823 ± 0.682
0.365PheArg: 0.365 ± 0.202
4.012PheSer: 4.012 ± 1.684
2.553PheThr: 2.553 ± 1.74
3.647PheVal: 3.647 ± 1.971
0.365PheTrp: 0.365 ± 0.202
2.188PheTyr: 2.188 ± 0.996
0.0PheXaa: 0.0 ± 0.0
Gly
4.012GlyAla: 4.012 ± 0.996
2.188GlyCys: 2.188 ± 3.472
4.376GlyAsp: 4.376 ± 1.372
5.106GlyGlu: 5.106 ± 2.335
2.553GlyPhe: 2.553 ± 1.848
2.918GlyGly: 2.918 ± 1.799
0.729GlyHis: 0.729 ± 0.404
1.823GlyIle: 1.823 ± 1.214
5.47GlyLys: 5.47 ± 1.525
6.565GlyLeu: 6.565 ± 3.52
1.459GlyMet: 1.459 ± 0.808
2.188GlyAsn: 2.188 ± 1.051
1.094GlyPro: 1.094 ± 1.371
2.188GlyGln: 2.188 ± 0.814
4.012GlyArg: 4.012 ± 1.578
1.823GlySer: 1.823 ± 1.01
2.553GlyThr: 2.553 ± 0.896
4.741GlyVal: 4.741 ± 1.507
0.729GlyTrp: 0.729 ± 0.404
0.729GlyTyr: 0.729 ± 0.404
0.0GlyXaa: 0.0 ± 0.0
His
2.188HisAla: 2.188 ± 0.996
1.094HisCys: 1.094 ± 0.679
1.094HisAsp: 1.094 ± 1.181
1.459HisGlu: 1.459 ± 0.808
0.729HisPhe: 0.729 ± 0.404
0.365HisGly: 0.365 ± 1.089
1.094HisHis: 1.094 ± 0.679
0.365HisIle: 0.365 ± 0.202
1.094HisLys: 1.094 ± 0.525
2.553HisLeu: 2.553 ± 0.957
0.365HisMet: 0.365 ± 0.643
1.094HisAsn: 1.094 ± 0.525
0.729HisPro: 0.729 ± 0.404
0.0HisGln: 0.0 ± 0.0
1.094HisArg: 1.094 ± 0.679
3.282HisSer: 3.282 ± 0.986
1.094HisThr: 1.094 ± 0.606
0.729HisVal: 0.729 ± 0.747
0.729HisTrp: 0.729 ± 1.703
1.094HisTyr: 1.094 ± 1.505
0.0HisXaa: 0.0 ± 0.0
Ile
4.012IleAla: 4.012 ± 3.492
1.094IleCys: 1.094 ± 0.606
1.094IleAsp: 1.094 ± 0.525
4.012IleGlu: 4.012 ± 1.233
1.094IlePhe: 1.094 ± 0.606
2.553IleGly: 2.553 ± 1.523
1.823IleHis: 1.823 ± 1.089
2.918IleIle: 2.918 ± 2.806
3.282IleLys: 3.282 ± 1.264
1.823IleLeu: 1.823 ± 1.414
3.647IleMet: 3.647 ± 1.364
0.729IleAsn: 0.729 ± 0.404
0.729IlePro: 0.729 ± 0.404
2.188IleGln: 2.188 ± 1.16
2.188IleArg: 2.188 ± 2.453
2.918IleSer: 2.918 ± 2.736
2.188IleThr: 2.188 ± 2.174
4.741IleVal: 4.741 ± 2.408
0.365IleTrp: 0.365 ± 0.202
1.823IleTyr: 1.823 ± 1.289
0.0IleXaa: 0.0 ± 0.0
Lys
3.282LysAla: 3.282 ± 1.214
2.918LysCys: 2.918 ± 2.401
1.459LysAsp: 1.459 ± 0.574
3.282LysGlu: 3.282 ± 1.818
2.918LysPhe: 2.918 ± 1.337
4.741LysGly: 4.741 ± 1.536
1.094LysHis: 1.094 ± 0.679
1.459LysIle: 1.459 ± 1.203
5.835LysLys: 5.835 ± 2.142
9.847LysLeu: 9.847 ± 1.768
0.365LysMet: 0.365 ± 0.202
2.553LysAsn: 2.553 ± 2.14
2.553LysPro: 2.553 ± 1.104
0.365LysGln: 0.365 ± 1.089
5.47LysArg: 5.47 ± 2.335
3.647LysSer: 3.647 ± 1.364
2.553LysThr: 2.553 ± 0.63
7.294LysVal: 7.294 ± 3.512
0.0LysTrp: 0.0 ± 0.0
2.188LysTyr: 2.188 ± 0.814
0.0LysXaa: 0.0 ± 0.0
Leu
5.835LeuAla: 5.835 ± 2.284
3.282LeuCys: 3.282 ± 1.818
5.835LeuAsp: 5.835 ± 1.66
6.565LeuGlu: 6.565 ± 1.055
5.47LeuPhe: 5.47 ± 1.896
6.929LeuGly: 6.929 ± 1.932
1.823LeuHis: 1.823 ± 1.01
5.106LeuIle: 5.106 ± 2.765
6.565LeuLys: 6.565 ± 2.607
8.388LeuLeu: 8.388 ± 2.691
1.823LeuMet: 1.823 ± 1.01
3.647LeuAsn: 3.647 ± 1.059
6.2LeuPro: 6.2 ± 1.634
2.188LeuGln: 2.188 ± 0.826
6.565LeuArg: 6.565 ± 1.163
8.753LeuSer: 8.753 ± 5.598
3.282LeuThr: 3.282 ± 1.264
6.565LeuVal: 6.565 ± 1.739
0.365LeuTrp: 0.365 ± 0.202
2.918LeuTyr: 2.918 ± 1.412
0.0LeuXaa: 0.0 ± 0.0
Met
4.012MetAla: 4.012 ± 1.501
1.094MetCys: 1.094 ± 0.606
1.459MetAsp: 1.459 ± 0.805
1.823MetGlu: 1.823 ± 0.682
0.0MetPhe: 0.0 ± 0.0
1.823MetGly: 1.823 ± 0.682
0.0MetHis: 0.0 ± 0.0
1.094MetIle: 1.094 ± 0.525
1.459MetLys: 1.459 ± 1.103
2.553MetLeu: 2.553 ± 1.462
0.365MetMet: 0.365 ± 0.202
0.729MetAsn: 0.729 ± 0.404
1.459MetPro: 1.459 ± 0.908
0.729MetGln: 0.729 ± 0.404
2.918MetArg: 2.918 ± 1.136
1.823MetSer: 1.823 ± 1.01
0.365MetThr: 0.365 ± 0.858
0.365MetVal: 0.365 ± 0.202
0.0MetTrp: 0.0 ± 0.0
0.365MetTyr: 0.365 ± 0.202
0.0MetXaa: 0.0 ± 0.0
Asn
1.823AsnAla: 1.823 ± 1.058
1.823AsnCys: 1.823 ± 1.01
0.365AsnAsp: 0.365 ± 0.202
3.282AsnGlu: 3.282 ± 1.244
2.918AsnPhe: 2.918 ± 1.219
0.365AsnGly: 0.365 ± 0.202
1.094AsnHis: 1.094 ± 1.299
1.459AsnIle: 1.459 ± 2.375
1.459AsnLys: 1.459 ± 0.805
3.647AsnLeu: 3.647 ± 1.939
0.365AsnMet: 0.365 ± 0.202
1.823AsnAsn: 1.823 ± 1.728
0.729AsnPro: 0.729 ± 0.551
0.729AsnGln: 0.729 ± 0.404
3.647AsnArg: 3.647 ± 1.986
1.459AsnSer: 1.459 ± 1.061
2.553AsnThr: 2.553 ± 1.054
2.553AsnVal: 2.553 ± 1.414
0.729AsnTrp: 0.729 ± 0.404
2.188AsnTyr: 2.188 ± 1.212
0.0AsnXaa: 0.0 ± 0.0
Pro
3.282ProAla: 3.282 ± 1.214
1.094ProCys: 1.094 ± 0.606
2.553ProAsp: 2.553 ± 1.62
1.823ProGlu: 1.823 ± 1.592
0.365ProPhe: 0.365 ± 0.202
2.553ProGly: 2.553 ± 1.593
1.094ProHis: 1.094 ± 1.335
1.459ProIle: 1.459 ± 1.719
2.918ProLys: 2.918 ± 0.927
2.188ProLeu: 2.188 ± 0.911
0.365ProMet: 0.365 ± 0.202
0.365ProAsn: 0.365 ± 0.643
2.918ProPro: 2.918 ± 2.464
1.459ProGln: 1.459 ± 0.574
4.012ProArg: 4.012 ± 2.605
2.918ProSer: 2.918 ± 1.201
4.012ProThr: 4.012 ± 2.702
1.459ProVal: 1.459 ± 0.668
1.094ProTrp: 1.094 ± 0.929
1.094ProTyr: 1.094 ± 0.606
0.0ProXaa: 0.0 ± 0.0
Gln
2.553GlnAla: 2.553 ± 1.604
0.729GlnCys: 0.729 ± 0.747
0.729GlnAsp: 0.729 ± 0.404
2.188GlnGlu: 2.188 ± 1.16
1.459GlnPhe: 1.459 ± 0.574
1.094GlnGly: 1.094 ± 1.024
1.094GlnHis: 1.094 ± 0.606
1.823GlnIle: 1.823 ± 0.931
0.729GlnLys: 0.729 ± 1.498
1.459GlnLeu: 1.459 ± 0.808
1.094GlnMet: 1.094 ± 0.509
0.365GlnAsn: 0.365 ± 1.354
1.823GlnPro: 1.823 ± 1.058
0.0GlnGln: 0.0 ± 0.0
1.823GlnArg: 1.823 ± 1.01
2.918GlnSer: 2.918 ± 0.95
1.094GlnThr: 1.094 ± 0.679
2.553GlnVal: 2.553 ± 1.333
0.365GlnTrp: 0.365 ± 0.202
0.729GlnTyr: 0.729 ± 0.404
0.0GlnXaa: 0.0 ± 0.0
Arg
6.565ArgAla: 6.565 ± 1.291
1.823ArgCys: 1.823 ± 3.809
3.282ArgAsp: 3.282 ± 1.161
2.918ArgGlu: 2.918 ± 1.612
5.106ArgPhe: 5.106 ± 1.405
2.188ArgGly: 2.188 ± 0.826
1.459ArgHis: 1.459 ± 1.384
1.823ArgIle: 1.823 ± 1.289
2.918ArgLys: 2.918 ± 2.44
5.835ArgLeu: 5.835 ± 1.69
1.459ArgMet: 1.459 ± 0.808
1.094ArgAsn: 1.094 ± 0.525
0.729ArgPro: 0.729 ± 1.286
1.094ArgGln: 1.094 ± 1.299
4.012ArgArg: 4.012 ± 3.541
7.294ArgSer: 7.294 ± 2.277
2.918ArgThr: 2.918 ± 1.616
3.647ArgVal: 3.647 ± 1.149
1.459ArgTrp: 1.459 ± 0.668
4.012ArgTyr: 4.012 ± 1.384
0.0ArgXaa: 0.0 ± 0.0
Ser
8.023SerAla: 8.023 ± 3.048
1.094SerCys: 1.094 ± 0.606
1.823SerAsp: 1.823 ± 0.682
5.106SerGlu: 5.106 ± 1.633
2.918SerPhe: 2.918 ± 1.137
4.376SerGly: 4.376 ± 1.807
0.729SerHis: 0.729 ± 0.992
2.188SerIle: 2.188 ± 1.979
8.753SerLys: 8.753 ± 2.522
5.47SerLeu: 5.47 ± 1.551
2.188SerMet: 2.188 ± 1.051
4.012SerAsn: 4.012 ± 2.147
3.282SerPro: 3.282 ± 2.921
2.553SerGln: 2.553 ± 1.551
4.376SerArg: 4.376 ± 1.389
7.294SerSer: 7.294 ± 2.63
4.376SerThr: 4.376 ± 2.172
6.2SerVal: 6.2 ± 2.993
0.0SerTrp: 0.0 ± 0.0
3.647SerTyr: 3.647 ± 1.691
0.0SerXaa: 0.0 ± 0.0
Thr
5.47ThrAla: 5.47 ± 1.822
0.365ThrCys: 0.365 ± 0.858
2.188ThrAsp: 2.188 ± 1.212
4.012ThrGlu: 4.012 ± 1.287
5.106ThrPhe: 5.106 ± 2.017
4.741ThrGly: 4.741 ± 1.797
1.094ThrHis: 1.094 ± 0.525
1.823ThrIle: 1.823 ± 1.01
2.553ThrLys: 2.553 ± 2.477
5.47ThrLeu: 5.47 ± 0.925
1.094ThrMet: 1.094 ± 0.606
0.729ThrAsn: 0.729 ± 0.551
2.188ThrPro: 2.188 ± 1.871
0.365ThrGln: 0.365 ± 0.643
2.188ThrArg: 2.188 ± 1.737
4.376ThrSer: 4.376 ± 1.389
0.365ThrThr: 0.365 ± 0.202
2.553ThrVal: 2.553 ± 1.104
0.0ThrTrp: 0.0 ± 0.0
1.459ThrTyr: 1.459 ± 1.265
0.0ThrXaa: 0.0 ± 0.0
Val
4.376ValAla: 4.376 ± 4.216
3.282ValCys: 3.282 ± 1.35
3.282ValAsp: 3.282 ± 2.275
8.023ValGlu: 8.023 ± 1.415
2.918ValPhe: 2.918 ± 1.043
5.47ValGly: 5.47 ± 3.237
3.282ValHis: 3.282 ± 0.798
4.741ValIle: 4.741 ± 2.782
3.282ValLys: 3.282 ± 1.492
9.117ValLeu: 9.117 ± 2.58
2.188ValMet: 2.188 ± 1.178
1.459ValAsn: 1.459 ± 0.808
3.647ValPro: 3.647 ± 1.149
1.094ValGln: 1.094 ± 0.606
3.282ValArg: 3.282 ± 1.264
5.835ValSer: 5.835 ± 1.636
4.012ValThr: 4.012 ± 1.068
9.117ValVal: 9.117 ± 3.625
0.365ValTrp: 0.365 ± 0.643
1.823ValTyr: 1.823 ± 0.717
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.365TrpCys: 0.365 ± 0.202
0.0TrpAsp: 0.0 ± 0.0
0.365TrpGlu: 0.365 ± 0.202
0.729TrpPhe: 0.729 ± 0.404
0.0TrpGly: 0.0 ± 0.0
0.365TrpHis: 0.365 ± 0.202
0.729TrpIle: 0.729 ± 0.747
0.0TrpLys: 0.0 ± 0.0
2.188TrpLeu: 2.188 ± 1.212
0.365TrpMet: 0.365 ± 0.974
0.729TrpAsn: 0.729 ± 0.551
0.365TrpPro: 0.365 ± 0.202
0.365TrpGln: 0.365 ± 0.643
0.365TrpArg: 0.365 ± 0.202
1.094TrpSer: 1.094 ± 0.929
0.365TrpThr: 0.365 ± 0.202
1.094TrpVal: 1.094 ± 0.929
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.553TyrAla: 2.553 ± 1.929
1.094TyrCys: 1.094 ± 1.182
1.823TyrAsp: 1.823 ± 1.183
2.553TyrGlu: 2.553 ± 1.414
1.823TyrPhe: 1.823 ± 1.217
1.459TyrGly: 1.459 ± 2.547
0.0TyrHis: 0.0 ± 0.0
3.282TyrIle: 3.282 ± 1.818
2.188TyrLys: 2.188 ± 0.826
3.647TyrLeu: 3.647 ± 2.02
0.365TyrMet: 0.365 ± 0.202
1.094TyrAsn: 1.094 ± 1.221
2.188TyrPro: 2.188 ± 0.826
1.094TyrGln: 1.094 ± 1.596
2.188TyrArg: 2.188 ± 1.592
2.553TyrSer: 2.553 ± 0.63
3.282TyrThr: 3.282 ± 3.204
1.823TyrVal: 1.823 ± 0.682
0.729TyrTrp: 0.729 ± 0.404
0.365TyrTyr: 0.365 ± 0.202
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2743 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski