Amino acid dipeptide frequency for Aeropyrum pernix bacilliform virus 1 (isolate -/Japan/Tanaka/2005) (APBV1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
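The calculation described above can be sketched in Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be given in one-letter codes, pairs are assumed to be counted as overlapping windows within each protein, and the database's exact normalization may differ.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_permille, n_dipeptides=400):
    """Compare a frequency with the uniform expectation of 1000 / n_dipeptides."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input, not real APBV1 sequences.
freqs = dipeptide_permille(["MALLGV", "GLAAX"])
print(classify(freqs["GL"]), classify(freqs["WW"]))
```

With this convention, a value such as 16.753‰ would be labelled "over" and 0.644‰ "under", which corresponds to the red and blue marking described above.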

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
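As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input 16.753 is the GlyLeu value from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


gly_leu = 16.753  # GlyLeu frequency in per mille, taken from the table
print(permille_to_fraction(gly_leu))
print(permille_to_percent(gly_leu))
```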

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.51 ± 2.195
AlaCys: 0.644 ± 0.733
AlaAsp: 4.51 ± 1.799
AlaGlu: 7.088 ± 1.856
AlaPhe: 1.933 ± 0.788
AlaGly: 7.088 ± 1.514
AlaHis: 0.0 ± 0.0
AlaIle: 5.155 ± 1.678
AlaLys: 0.644 ± 0.733
AlaLeu: 10.954 ± 3.305
AlaMet: 0.644 ± 0.686
AlaAsn: 2.577 ± 1.579
AlaPro: 5.799 ± 2.124
AlaGln: 2.577 ± 1.286
AlaArg: 5.155 ± 2.307
AlaSer: 4.51 ± 1.551
AlaThr: 1.933 ± 0.982
AlaVal: 9.665 ± 2.465
AlaTrp: 0.644 ± 0.478
AlaTyr: 3.222 ± 1.352
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.644 ± 0.714
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.289 ± 0.942
CysLys: 0.644 ± 0.579
CysLeu: 1.933 ± 1.048
CysMet: 0.644 ± 0.723
CysAsn: 0.644 ± 0.579
CysPro: 0.644 ± 0.579
CysGln: 0.644 ± 0.579
CysArg: 0.644 ± 0.588
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.289 ± 1.002
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.933 ± 1.132
AspCys: 1.289 ± 0.942
AspAsp: 1.289 ± 0.735
AspGlu: 0.644 ± 0.644
AspPhe: 1.933 ± 1.345
AspGly: 1.933 ± 1.244
AspHis: 0.0 ± 0.0
AspIle: 4.51 ± 1.799
AspLys: 1.933 ± 0.788
AspLeu: 6.443 ± 2.073
AspMet: 2.577 ± 1.206
AspAsn: 1.289 ± 1.089
AspPro: 5.799 ± 2.049
AspGln: 0.644 ± 0.544
AspArg: 1.933 ± 1.106
AspSer: 2.577 ± 1.725
AspThr: 4.51 ± 1.282
AspVal: 2.577 ± 1.16
AspTrp: 1.289 ± 1.095
AspTyr: 2.577 ± 1.159
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.732 ± 3.271
GluCys: 0.0 ± 0.0
GluAsp: 3.866 ± 1.837
GluGlu: 5.799 ± 2.677
GluPhe: 5.799 ± 1.906
GluGly: 5.799 ± 2.109
GluHis: 0.0 ± 0.0
GluIle: 0.644 ± 0.798
GluLys: 0.644 ± 0.747
GluLeu: 5.799 ± 1.699
GluMet: 0.0 ± 0.0
GluAsn: 1.289 ± 0.835
GluPro: 1.289 ± 0.882
GluGln: 0.0 ± 0.0
GluArg: 5.155 ± 1.953
GluSer: 1.933 ± 0.92
GluThr: 4.51 ± 1.383
GluVal: 6.443 ± 1.898
GluTrp: 1.289 ± 0.956
GluTyr: 2.577 ± 1.216
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.933 ± 1.036
PheCys: 0.0 ± 0.0
PheAsp: 3.866 ± 1.79
PheGlu: 2.577 ± 1.351
PhePhe: 3.866 ± 2.154
PheGly: 1.933 ± 0.929
PheHis: 1.933 ± 0.884
PheIle: 1.933 ± 2.26
PheLys: 1.289 ± 0.961
PheLeu: 3.222 ± 1.275
PheMet: 2.577 ± 1.529
PheAsn: 0.644 ± 0.588
PhePro: 0.644 ± 0.544
PheGln: 0.644 ± 0.588
PheArg: 2.577 ± 1.318
PheSer: 3.222 ± 1.373
PheThr: 1.289 ± 1.089
PheVal: 1.933 ± 0.889
PheTrp: 0.0 ± 0.0
PheTyr: 0.644 ± 0.747
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.51 ± 1.926
GlyCys: 0.0 ± 0.0
GlyAsp: 3.866 ± 1.7
GlyGlu: 4.51 ± 1.993
GlyPhe: 2.577 ± 1.153
GlyGly: 6.443 ± 2.496
GlyHis: 0.644 ± 0.714
GlyIle: 3.222 ± 1.544
GlyLys: 2.577 ± 1.265
GlyLeu: 16.753 ± 3.668
GlyMet: 0.644 ± 0.585
GlyAsn: 1.289 ± 1.089
GlyPro: 6.443 ± 3.027
GlyGln: 3.222 ± 1.173
GlyArg: 5.155 ± 2.24
GlySer: 2.577 ± 1.239
GlyThr: 2.577 ± 1.169
GlyVal: 8.376 ± 2.051
GlyTrp: 0.644 ± 0.478
GlyTyr: 1.933 ± 0.879
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.644 ± 0.478
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.644 ± 0.747
HisIle: 0.0 ± 0.0
HisLys: 3.866 ± 1.961
HisLeu: 2.577 ± 1.019
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 1.933 ± 1.21
HisSer: 1.933 ± 1.461
HisThr: 0.0 ± 0.0
HisVal: 1.289 ± 0.701
HisTrp: 0.0 ± 0.0
HisTyr: 1.289 ± 0.77
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.222 ± 1.267
IleCys: 0.0 ± 0.0
IleAsp: 3.866 ± 1.431
IleGlu: 2.577 ± 1.051
IlePhe: 1.933 ± 1.709
IleGly: 3.866 ± 1.576
IleHis: 1.289 ± 0.753
IleIle: 3.222 ± 1.176
IleLys: 3.866 ± 1.289
IleLeu: 3.222 ± 1.256
IleMet: 2.577 ± 1.136
IleAsn: 3.866 ± 1.163
IlePro: 3.222 ± 1.248
IleGln: 3.866 ± 1.395
IleArg: 5.799 ± 1.691
IleSer: 7.732 ± 1.951
IleThr: 3.866 ± 1.685
IleVal: 6.443 ± 1.976
IleTrp: 2.577 ± 1.294
IleTyr: 0.644 ± 0.714
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.222 ± 1.246
LysCys: 1.933 ± 1.25
LysAsp: 4.51 ± 1.547
LysGlu: 1.289 ± 0.935
LysPhe: 1.289 ± 0.844
LysGly: 3.866 ± 1.604
LysHis: 0.644 ± 0.657
LysIle: 1.933 ± 0.948
LysLys: 1.933 ± 1.084
LysLeu: 3.222 ± 1.739
LysMet: 1.289 ± 0.75
LysAsn: 0.644 ± 0.579
LysPro: 3.222 ± 1.451
LysGln: 2.577 ± 1.31
LysArg: 5.155 ± 1.747
LysSer: 1.933 ± 1.16
LysThr: 2.577 ± 1.217
LysVal: 5.155 ± 1.599
LysTrp: 0.0 ± 0.0
LysTyr: 1.289 ± 0.68
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.108 ± 3.512
LeuCys: 1.289 ± 0.942
LeuAsp: 3.222 ± 1.284
LeuGlu: 10.309 ± 4.079
LeuPhe: 4.51 ± 1.579
LeuGly: 7.732 ± 2.399
LeuHis: 1.289 ± 0.969
LeuIle: 9.665 ± 2.055
LeuLys: 9.021 ± 3.105
LeuLeu: 14.175 ± 3.19
LeuMet: 4.51 ± 1.921
LeuAsn: 1.933 ± 0.827
LeuPro: 3.866 ± 1.734
LeuGln: 1.933 ± 1.137
LeuArg: 7.088 ± 2.009
LeuSer: 3.866 ± 1.28
LeuThr: 3.866 ± 1.079
LeuVal: 7.732 ± 1.699
LeuTrp: 1.933 ± 0.911
LeuTyr: 6.443 ± 1.815
LeuXaa: 0.0 ± 0.0
Met
MetAla: 5.155 ± 1.719
MetCys: 0.0 ± 0.0
MetAsp: 0.644 ± 0.733
MetGlu: 1.289 ± 0.863
MetPhe: 0.0 ± 0.0
MetGly: 1.933 ± 1.084
MetHis: 0.0 ± 0.0
MetIle: 0.644 ± 0.747
MetLys: 3.222 ± 1.797
MetLeu: 3.222 ± 1.384
MetMet: 0.0 ± 0.0
MetAsn: 1.289 ± 0.68
MetPro: 0.644 ± 0.544
MetGln: 0.0 ± 0.0
MetArg: 3.222 ± 1.441
MetSer: 0.0 ± 0.0
MetThr: 1.289 ± 0.89
MetVal: 1.289 ± 0.844
MetTrp: 1.289 ± 0.935
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.644 ± 0.478
AsnPhe: 0.0 ± 0.0
AsnGly: 1.289 ± 0.81
AsnHis: 0.644 ± 0.714
AsnIle: 2.577 ± 1.402
AsnLys: 3.222 ± 2.291
AsnLeu: 5.799 ± 1.865
AsnMet: 0.644 ± 0.657
AsnAsn: 1.933 ± 1.023
AsnPro: 2.577 ± 1.166
AsnGln: 1.289 ± 1.507
AsnArg: 1.289 ± 0.995
AsnSer: 1.933 ± 1.028
AsnThr: 3.222 ± 1.723
AsnVal: 2.577 ± 1.266
AsnTrp: 1.933 ± 1.092
AsnTyr: 1.289 ± 0.753
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.866 ± 1.258
ProCys: 0.644 ± 0.579
ProAsp: 1.933 ± 1.323
ProGlu: 3.866 ± 1.436
ProPhe: 2.577 ± 1.514
ProGly: 2.577 ± 1.217
ProHis: 0.644 ± 0.579
ProIle: 1.933 ± 0.986
ProLys: 2.577 ± 1.206
ProLeu: 3.222 ± 1.465
ProMet: 3.222 ± 1.326
ProAsn: 2.577 ± 1.498
ProPro: 2.577 ± 1.257
ProGln: 0.644 ± 0.758
ProArg: 1.289 ± 1.428
ProSer: 3.222 ± 1.781
ProThr: 3.866 ± 1.405
ProVal: 4.51 ± 1.814
ProTrp: 0.644 ± 0.758
ProTyr: 0.644 ± 0.544
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.51 ± 1.976
GlnCys: 0.0 ± 0.0
GlnAsp: 1.289 ± 1.077
GlnGlu: 1.289 ± 0.863
GlnPhe: 0.0 ± 0.0
GlnGly: 1.933 ± 1.131
GlnHis: 0.644 ± 0.544
GlnIle: 0.644 ± 0.644
GlnLys: 0.644 ± 0.579
GlnLeu: 1.933 ± 1.067
GlnMet: 0.0 ± 0.0
GlnAsn: 0.644 ± 0.714
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 0.644 ± 0.579
GlnSer: 1.933 ± 1.022
GlnThr: 1.289 ± 0.928
GlnVal: 5.155 ± 1.949
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.933 ± 0.788
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.155 ± 1.792
ArgCys: 0.0 ± 0.0
ArgAsp: 0.644 ± 0.644
ArgGlu: 6.443 ± 1.971
ArgPhe: 2.577 ± 1.726
ArgGly: 7.732 ± 2.518
ArgHis: 0.644 ± 0.588
ArgIle: 3.866 ± 1.563
ArgLys: 0.644 ± 0.657
ArgLeu: 6.443 ± 2.069
ArgMet: 1.289 ± 0.926
ArgAsn: 3.222 ± 1.11
ArgPro: 3.866 ± 1.431
ArgGln: 0.0 ± 0.0
ArgArg: 5.155 ± 1.954
ArgSer: 2.577 ± 1.501
ArgThr: 1.933 ± 1.414
ArgVal: 5.799 ± 1.916
ArgTrp: 0.644 ± 0.644
ArgTyr: 1.933 ± 1.21
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.289 ± 0.914
SerCys: 0.644 ± 0.588
SerAsp: 0.644 ± 0.657
SerGlu: 1.933 ± 0.914
SerPhe: 1.933 ± 1.133
SerGly: 5.799 ± 1.713
SerHis: 3.222 ± 0.906
SerIle: 5.155 ± 1.773
SerLys: 1.933 ± 1.039
SerLeu: 7.732 ± 1.874
SerMet: 0.0 ± 0.0
SerAsn: 0.644 ± 0.657
SerPro: 1.933 ± 1.133
SerGln: 0.644 ± 0.588
SerArg: 3.866 ± 2.137
SerSer: 2.577 ± 1.743
SerThr: 2.577 ± 1.188
SerVal: 3.866 ± 1.575
SerTrp: 1.289 ± 0.928
SerTyr: 3.222 ± 1.575
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.933 ± 0.889
ThrCys: 0.0 ± 0.0
ThrAsp: 0.644 ± 0.478
ThrGlu: 2.577 ± 1.057
ThrPhe: 0.644 ± 0.544
ThrGly: 5.799 ± 1.91
ThrHis: 0.644 ± 0.714
ThrIle: 3.866 ± 1.308
ThrLys: 1.933 ± 1.125
ThrLeu: 7.088 ± 2.125
ThrMet: 0.0 ± 0.0
ThrAsn: 1.933 ± 1.323
ThrPro: 3.222 ± 1.02
ThrGln: 2.577 ± 1.469
ThrArg: 0.644 ± 0.579
ThrSer: 3.866 ± 1.389
ThrThr: 5.155 ± 1.967
ThrVal: 3.866 ± 1.546
ThrTrp: 1.289 ± 1.077
ThrTyr: 2.577 ± 1.362
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.954 ± 1.508
ValCys: 0.644 ± 0.714
ValAsp: 8.376 ± 1.925
ValGlu: 5.155 ± 1.547
ValPhe: 3.866 ± 1.582
ValGly: 7.732 ± 2.055
ValHis: 0.644 ± 0.588
ValIle: 9.665 ± 1.848
ValLys: 5.155 ± 1.774
ValLeu: 9.021 ± 2.327
ValMet: 2.577 ± 1.043
ValAsn: 3.866 ± 1.574
ValPro: 1.289 ± 0.906
ValGln: 1.289 ± 0.872
ValArg: 1.289 ± 0.851
ValSer: 1.933 ± 0.836
ValThr: 3.866 ± 1.237
ValVal: 7.732 ± 1.925
ValTrp: 1.933 ± 1.021
ValTyr: 3.222 ± 1.076
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.289 ± 0.777
TrpCys: 0.644 ± 0.579
TrpAsp: 1.289 ± 0.841
TrpGlu: 1.289 ± 0.928
TrpPhe: 0.644 ± 0.798
TrpGly: 1.289 ± 0.923
TrpHis: 0.0 ± 0.0
TrpIle: 1.933 ± 1.15
TrpLys: 0.0 ± 0.0
TrpLeu: 0.644 ± 0.588
TrpMet: 0.644 ± 0.747
TrpAsn: 1.289 ± 0.956
TrpPro: 0.644 ± 0.588
TrpGln: 0.644 ± 0.714
TrpArg: 1.289 ± 0.79
TrpSer: 0.644 ± 0.478
TrpThr: 0.0 ± 0.0
TrpVal: 1.933 ± 1.279
TrpTrp: 1.289 ± 0.975
TrpTyr: 1.289 ± 0.678
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.933 ± 1.039
TyrCys: 1.289 ± 0.753
TyrAsp: 3.222 ± 1.811
TyrGlu: 0.644 ± 0.644
TyrPhe: 0.0 ± 0.0
TyrGly: 3.222 ± 1.713
TyrHis: 0.644 ± 0.714
TyrIle: 5.799 ± 1.9
TyrLys: 1.933 ± 0.904
TyrLeu: 7.088 ± 2.145
TyrMet: 0.644 ± 0.733
TyrAsn: 1.289 ± 0.838
TyrPro: 0.0 ± 0.0
TyrGln: 1.289 ± 0.919
TyrArg: 1.289 ± 0.899
TyrSer: 1.933 ± 0.84
TyrThr: 1.933 ± 1.296
TyrVal: 2.577 ± 1.023
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.933 ± 0.836
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (1553 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
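A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement 100 times, and the reported error is assumed to be the standard deviation of the resampled frequencies.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    total = hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy sequences in one-letter codes, not real APBV1 proteins.
toy_proteins = ["MALLGV", "GLAAX", "LLLL"]
print(bootstrap_error(toy_proteins, "LL"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.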

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski