Amino acid dipeptide frequency for Escherichia virus Qbeta (Bacteriophage Q-beta)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
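As an illustration of what a dipeptide frequency is, here is a minimal Python sketch that counts overlapping adjacent pairs. It is not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice never to count a pair across two proteins are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: pairs overlap within a protein and are never
    formed across the boundary between two proteins (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale each count to per mille of all observed dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made up, not taken from the Qbeta proteome).
toy = ["MAKLA", "AKL"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input contains six dipeptides in total, so a pair seen twice gets one third of the total, expressed in per mille.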

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
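The expected value and the over/under marking can be sketched in Python as follows. The page does not state the exact rule used for colouring, so the plain comparison against the uniform expectation is an assumption, as are the names `expected_per_mille` and `classify`. The three sample values are taken from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


expected = expected_per_mille(20)       # 400 possibilities
expected_xaa = expected_per_mille(21)   # 441 possibilities, Xaa included

# Sample values copied from the table (per mille).
sample = {"AlaAla": 4.762, "AlaPro": 1.361, "LeuLeu": 10.884}
labels = {pair: classify(value, expected) for pair, value in sample.items()}
print(expected, labels)
```

With 20 amino acids the expectation is 2.5‰; counting Xaa as a 21st residue lowers it to about 2.27‰.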

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
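The unit conversion can be written as a short Python sketch. The helper names are hypothetical, and the example value is the AlaLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_leu = 10.204  # AlaLeu frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_leu)
percent = per_mille_to_percent(ala_leu)
print(fraction, percent)
```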

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.762 ± 0.827
AlaCys: 2.721 ± 0.945
AlaAsp: 4.082 ± 0.872
AlaGlu: 3.401 ± 2.316
AlaPhe: 4.082 ± 1.868
AlaGly: 2.721 ± 1.483
AlaHis: 2.041 ± 0.786
AlaIle: 4.082 ± 1.84
AlaLys: 3.401 ± 0.54
AlaLeu: 10.204 ± 1.978
AlaMet: 2.041 ± 1.12
AlaAsn: 4.082 ± 0.59
AlaPro: 1.361 ± 0.509
AlaGln: 1.361 ± 0.509
AlaArg: 2.721 ± 1.186
AlaSer: 7.483 ± 1.177
AlaThr: 5.442 ± 0.768
AlaVal: 7.483 ± 1.108
AlaTrp: 1.361 ± 0.509
AlaTyr: 5.442 ± 2.712
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.361 ± 0.927
CysCys: 0.0 ± 0.0
CysAsp: 2.041 ± 0.906
CysGlu: 1.361 ± 0.927
CysPhe: 0.0 ± 0.0
CysGly: 0.68 ± 0.463
CysHis: 0.0 ± 0.0
CysIle: 2.041 ± 1.39
CysLys: 0.0 ± 0.0
CysLeu: 0.68 ± 0.463
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.361 ± 1.369
CysGln: 0.0 ± 0.0
CysArg: 2.041 ± 0.861
CysSer: 1.361 ± 0.625
CysThr: 2.041 ± 0.906
CysVal: 0.0 ± 0.0
CysTrp: 0.68 ± 0.463
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.041 ± 1.594
AspCys: 0.0 ± 0.0
AspAsp: 2.721 ± 1.186
AspGlu: 1.361 ± 1.085
AspPhe: 5.442 ± 1.46
AspGly: 6.803 ± 1.741
AspHis: 0.68 ± 0.463
AspIle: 6.803 ± 1.094
AspLys: 1.361 ± 0.698
AspLeu: 9.524 ± 2.146
AspMet: 0.0 ± 0.0
AspAsn: 2.041 ± 1.72
AspPro: 5.442 ± 3.028
AspGln: 4.082 ± 1.84
AspArg: 3.401 ± 1.241
AspSer: 4.082 ± 0.715
AspThr: 2.041 ± 0.98
AspVal: 4.082 ± 0.872
AspTrp: 3.401 ± 2.119
AspTyr: 1.361 ± 0.509
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.721 ± 1.853
GluCys: 1.361 ± 0.927
GluAsp: 1.361 ± 0.509
GluGlu: 2.041 ± 1.594
GluPhe: 2.721 ± 1.581
GluGly: 5.442 ± 1.161
GluHis: 0.0 ± 0.0
GluIle: 2.721 ± 1.018
GluLys: 2.721 ± 0.945
GluLeu: 6.122 ± 0.516
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 1.361 ± 0.625
GluGln: 0.0 ± 0.0
GluArg: 4.082 ± 1.139
GluSer: 3.401 ± 1.62
GluThr: 2.041 ± 1.071
GluVal: 4.082 ± 1.27
GluTrp: 0.0 ± 0.0
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.762 ± 1.073
PheCys: 0.0 ± 0.0
PheAsp: 6.803 ± 1.282
PheGlu: 1.361 ± 0.927
PhePhe: 0.68 ± 0.463
PheGly: 2.041 ± 1.079
PheHis: 0.0 ± 0.0
PheIle: 2.041 ± 1.079
PheLys: 2.721 ± 1.018
PheLeu: 3.401 ± 2.083
PheMet: 0.0 ± 0.0
PheAsn: 2.721 ± 1.186
PhePro: 0.68 ± 0.573
PheGln: 2.041 ± 0.786
PheArg: 2.721 ± 1.018
PheSer: 8.163 ± 1.57
PheThr: 3.401 ± 1.125
PheVal: 2.041 ± 1.071
PheTrp: 0.68 ± 0.684
PheTyr: 2.041 ± 0.98
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.762 ± 1.471
GlyCys: 1.361 ± 0.625
GlyAsp: 4.082 ± 1.527
GlyGlu: 3.401 ± 2.083
GlyPhe: 2.041 ± 0.353
GlyGly: 4.082 ± 1.465
GlyHis: 1.361 ± 0.509
GlyIle: 3.401 ± 1.62
GlyLys: 6.122 ± 2.446
GlyLeu: 4.082 ± 2.648
GlyMet: 0.68 ± 0.463
GlyAsn: 6.122 ± 1.906
GlyPro: 1.361 ± 0.927
GlyGln: 1.361 ± 0.509
GlyArg: 2.721 ± 1.523
GlySer: 6.122 ± 1.362
GlyThr: 2.721 ± 0.438
GlyVal: 5.442 ± 2.143
GlyTrp: 2.041 ± 1.39
GlyTyr: 3.401 ± 0.547
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.68 ± 0.684
HisCys: 0.68 ± 0.463
HisAsp: 0.68 ± 0.573
HisGlu: 2.041 ± 0.786
HisPhe: 0.68 ± 0.463
HisGly: 1.361 ± 1.147
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 0.68 ± 0.573
HisMet: 0.68 ± 0.463
HisAsn: 1.361 ± 1.147
HisPro: 1.361 ± 0.509
HisGln: 0.0 ± 0.0
HisArg: 2.041 ± 0.98
HisSer: 0.68 ± 0.463
HisThr: 1.361 ± 0.509
HisVal: 0.68 ± 0.573
HisTrp: 0.0 ± 0.0
HisTyr: 0.68 ± 0.463
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.762 ± 1.239
IleCys: 0.68 ± 0.463
IleAsp: 7.483 ± 2.129
IleGlu: 2.721 ± 0.438
IlePhe: 1.361 ± 0.509
IleGly: 3.401 ± 0.54
IleHis: 0.68 ± 0.463
IleIle: 0.68 ± 0.463
IleLys: 1.361 ± 1.147
IleLeu: 4.082 ± 1.573
IleMet: 0.68 ± 0.463
IleAsn: 4.082 ± 0.715
IlePro: 2.041 ± 1.226
IleGln: 4.082 ± 1.139
IleArg: 4.082 ± 0.715
IleSer: 4.762 ± 2.52
IleThr: 1.361 ± 0.509
IleVal: 2.721 ± 0.963
IleTrp: 0.68 ± 0.684
IleTyr: 1.361 ± 1.369
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.361 ± 0.927
LysCys: 0.68 ± 0.684
LysAsp: 2.041 ± 1.594
LysGlu: 1.361 ± 1.147
LysPhe: 2.721 ± 1.233
LysGly: 2.721 ± 1.523
LysHis: 1.361 ± 0.509
LysIle: 4.082 ± 0.713
LysLys: 1.361 ± 0.625
LysLeu: 4.082 ± 0.59
LysMet: 0.0 ± 0.0
LysAsn: 3.401 ± 1.292
LysPro: 2.041 ± 1.258
LysGln: 2.041 ± 0.906
LysArg: 2.721 ± 1.331
LysSer: 1.361 ± 0.698
LysThr: 2.721 ± 1.233
LysVal: 2.041 ± 0.906
LysTrp: 1.361 ± 0.698
LysTyr: 5.442 ± 0.875
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.204 ± 1.978
LeuCys: 0.68 ± 0.463
LeuAsp: 5.442 ± 2.137
LeuGlu: 6.122 ± 2.182
LeuPhe: 4.082 ± 1.961
LeuGly: 5.442 ± 1.018
LeuHis: 1.361 ± 0.509
LeuIle: 5.442 ± 1.893
LeuLys: 5.442 ± 0.768
LeuLeu: 10.884 ± 1.135
LeuMet: 1.361 ± 0.509
LeuAsn: 6.803 ± 1.839
LeuPro: 6.122 ± 2.359
LeuGln: 1.361 ± 0.625
LeuArg: 10.884 ± 4.072
LeuSer: 6.803 ± 0.851
LeuThr: 2.041 ± 0.98
LeuVal: 4.082 ± 0.59
LeuTrp: 1.361 ± 1.147
LeuTyr: 3.401 ± 1.241
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.041 ± 0.906
MetCys: 0.0 ± 0.0
MetAsp: 0.68 ± 0.463
MetGlu: 0.0 ± 0.0
MetPhe: 0.68 ± 0.463
MetGly: 0.68 ± 0.463
MetHis: 0.0 ± 0.0
MetIle: 0.68 ± 0.573
MetLys: 0.0 ± 0.0
MetLeu: 1.361 ± 0.509
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.361 ± 1.147
MetGln: 0.0 ± 0.0
MetArg: 0.68 ± 0.684
MetSer: 1.361 ± 0.927
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.082 ± 1.323
AsnCys: 0.0 ± 0.0
AsnAsp: 3.401 ± 1.241
AsnGlu: 0.68 ± 0.573
AsnPhe: 0.68 ± 0.463
AsnGly: 6.803 ± 2.397
AsnHis: 0.0 ± 0.0
AsnIle: 2.721 ± 1.483
AsnLys: 0.68 ± 0.463
AsnLeu: 4.762 ± 1.944
AsnMet: 0.68 ± 0.463
AsnAsn: 2.041 ± 1.39
AsnPro: 7.483 ± 3.667
AsnGln: 0.68 ± 0.463
AsnArg: 4.762 ± 0.712
AsnSer: 2.721 ± 1.186
AsnThr: 2.721 ± 1.233
AsnVal: 2.041 ± 1.72
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.041 ± 0.906
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.122 ± 1.54
ProCys: 0.0 ± 0.0
ProAsp: 4.762 ± 1.239
ProGlu: 0.0 ± 0.0
ProPhe: 5.442 ± 1.612
ProGly: 2.721 ± 1.249
ProHis: 1.361 ± 1.147
ProIle: 2.721 ± 1.901
ProLys: 3.401 ± 1.241
ProLeu: 2.721 ± 0.71
ProMet: 0.0 ± 0.0
ProAsn: 0.68 ± 0.573
ProPro: 3.401 ± 2.557
ProGln: 0.68 ± 0.463
ProArg: 6.122 ± 0.974
ProSer: 6.122 ± 2.158
ProThr: 4.082 ± 2.398
ProVal: 3.401 ± 1.252
ProTrp: 1.361 ± 0.698
ProTyr: 1.361 ± 0.509
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.442 ± 2.712
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.68 ± 0.573
GlnPhe: 0.0 ± 0.0
GlnGly: 1.361 ± 0.509
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 0.68 ± 0.684
GlnLeu: 4.762 ± 1.074
GlnMet: 0.0 ± 0.0
GlnAsn: 2.041 ± 1.071
GlnPro: 2.041 ± 1.594
GlnGln: 0.68 ± 0.573
GlnArg: 2.721 ± 1.018
GlnSer: 1.361 ± 0.509
GlnThr: 2.041 ± 0.906
GlnVal: 2.041 ± 1.071
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.721 ± 0.71
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.483 ± 1.281
ArgCys: 2.041 ± 1.39
ArgAsp: 4.762 ± 1.279
ArgGlu: 4.762 ± 0.646
ArgPhe: 3.401 ± 0.547
ArgGly: 5.442 ± 1.266
ArgHis: 3.401 ± 1.453
ArgIle: 2.721 ± 1.853
ArgLys: 2.721 ± 0.945
ArgLeu: 6.122 ± 1.982
ArgMet: 0.0 ± 0.508
ArgAsn: 3.401 ± 0.54
ArgPro: 3.401 ± 1.252
ArgGln: 2.041 ± 1.071
ArgArg: 5.442 ± 2.426
ArgSer: 3.401 ± 2.316
ArgThr: 2.721 ± 1.331
ArgVal: 5.442 ± 2.411
ArgTrp: 0.68 ± 0.463
ArgTyr: 2.041 ± 0.98
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.762 ± 1.944
SerCys: 3.401 ± 1.18
SerAsp: 3.401 ± 0.547
SerGlu: 3.401 ± 2.316
SerPhe: 5.442 ± 1.561
SerGly: 4.082 ± 0.951
SerHis: 0.0 ± 0.0
SerIle: 2.721 ± 0.438
SerLys: 2.721 ± 1.233
SerLeu: 9.524 ± 1.14
SerMet: 0.68 ± 0.463
SerAsn: 2.721 ± 1.018
SerPro: 4.762 ± 1.919
SerGln: 2.721 ± 2.171
SerArg: 5.442 ± 1.018
SerSer: 5.442 ± 1.161
SerThr: 2.721 ± 0.945
SerVal: 7.483 ± 1.864
SerTrp: 0.0 ± 0.0
SerTyr: 3.401 ± 0.547
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.762 ± 1.757
ThrCys: 0.68 ± 0.684
ThrAsp: 3.401 ± 1.292
ThrGlu: 2.041 ± 0.906
ThrPhe: 4.082 ± 0.59
ThrGly: 1.361 ± 0.625
ThrHis: 1.361 ± 0.509
ThrIle: 3.401 ± 1.241
ThrLys: 4.762 ± 2.431
ThrLeu: 5.442 ± 2.615
ThrMet: 0.68 ± 0.573
ThrAsn: 4.082 ± 1.524
ThrPro: 3.401 ± 1.241
ThrGln: 2.721 ± 1.331
ThrArg: 2.721 ± 0.945
ThrSer: 0.68 ± 0.573
ThrThr: 4.082 ± 0.715
ThrVal: 6.803 ± 1.462
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.361 ± 0.625
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.762 ± 0.712
ValCys: 0.68 ± 0.463
ValAsp: 4.082 ± 1.961
ValGlu: 2.721 ± 0.438
ValPhe: 2.041 ± 0.786
ValGly: 4.082 ± 1.032
ValHis: 0.0 ± 0.0
ValIle: 4.082 ± 1.562
ValLys: 2.721 ± 1.331
ValLeu: 6.122 ± 1.197
ValMet: 0.68 ± 0.573
ValAsn: 2.041 ± 1.071
ValPro: 6.122 ± 0.464
ValGln: 2.041 ± 1.071
ValArg: 2.721 ± 1.483
ValSer: 4.762 ± 1.757
ValThr: 10.204 ± 2.891
ValVal: 2.721 ± 0.719
ValTrp: 0.68 ± 0.463
ValTyr: 2.041 ± 0.861
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.68 ± 0.463
TrpCys: 0.0 ± 0.0
TrpAsp: 2.041 ± 0.353
TrpGlu: 1.361 ± 0.509
TrpPhe: 2.041 ± 0.98
TrpGly: 0.68 ± 0.463
TrpHis: 0.0 ± 0.0
TrpIle: 0.68 ± 0.463
TrpLys: 1.361 ± 1.147
TrpLeu: 0.68 ± 0.573
TrpMet: 0.0 ± 0.0
TrpAsn: 0.68 ± 0.463
TrpPro: 0.68 ± 0.684
TrpGln: 0.68 ± 0.573
TrpArg: 1.361 ± 1.369
TrpSer: 0.68 ± 0.684
TrpThr: 0.68 ± 0.684
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.68 ± 0.573
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.721 ± 1.331
TyrCys: 0.68 ± 0.684
TyrAsp: 3.401 ± 0.547
TyrGlu: 1.361 ± 0.625
TyrPhe: 0.68 ± 0.573
TyrGly: 4.762 ± 1.959
TyrHis: 2.041 ± 1.079
TyrIle: 2.041 ± 0.861
TyrLys: 1.361 ± 1.085
TyrLeu: 4.082 ± 0.951
TyrMet: 0.68 ± 0.755
TyrAsn: 0.68 ± 0.684
TyrPro: 0.68 ± 0.573
TyrGln: 0.0 ± 0.0
TyrArg: 2.721 ± 1.186
TyrSer: 4.082 ± 0.59
TyrThr: 3.401 ± 1.005
TyrVal: 2.721 ± 1.186
TyrTrp: 0.68 ± 0.684
TyrTyr: 0.68 ± 0.463
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1471 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
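A protein-level bootstrap of this kind might look like the following Python sketch. The source only states that bootstrapping was done 100 times at the protein level, so reporting the standard deviation of the resampled estimates as the error is an assumption. The function names, the fixed seed, and the toy sequences are also hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Whole proteins are drawn with replacement, so each resample has the
    same number of proteins as the original set.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Toy sequences (made up, not the Qbeta proteins).
toy = ["MAKLA", "AKLGG", "GGAKM", "LLLLA"]
err_ak = bootstrap_error(toy, "AK")
err_ww = bootstrap_error(toy, "WW")
print(err_ak, err_ww)
```

Under this scheme a dipeptide that is absent from every protein has zero frequency in every resample, so its estimated error is exactly zero.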

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski