Amino acid dipeptide frequency for Blackberry virus E

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies (the frequencies of pairs of adjacent amino acids).
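As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and expresses them in per mille. The function names, the one-letter input format, and the normalization by the total number of counted dipeptides are assumptions made for this example; the exact counting and normalization used by Proteome-pI may differ.

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids; any other symbol
# is mapped to Xaa (assumption made for this sketch).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs; pairs never span two proteins."""
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(first + second for first, second in zip(codes, codes[1:]))
    return counts


def dipeptide_per_mille(proteins):
    """Return the observed dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

For example, `dipeptide_per_mille(["AAC"])` sees two dipeptides, AlaAla and AlaCys, and assigns each a frequency of 500‰.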

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
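The uniform expectation and the over/under comparison can be sketched as follows. This is only an illustration: the function names are hypothetical, and whether the table's colouring uses the 400-dipeptide or the 441-dipeptide baseline is not stated on this page, so the baseline is left as a parameter.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation in per mille: 1000 / n^2 (2.5 for the 20 standard amino acids)."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"    # would correspond to red in the table
    if observed_per_mille < expected:
        return "under"   # would correspond to blue in the table
    return "expected"
```

With the default baseline, `classify(7.374)` (AlaAla) returns `"over"` and `classify(0.41)` (CysAla) returns `"under"`.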

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
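A minimal sketch of the unit conversion, using hypothetical helper names: a table value of 7.374‰ corresponds to a fraction of about 0.007374, or about 0.7374%.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0
```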

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.374 ± 1.531
AlaCys: 1.229 ± 0.549
AlaAsp: 4.097 ± 0.946
AlaGlu: 4.506 ± 1.519
AlaPhe: 1.639 ± 0.664
AlaGly: 5.326 ± 1.352
AlaHis: 2.458 ± 0.929
AlaIle: 4.916 ± 2.473
AlaLys: 4.916 ± 0.978
AlaLeu: 6.964 ± 1.666
AlaMet: 1.639 ± 0.596
AlaAsn: 7.374 ± 2.677
AlaPro: 6.145 ± 1.498
AlaGln: 2.048 ± 0.619
AlaArg: 2.868 ± 0.589
AlaSer: 3.687 ± 2.202
AlaThr: 8.603 ± 1.596
AlaVal: 6.145 ± 2.595
AlaTrp: 0.819 ± 0.518
AlaTyr: 4.097 ± 2.322
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.41 ± 0.774
CysCys: 0.41 ± 0.619
CysAsp: 1.229 ± 0.697
CysGlu: 1.229 ± 0.503
CysPhe: 0.819 ± 0.464
CysGly: 1.229 ± 1.114
CysHis: 0.41 ± 0.232
CysIle: 0.0 ± 0.0
CysLys: 0.41 ± 0.232
CysLeu: 1.229 ± 0.863
CysMet: 0.819 ± 0.766
CysAsn: 0.41 ± 0.232
CysPro: 1.229 ± 0.926
CysGln: 0.41 ± 0.232
CysArg: 0.41 ± 0.563
CysSer: 1.229 ± 0.503
CysThr: 0.819 ± 0.874
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.229 ± 1.082
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.916 ± 1.301
AspCys: 0.41 ± 0.232
AspAsp: 2.868 ± 0.65
AspGlu: 4.506 ± 0.733
AspPhe: 5.326 ± 0.762
AspGly: 0.819 ± 0.694
AspHis: 2.048 ± 0.949
AspIle: 3.277 ± 1.731
AspLys: 2.458 ± 1.023
AspLeu: 4.506 ± 1.45
AspMet: 0.0 ± 0.0
AspAsn: 2.458 ± 0.804
AspPro: 4.097 ± 1.787
AspGln: 2.868 ± 1.032
AspArg: 2.868 ± 0.589
AspSer: 1.639 ± 0.453
AspThr: 3.687 ± 1.136
AspVal: 2.868 ± 0.65
AspTrp: 1.229 ± 0.503
AspTyr: 3.687 ± 2.09
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.916 ± 0.978
GluCys: 0.819 ± 0.464
GluAsp: 3.277 ± 1.858
GluGlu: 2.048 ± 0.747
GluPhe: 2.868 ± 2.349
GluGly: 2.868 ± 0.63
GluHis: 2.868 ± 1.032
GluIle: 0.41 ± 0.232
GluLys: 2.868 ± 0.775
GluLeu: 6.145 ± 1.571
GluMet: 0.819 ± 0.464
GluAsn: 3.687 ± 1.136
GluPro: 2.048 ± 1.161
GluGln: 2.868 ± 1.032
GluArg: 3.687 ± 0.95
GluSer: 1.229 ± 1.082
GluThr: 2.868 ± 0.791
GluVal: 4.916 ± 2.237
GluTrp: 0.0 ± 0.0
GluTyr: 0.819 ± 0.514
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.048 ± 1.161
PheCys: 1.229 ± 0.718
PheAsp: 2.048 ± 1.089
PheGlu: 2.458 ± 1.005
PhePhe: 1.229 ± 1.291
PheGly: 1.639 ± 0.575
PheHis: 0.819 ± 1.238
PheIle: 1.229 ± 0.697
PheLys: 3.277 ± 0.813
PheLeu: 3.277 ± 0.889
PheMet: 1.229 ± 0.697
PheAsn: 4.506 ± 0.851
PhePro: 2.458 ± 0.501
PheGln: 0.41 ± 0.232
PheArg: 0.41 ± 0.619
PheSer: 3.277 ± 0.715
PheThr: 3.277 ± 0.943
PheVal: 2.868 ± 0.63
PheTrp: 0.0 ± 0.0
PheTyr: 1.639 ± 0.591
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.458 ± 0.93
GlyCys: 1.229 ± 0.736
GlyAsp: 3.687 ± 1.319
GlyGlu: 3.687 ± 0.695
GlyPhe: 2.868 ± 1.315
GlyGly: 3.277 ± 0.715
GlyHis: 1.639 ± 0.757
GlyIle: 2.048 ± 0.831
GlyLys: 2.868 ± 1.134
GlyLeu: 3.277 ± 0.754
GlyMet: 0.819 ± 1.026
GlyAsn: 1.639 ± 1.101
GlyPro: 2.458 ± 0.483
GlyGln: 2.048 ± 0.747
GlyArg: 2.458 ± 1.106
GlySer: 2.868 ± 2.125
GlyThr: 3.687 ± 1.136
GlyVal: 2.048 ± 1.103
GlyTrp: 0.819 ± 0.464
GlyTyr: 0.41 ± 0.232
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.687 ± 0.504
HisCys: 1.229 ± 0.644
HisAsp: 1.229 ± 0.367
HisGlu: 1.639 ± 0.591
HisPhe: 2.048 ± 0.619
HisGly: 4.506 ± 1.093
HisHis: 2.048 ± 2.14
HisIle: 1.229 ± 0.688
HisLys: 0.0 ± 0.0
HisLeu: 4.097 ± 1.336
HisMet: 0.41 ± 0.687
HisAsn: 1.639 ± 1.093
HisPro: 1.229 ± 1.114
HisGln: 2.458 ± 0.733
HisArg: 2.048 ± 0.744
HisSer: 2.458 ± 1.009
HisThr: 2.868 ± 1.3
HisVal: 1.639 ± 0.757
HisTrp: 0.41 ± 0.774
HisTyr: 2.048 ± 1.439
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.048 ± 1.62
IleCys: 0.41 ± 0.619
IleAsp: 2.868 ± 1.611
IleGlu: 2.868 ± 0.791
IlePhe: 1.639 ± 0.929
IleGly: 1.229 ± 0.367
IleHis: 2.458 ± 1.633
IleIle: 0.819 ± 0.694
IleLys: 1.639 ± 0.664
IleLeu: 4.097 ± 2.299
IleMet: 1.639 ± 0.647
IleAsn: 1.229 ± 1.403
IlePro: 3.687 ± 1.383
IleGln: 6.555 ± 1.611
IleArg: 2.458 ± 0.646
IleSer: 2.048 ± 0.883
IleThr: 4.097 ± 0.857
IleVal: 1.229 ± 0.367
IleTrp: 0.0 ± 0.0
IleTyr: 0.819 ± 0.694
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.916 ± 2.045
LysCys: 0.0 ± 0.0
LysAsp: 3.277 ± 1.443
LysGlu: 1.639 ± 0.929
LysPhe: 1.229 ± 0.549
LysGly: 2.048 ± 1.042
LysHis: 1.229 ± 0.503
LysIle: 2.868 ± 1.229
LysLys: 1.639 ± 0.929
LysLeu: 6.964 ± 1.555
LysMet: 0.819 ± 0.514
LysAsn: 0.819 ± 0.464
LysPro: 3.687 ± 1.136
LysGln: 0.819 ± 0.464
LysArg: 3.277 ± 0.948
LysSer: 3.277 ± 1.181
LysThr: 4.506 ± 2.011
LysVal: 2.048 ± 0.799
LysTrp: 0.41 ± 0.232
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.471 ± 2.232
LeuCys: 1.229 ± 0.549
LeuAsp: 7.784 ± 1.648
LeuGlu: 7.784 ± 1.192
LeuPhe: 4.916 ± 1.094
LeuGly: 2.868 ± 0.775
LeuHis: 5.326 ± 2.545
LeuIle: 5.326 ± 1.315
LeuLys: 3.687 ± 1.323
LeuLeu: 7.784 ± 1.648
LeuMet: 0.819 ± 0.464
LeuAsn: 4.916 ± 1.36
LeuPro: 7.784 ± 1.943
LeuGln: 4.506 ± 1.31
LeuArg: 6.555 ± 0.642
LeuSer: 6.964 ± 1.407
LeuThr: 7.374 ± 2.146
LeuVal: 4.506 ± 1.35
LeuTrp: 0.819 ± 0.464
LeuTyr: 1.229 ± 0.697
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.868 ± 0.65
MetCys: 0.819 ± 0.694
MetAsp: 0.0 ± 0.0
MetGlu: 0.819 ± 0.464
MetPhe: 0.41 ± 0.619
MetGly: 0.819 ± 0.414
MetHis: 0.41 ± 0.232
MetIle: 0.41 ± 0.232
MetLys: 0.819 ± 0.874
MetLeu: 2.048 ± 0.436
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.819 ± 0.518
MetGln: 2.048 ± 0.436
MetArg: 0.819 ± 0.464
MetSer: 1.639 ± 0.757
MetThr: 0.41 ± 0.232
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.819 ± 1.173
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.506 ± 1.657
AsnCys: 0.0 ± 0.0
AsnAsp: 2.048 ± 1.161
AsnGlu: 1.639 ± 0.613
AsnPhe: 0.819 ± 0.464
AsnGly: 2.048 ± 1.025
AsnHis: 1.229 ± 0.549
AsnIle: 1.229 ± 0.688
AsnLys: 3.687 ± 1.25
AsnLeu: 5.326 ± 1.285
AsnMet: 0.819 ± 1.223
AsnAsn: 1.229 ± 0.367
AsnPro: 5.735 ± 2.382
AsnGln: 2.458 ± 1.822
AsnArg: 3.277 ± 2.3
AsnSer: 1.639 ± 0.575
AsnThr: 3.687 ± 1.032
AsnVal: 2.458 ± 0.93
AsnTrp: 0.41 ± 0.586
AsnTyr: 0.819 ± 0.514
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.145 ± 3.233
ProCys: 0.0 ± 0.0
ProAsp: 4.916 ± 1.24
ProGlu: 4.916 ± 2.237
ProPhe: 1.639 ± 0.591
ProGly: 2.458 ± 1.005
ProHis: 5.735 ± 1.965
ProIle: 2.868 ± 0.74
ProLys: 5.326 ± 1.991
ProLeu: 5.326 ± 1.442
ProMet: 1.229 ± 0.96
ProAsn: 0.819 ± 1.173
ProPro: 6.964 ± 5.114
ProGln: 1.639 ± 0.929
ProArg: 2.868 ± 0.98
ProSer: 6.555 ± 2.566
ProThr: 4.916 ± 0.96
ProVal: 4.097 ± 0.383
ProTrp: 0.41 ± 0.232
ProTyr: 1.639 ± 1.727
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.916 ± 1.502
GlnCys: 0.0 ± 0.0
GlnAsp: 4.097 ± 1.451
GlnGlu: 1.639 ± 0.929
GlnPhe: 2.048 ± 1.042
GlnGly: 3.277 ± 1.635
GlnHis: 1.639 ± 0.575
GlnIle: 1.639 ± 1.101
GlnLys: 0.41 ± 0.232
GlnLeu: 6.555 ± 2.469
GlnMet: 0.819 ± 0.464
GlnAsn: 0.819 ± 0.414
GlnPro: 2.048 ± 0.619
GlnGln: 4.097 ± 1.768
GlnArg: 3.277 ± 1.102
GlnSer: 3.277 ± 1.346
GlnThr: 4.916 ± 2.387
GlnVal: 4.097 ± 1.239
GlnTrp: 0.819 ± 0.414
GlnTyr: 0.819 ± 0.464
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.097 ± 1.155
ArgCys: 0.819 ± 1.083
ArgAsp: 1.639 ± 0.575
ArgGlu: 2.868 ± 1.032
ArgPhe: 1.639 ± 0.828
ArgGly: 1.229 ± 0.718
ArgHis: 0.41 ± 0.232
ArgIle: 4.916 ± 2.098
ArgLys: 1.639 ± 0.929
ArgLeu: 8.193 ± 2.279
ArgMet: 0.0 ± 0.0
ArgAsn: 2.048 ± 0.831
ArgPro: 2.868 ± 2.266
ArgGln: 3.277 ± 0.9
ArgArg: 4.506 ± 1.199
ArgSer: 2.458 ± 1.3
ArgThr: 6.145 ± 3.081
ArgVal: 2.458 ± 0.861
ArgTrp: 0.41 ± 0.232
ArgTyr: 1.639 ± 0.664
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.145 ± 2.686
SerCys: 0.819 ± 0.694
SerAsp: 6.555 ± 1.33
SerGlu: 3.277 ± 1.017
SerPhe: 2.458 ± 1.083
SerGly: 2.048 ± 0.585
SerHis: 1.639 ± 1.332
SerIle: 2.868 ± 0.89
SerLys: 3.277 ± 1.431
SerLeu: 4.097 ± 1.125
SerMet: 0.819 ± 0.518
SerAsn: 2.868 ± 1.948
SerPro: 2.458 ± 1.009
SerGln: 2.458 ± 1.471
SerArg: 3.687 ± 1.021
SerSer: 4.916 ± 1.643
SerThr: 6.555 ± 1.828
SerVal: 2.868 ± 0.791
SerTrp: 0.0 ± 0.0
SerTyr: 2.458 ± 1.393
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.555 ± 1.815
ThrCys: 0.41 ± 0.619
ThrAsp: 1.229 ± 0.736
ThrGlu: 1.639 ± 1.036
ThrPhe: 4.097 ± 0.93
ThrGly: 3.687 ± 0.868
ThrHis: 2.868 ± 1.134
ThrIle: 3.277 ± 0.889
ThrLys: 2.048 ± 0.744
ThrLeu: 11.061 ± 1.667
ThrMet: 1.639 ± 0.453
ThrAsn: 4.916 ± 1.591
ThrPro: 7.784 ± 0.907
ThrGln: 4.097 ± 0.634
ThrArg: 3.687 ± 1.693
ThrSer: 4.916 ± 1.376
ThrThr: 7.374 ± 0.861
ThrVal: 5.735 ± 0.648
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.458 ± 1.243
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.458 ± 1.426
ValCys: 1.639 ± 1.028
ValAsp: 2.048 ± 1.021
ValGlu: 2.048 ± 0.436
ValPhe: 1.639 ± 0.929
ValGly: 3.277 ± 0.948
ValHis: 2.458 ± 0.733
ValIle: 3.277 ± 0.679
ValLys: 2.868 ± 1.626
ValLeu: 9.013 ± 1.028
ValMet: 0.41 ± 0.232
ValAsn: 3.277 ± 0.997
ValPro: 5.735 ± 1.24
ValGln: 3.277 ± 0.685
ValArg: 2.868 ± 1.032
ValSer: 4.506 ± 1.037
ValThr: 2.048 ± 0.436
ValVal: 3.687 ± 0.45
ValTrp: 0.41 ± 0.232
ValTyr: 1.639 ± 0.751
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.229 ± 1.481
TrpCys: 0.0 ± 0.0
TrpAsp: 0.819 ± 0.464
TrpGlu: 0.41 ± 0.232
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.41 ± 0.232
TrpLeu: 0.819 ± 0.514
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.41 ± 0.232
TrpGln: 0.41 ± 0.232
TrpArg: 0.41 ± 0.232
TrpSer: 0.41 ± 0.563
TrpThr: 0.0 ± 0.0
TrpVal: 1.229 ± 0.697
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.41 ± 0.232
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.687 ± 1.25
TyrCys: 1.229 ± 0.697
TyrAsp: 0.819 ± 0.464
TyrGlu: 0.41 ± 0.232
TyrPhe: 0.0 ± 0.0
TyrGly: 1.639 ± 0.575
TyrHis: 1.229 ± 0.718
TyrIle: 1.229 ± 0.697
TyrLys: 1.229 ± 0.697
TyrLeu: 2.868 ± 1.232
TyrMet: 0.819 ± 0.464
TyrAsn: 0.819 ± 0.518
TyrPro: 0.819 ± 0.514
TyrGln: 2.458 ± 1.083
TyrArg: 0.819 ± 0.414
TyrSer: 3.277 ± 0.956
TyrThr: 1.639 ± 0.591
TyrVal: 3.687 ± 0.45
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.229 ± 0.549
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2442 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
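One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the per mille frequency of one dipeptide is recomputed for each replicate, and the spread of the replicates is reported. The function names, the use of one-letter codes, the fixed seed, and the choice of the standard deviation as the error measure are assumptions of this example, not a description of the Proteome-pI pipeline.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair (one-letter codes) in seq."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`
    over bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

With only 5 proteins, as here, replicates can differ considerably from one another, which is consistent with the relatively large errors reported for many dipeptides in the table.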

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski