Amino acid dipeptide frequency for Bacilladnavirus sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
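As a rough illustration of the calculation described above, the following Python sketch counts overlapping adjacent pairs within each protein and compares each frequency with the uniform expectation of 2.5‰. The function names, the pooling of non-standard residues into X, and the normalization by the total number of pairs are assumptions made for this example; the database may count or normalize differently.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
EXPECTED = 1000.0 / 400               # 2.5 per mille if all 400 pairs were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted inside each protein and
    normalized by the total number of pairs; pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is pooled as X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found; sequences are too short")
    alphabet = AMINO_ACIDS + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq, expected=EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"    # would be marked red in the table
    if freq < expected:
        return "under"   # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy = ["MKKLAPPS", "MSPTPPRL"]  # hypothetical sequences, not from this proteome
    freqs = dipeptide_permille(toy)
    print(round(freqs["PP"], 3), classify(freqs["PP"]))
```

With the values in the table, `classify(9.401)` for ProPro returns "over" and `classify(0.495)` for CysGly returns "under".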

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
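The unit conversion can be sketched in a couple of lines of Python. The helper names are hypothetical and serve only to make the arithmetic explicit.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


if __name__ == "__main__":
    # AlaAla is listed as 3.464 per mille.
    print(permille_to_fraction(3.464))
    # The uniform expectation of 2.5 per mille is the 0.25% quoted in the text.
    print(permille_to_percent(2.5))
```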

For more information see sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.464 ± 1.27
AlaCys: 1.484 ± 0.754
AlaAsp: 3.958 ± 1.239
AlaGlu: 1.979 ± 0.738
AlaPhe: 1.979 ± 0.999
AlaGly: 4.453 ± 2.125
AlaHis: 1.979 ± 0.825
AlaIle: 3.464 ± 1.56
AlaLys: 5.938 ± 1.388
AlaLeu: 3.464 ± 1.168
AlaMet: 0.99 ± 1.152
AlaAsn: 4.453 ± 1.39
AlaPro: 4.948 ± 1.953
AlaGln: 4.453 ± 1.149
AlaArg: 2.969 ± 0.839
AlaSer: 3.958 ± 1.431
AlaThr: 4.453 ± 1.027
AlaVal: 2.969 ± 1.327
AlaTrp: 1.484 ± 0.806
AlaTyr: 2.969 ± 1.056
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.484 ± 0.84
CysCys: 1.979 ± 1.198
CysAsp: 1.484 ± 1.039
CysGlu: 0.0 ± 0.0
CysPhe: 1.979 ± 0.937
CysGly: 0.495 ± 0.41
CysHis: 0.495 ± 0.411
CysIle: 1.484 ± 0.84
CysLys: 1.979 ± 0.83
CysLeu: 2.969 ± 1.154
CysMet: 0.0 ± 0.0
CysAsn: 0.99 ± 0.522
CysPro: 1.484 ± 1.231
CysGln: 0.99 ± 0.586
CysArg: 1.484 ± 0.84
CysSer: 3.464 ± 1.483
CysThr: 1.484 ± 0.845
CysVal: 1.979 ± 1.237
CysTrp: 0.0 ± 0.0
CysTyr: 0.495 ± 0.421
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.453 ± 0.981
AspCys: 3.464 ± 1.151
AspAsp: 4.948 ± 1.72
AspGlu: 2.969 ± 1.007
AspPhe: 1.979 ± 0.891
AspGly: 4.453 ± 1.209
AspHis: 1.979 ± 1.644
AspIle: 3.958 ± 1.411
AspLys: 2.474 ± 1.241
AspLeu: 4.948 ± 1.336
AspMet: 2.969 ± 1.215
AspAsn: 1.484 ± 0.683
AspPro: 6.927 ± 1.954
AspGln: 0.495 ± 0.41
AspArg: 3.464 ± 1.754
AspSer: 0.99 ± 0.557
AspThr: 1.979 ± 0.92
AspVal: 3.958 ± 1.148
AspTrp: 0.0 ± 0.0
AspTyr: 0.99 ± 0.822
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.453 ± 1.196
GluCys: 0.495 ± 0.421
GluAsp: 2.474 ± 0.834
GluGlu: 3.464 ± 1.229
GluPhe: 1.484 ± 0.579
GluGly: 5.443 ± 1.503
GluHis: 0.99 ± 0.535
GluIle: 2.474 ± 1.028
GluLys: 4.453 ± 1.954
GluLeu: 2.969 ± 1.08
GluMet: 0.495 ± 0.388
GluAsn: 1.484 ± 0.806
GluPro: 2.969 ± 1.174
GluGln: 2.474 ± 1.217
GluArg: 0.99 ± 0.695
GluSer: 1.484 ± 0.579
GluThr: 1.484 ± 0.692
GluVal: 3.958 ± 2.014
GluTrp: 0.495 ± 0.619
GluTyr: 1.979 ± 0.813
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.969 ± 1.074
PheCys: 0.0 ± 0.0
PheAsp: 2.969 ± 1.2
PheGlu: 0.495 ± 0.588
PhePhe: 1.979 ± 1.038
PheGly: 2.474 ± 1.061
PheHis: 0.99 ± 0.557
PheIle: 2.474 ± 1.301
PheLys: 1.979 ± 0.899
PheLeu: 3.464 ± 1.434
PheMet: 0.495 ± 0.518
PheAsn: 1.484 ± 1.231
PhePro: 3.958 ± 0.742
PheGln: 2.969 ± 1.97
PheArg: 2.474 ± 0.94
PheSer: 4.453 ± 2.16
PheThr: 2.474 ± 1.016
PheVal: 4.453 ± 1.961
PheTrp: 0.495 ± 0.41
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.464 ± 0.633
GlyCys: 0.99 ± 0.51
GlyAsp: 3.464 ± 1.405
GlyGlu: 3.464 ± 1.187
GlyPhe: 1.484 ± 0.656
GlyGly: 3.958 ± 0.915
GlyHis: 0.495 ± 0.41
GlyIle: 5.938 ± 2.788
GlyLys: 3.464 ± 1.113
GlyLeu: 4.948 ± 1.218
GlyMet: 0.495 ± 0.41
GlyAsn: 1.484 ± 0.723
GlyPro: 1.979 ± 1.045
GlyGln: 2.969 ± 1.195
GlyArg: 5.443 ± 1.354
GlySer: 3.464 ± 1.355
GlyThr: 4.453 ± 1.316
GlyVal: 3.464 ± 1.442
GlyTrp: 0.495 ± 0.41
GlyTyr: 1.979 ± 0.728
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.484 ± 0.692
HisCys: 0.99 ± 0.453
HisAsp: 1.484 ± 0.692
HisGlu: 0.99 ± 0.775
HisPhe: 0.495 ± 0.411
HisGly: 0.99 ± 0.522
HisHis: 0.0 ± 0.0
HisIle: 1.484 ± 0.576
HisLys: 0.99 ± 0.701
HisLeu: 2.969 ± 0.918
HisMet: 0.0 ± 0.0
HisAsn: 0.495 ± 0.588
HisPro: 1.484 ± 0.859
HisGln: 0.99 ± 0.514
HisArg: 1.979 ± 0.813
HisSer: 3.464 ± 1.214
HisThr: 0.0 ± 0.0
HisVal: 0.495 ± 0.411
HisTrp: 0.495 ± 0.411
HisTyr: 0.99 ± 0.522
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.948 ± 1.29
IleCys: 1.484 ± 0.89
IleAsp: 4.948 ± 1.34
IleGlu: 0.99 ± 0.514
IlePhe: 2.969 ± 1.155
IleGly: 1.979 ± 0.615
IleHis: 2.474 ± 1.423
IleIle: 1.484 ± 0.719
IleLys: 2.969 ± 1.221
IleLeu: 2.969 ± 0.923
IleMet: 1.484 ± 0.54
IleAsn: 2.969 ± 1.308
IlePro: 3.958 ± 1.364
IleGln: 1.979 ± 0.873
IleArg: 3.958 ± 1.367
IleSer: 3.958 ± 1.012
IleThr: 1.979 ± 0.922
IleVal: 2.474 ± 0.694
IleTrp: 0.495 ± 0.41
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.464 ± 1.521
LysCys: 2.969 ± 1.727
LysAsp: 1.979 ± 0.988
LysGlu: 3.958 ± 1.492
LysPhe: 0.99 ± 0.668
LysGly: 1.979 ± 1.36
LysHis: 0.99 ± 0.667
LysIle: 1.979 ± 0.728
LysLys: 7.422 ± 3.43
LysLeu: 4.948 ± 1.534
LysMet: 1.484 ± 1.185
LysAsn: 3.958 ± 1.243
LysPro: 2.474 ± 0.977
LysGln: 2.969 ± 0.99
LysArg: 4.453 ± 1.785
LysSer: 4.948 ± 1.661
LysThr: 0.99 ± 0.557
LysVal: 4.948 ± 1.379
LysTrp: 1.484 ± 1.302
LysTyr: 4.948 ± 2.114
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.453 ± 1.533
LeuCys: 0.495 ± 0.41
LeuAsp: 4.453 ± 0.665
LeuGlu: 3.464 ± 1.051
LeuPhe: 4.453 ± 1.431
LeuGly: 2.474 ± 1.183
LeuHis: 3.464 ± 1.028
LeuIle: 3.464 ± 1.138
LeuLys: 4.948 ± 1.784
LeuLeu: 5.443 ± 1.599
LeuMet: 2.474 ± 0.863
LeuAsn: 3.464 ± 1.132
LeuPro: 3.958 ± 0.879
LeuGln: 2.969 ± 0.914
LeuArg: 6.432 ± 1.557
LeuSer: 6.432 ± 2.325
LeuThr: 5.938 ± 2.643
LeuVal: 2.474 ± 1.327
LeuTrp: 0.99 ± 1.176
LeuTyr: 3.958 ± 2.016
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.484 ± 0.943
MetCys: 0.495 ± 0.421
MetAsp: 0.495 ± 0.421
MetGlu: 0.99 ± 0.744
MetPhe: 0.99 ± 0.667
MetGly: 0.99 ± 0.522
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 2.474 ± 1.112
MetLeu: 1.979 ± 0.793
MetMet: 0.0 ± 0.0
MetAsn: 2.969 ± 0.852
MetPro: 1.484 ± 0.852
MetGln: 0.99 ± 0.51
MetArg: 0.99 ± 0.69
MetSer: 1.979 ± 0.763
MetThr: 0.495 ± 0.41
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.99 ± 0.736
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.979 ± 0.771
AsnCys: 0.99 ± 0.842
AsnAsp: 2.474 ± 1.02
AsnGlu: 3.958 ± 2.058
AsnPhe: 0.495 ± 0.41
AsnGly: 1.979 ± 1.135
AsnHis: 0.99 ± 0.522
AsnIle: 0.99 ± 0.701
AsnLys: 0.0 ± 0.0
AsnLeu: 1.484 ± 0.463
AsnMet: 0.99 ± 0.821
AsnAsn: 1.484 ± 0.514
AsnPro: 5.938 ± 1.962
AsnGln: 2.474 ± 1.175
AsnArg: 2.969 ± 1.032
AsnSer: 3.464 ± 1.451
AsnThr: 3.464 ± 1.558
AsnVal: 1.484 ± 0.692
AsnTrp: 0.495 ± 0.388
AsnTyr: 2.474 ± 0.927
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.969 ± 0.865
ProCys: 2.474 ± 1.307
ProAsp: 4.948 ± 2.306
ProGlu: 2.969 ± 1.123
ProPhe: 3.958 ± 1.167
ProGly: 4.948 ± 1.574
ProHis: 1.484 ± 1.233
ProIle: 2.969 ± 1.035
ProLys: 4.948 ± 1.617
ProLeu: 3.958 ± 1.353
ProMet: 1.484 ± 0.974
ProAsn: 4.948 ± 1.573
ProPro: 9.401 ± 2.585
ProGln: 1.979 ± 1.185
ProArg: 4.948 ± 1.753
ProSer: 7.917 ± 2.06
ProThr: 7.422 ± 1.419
ProVal: 5.443 ± 1.549
ProTrp: 1.484 ± 0.761
ProTyr: 0.495 ± 0.41
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.474 ± 0.768
GlnCys: 0.495 ± 0.468
GlnAsp: 1.979 ± 0.736
GlnGlu: 1.484 ± 0.831
GlnPhe: 1.979 ± 1.045
GlnGly: 2.969 ± 1.156
GlnHis: 0.495 ± 0.41
GlnIle: 2.474 ± 1.537
GlnLys: 1.979 ± 1.074
GlnLeu: 1.979 ± 0.903
GlnMet: 0.495 ± 0.468
GlnAsn: 1.484 ± 1.163
GlnPro: 0.99 ± 0.524
GlnGln: 0.99 ± 0.557
GlnArg: 1.484 ± 1.405
GlnSer: 4.453 ± 1.178
GlnThr: 4.453 ± 1.325
GlnVal: 1.484 ± 1.233
GlnTrp: 1.484 ± 0.786
GlnTyr: 2.474 ± 0.806
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.464 ± 1.441
ArgCys: 0.99 ± 0.695
ArgAsp: 3.958 ± 1.041
ArgGlu: 2.474 ± 1.838
ArgPhe: 3.958 ± 1.128
ArgGly: 2.474 ± 1.314
ArgHis: 0.495 ± 0.41
ArgIle: 1.979 ± 0.793
ArgLys: 5.938 ± 2.723
ArgLeu: 8.906 ± 2.337
ArgMet: 0.99 ± 0.667
ArgAsn: 1.979 ± 0.658
ArgPro: 2.969 ± 1.194
ArgGln: 2.474 ± 1.679
ArgArg: 4.948 ± 2.847
ArgSer: 4.948 ± 1.072
ArgThr: 4.453 ± 1.177
ArgVal: 2.969 ± 1.013
ArgTrp: 1.484 ± 1.134
ArgTyr: 1.979 ± 0.966
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.443 ± 1.089
SerCys: 0.99 ± 0.552
SerAsp: 3.464 ± 1.035
SerGlu: 4.453 ± 1.175
SerPhe: 4.948 ± 1.262
SerGly: 5.938 ± 1.846
SerHis: 0.99 ± 0.535
SerIle: 3.464 ± 0.853
SerLys: 4.453 ± 1.759
SerLeu: 5.443 ± 2.3
SerMet: 1.484 ± 1.089
SerAsn: 0.99 ± 0.524
SerPro: 8.906 ± 3.221
SerGln: 1.484 ± 0.943
SerArg: 3.958 ± 1.118
SerSer: 5.443 ± 1.353
SerThr: 5.938 ± 1.805
SerVal: 3.464 ± 1.198
SerTrp: 1.979 ± 0.878
SerTyr: 2.474 ± 0.657
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.464 ± 1.492
ThrCys: 0.99 ± 0.845
ThrAsp: 1.979 ± 0.977
ThrGlu: 1.979 ± 0.621
ThrPhe: 2.969 ± 0.854
ThrGly: 5.443 ± 2.259
ThrHis: 1.484 ± 0.852
ThrIle: 3.958 ± 1.265
ThrLys: 1.979 ± 0.661
ThrLeu: 4.948 ± 2.145
ThrMet: 1.484 ± 0.587
ThrAsn: 1.979 ± 1.177
ThrPro: 7.917 ± 1.554
ThrGln: 0.99 ± 0.51
ThrArg: 1.484 ± 0.761
ThrSer: 3.464 ± 0.978
ThrThr: 5.938 ± 2.42
ThrVal: 2.474 ± 1.092
ThrTrp: 1.979 ± 0.708
ThrTyr: 3.464 ± 1.712
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.443 ± 1.189
ValCys: 1.979 ± 0.788
ValAsp: 2.969 ± 1.309
ValGlu: 2.969 ± 1.046
ValPhe: 2.969 ± 1.148
ValGly: 2.474 ± 1.291
ValHis: 1.484 ± 0.939
ValIle: 3.464 ± 1.233
ValLys: 2.969 ± 1.145
ValLeu: 5.443 ± 1.801
ValMet: 0.99 ± 0.628
ValAsn: 1.484 ± 0.957
ValPro: 5.938 ± 1.221
ValGln: 1.979 ± 1.052
ValArg: 2.474 ± 0.925
ValSer: 3.464 ± 0.958
ValThr: 1.484 ± 0.88
ValVal: 4.453 ± 1.468
ValTrp: 0.0 ± 0.0
ValTyr: 1.484 ± 0.656
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.495 ± 0.41
TrpCys: 1.979 ± 0.599
TrpAsp: 1.979 ± 0.839
TrpGlu: 0.495 ± 0.388
TrpPhe: 0.495 ± 0.411
TrpGly: 0.495 ± 0.388
TrpHis: 0.495 ± 0.41
TrpIle: 0.99 ± 0.731
TrpLys: 0.495 ± 0.518
TrpLeu: 0.99 ± 0.728
TrpMet: 0.495 ± 0.468
TrpAsn: 0.0 ± 0.0
TrpPro: 0.99 ± 0.51
TrpGln: 1.484 ± 1.222
TrpArg: 1.484 ± 0.873
TrpSer: 1.484 ± 1.007
TrpThr: 0.495 ± 0.41
TrpVal: 0.495 ± 0.421
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.495 ± 0.619
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.958 ± 2.0
TyrCys: 0.99 ± 0.453
TyrAsp: 2.474 ± 1.123
TyrGlu: 2.969 ± 1.056
TyrPhe: 0.495 ± 0.411
TyrGly: 1.484 ± 1.039
TyrHis: 0.495 ± 0.518
TyrIle: 1.979 ± 1.011
TyrLys: 1.484 ± 0.756
TyrLeu: 1.979 ± 0.886
TyrMet: 0.0 ± 0.406
TyrAsn: 0.99 ± 0.624
TyrPro: 2.474 ± 0.4
TyrGln: 0.0 ± 0.0
TyrArg: 5.443 ± 1.814
TyrSer: 2.474 ± 1.124
TyrThr: 0.99 ± 0.667
TyrVal: 2.474 ± 0.911
TyrTrp: 0.99 ± 0.522
TyrTyr: 2.969 ± 1.375
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
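
For readers who want to work with the listing above programmatically, the following Python sketch parses entries of the form "AlaAla: 3.464 ± 1.27" into a lookup table. The function name and the regular expression are assumptions for illustration. The pattern also tolerates a repeated value in front of the dipeptide name, and it skips row labels and other text.

```python
import re

# Optional leading number, two three-letter residue codes, value, error.
ENTRY = re.compile(
    r"^\s*(?:\d+(?:\.\d+)?)?"             # optional repeated value before the name
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):"    # first and second residue
    r"\s*(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)\s*$"
)


def parse_dipeptide_lines(lines):
    """Return {(first, second): (frequency_permille, error_permille)}."""
    table = {}
    for line in lines:
        match = ENTRY.match(line)
        if match is None:
            continue  # row labels such as "Ala" and any prose are ignored
        first, second, freq, err = match.groups()
        table[(first, second)] = (float(freq), float(err))
    return table


if __name__ == "__main__":
    sample = ["Pro", "ProPro: 9.401 ± 2.585", "ProSer: 7.917 ± 2.06"]
    print(parse_dipeptide_lines(sample))
```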
Statistics based on 9 proteins (2022 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
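
A protein-level bootstrap of this kind can be sketched as follows. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The function names, the use of the sample standard deviation, and the normalization by the number of pairs are assumptions for this example; the source does not state these details.

```python
import random
from statistics import stdev


def pair_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, given as two one-letter codes,
    among all overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == dipeptide)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over n_rounds protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        resampled = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(resampled, dipeptide))
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLA", "AKKP", "GGSKK"]  # hypothetical sequences, not from this proteome
    print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a small proteome such as this one (9 proteins) can yield large errors relative to the values.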

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski