Amino acid dipeptide frequency for Aspergillus fumigatus polymycovirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
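As a minimal sketch of how such frequencies can be obtained (an illustration only, not the code used by Proteome-pI), the function below counts overlapping two-residue windows and reports them in per mille. The name `dipeptide_permille`, the use of one-letter codes, and the choice not to count windows that span two proteins are assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping windows of length 2
        # (assumption: a window never spans two proteins).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: the windows are AA, AA and AC.
freqs = dipeptide_permille(["AAAC"])
```

For the toy sequence, `freqs` holds two entries, AA and AC, whose values sum to 1000‰.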

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
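The expected value and the over/under comparison can be sketched as follows. The page does not state the exact rule used for colouring, so the strict comparison against 2.5‰ and the names `classify` and `permille_to_fraction` are assumptions for illustration.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"   # would be marked red
    if freq_permille < expected:
        return "under"  # would be marked blue
    return "expected"


# Two values taken from the table: AlaAla and CysCys.
labels = {"AlaAla": classify(15.685), "CysCys": classify(0.424)}
```

With this rule AlaAla (15.685‰) is labelled as overrepresented and CysCys (0.424‰) as underrepresented.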

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.685 ± 2.127
AlaCys: 3.815 ± 2.151
AlaAsp: 4.239 ± 1.351
AlaGlu: 6.783 ± 1.518
AlaPhe: 3.815 ± 0.667
AlaGly: 5.087 ± 0.43
AlaHis: 2.12 ± 0.944
AlaIle: 5.087 ± 1.962
AlaLys: 5.935 ± 1.992
AlaLeu: 10.598 ± 2.132
AlaMet: 2.967 ± 0.442
AlaAsn: 2.543 ± 1.213
AlaPro: 5.935 ± 2.589
AlaGln: 4.239 ± 0.814
AlaArg: 10.598 ± 3.58
AlaSer: 9.326 ± 1.21
AlaThr: 7.206 ± 2.897
AlaVal: 8.902 ± 1.882
AlaTrp: 0.848 ± 0.698
AlaTyr: 2.967 ± 0.189
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.696 ± 1.46
CysCys: 0.424 ± 0.387
CysAsp: 0.848 ± 0.774
CysGlu: 0.424 ± 0.369
CysPhe: 0.0 ± 0.0
CysGly: 0.848 ± 0.73
CysHis: 0.0 ± 0.0
CysIle: 0.848 ± 0.73
CysLys: 0.424 ± 0.369
CysLeu: 0.848 ± 0.446
CysMet: 0.424 ± 0.369
CysAsn: 0.424 ± 0.365
CysPro: 0.848 ± 0.446
CysGln: 0.424 ± 0.369
CysArg: 0.0 ± 0.0
CysSer: 0.848 ± 0.333
CysThr: 1.272 ± 0.621
CysVal: 1.272 ± 1.095
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 9.75 ± 1.479
AspCys: 0.424 ± 0.365
AspAsp: 5.935 ± 1.995
AspGlu: 2.543 ± 1.749
AspPhe: 1.272 ± 0.227
AspGly: 6.359 ± 2.041
AspHis: 2.967 ± 0.698
AspIle: 3.815 ± 1.02
AspLys: 1.696 ± 0.666
AspLeu: 4.663 ± 1.869
AspMet: 1.696 ± 0.568
AspAsn: 1.696 ± 0.921
AspPro: 6.783 ± 2.772
AspGln: 1.272 ± 0.227
AspArg: 4.239 ± 0.222
AspSer: 1.272 ± 0.227
AspThr: 3.815 ± 1.3
AspVal: 3.815 ± 1.691
AspTrp: 0.848 ± 0.446
AspTyr: 2.543 ± 1.434
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.663 ± 1.2
GluCys: 0.424 ± 0.369
GluAsp: 2.543 ± 0.751
GluGlu: 2.967 ± 1.671
GluPhe: 2.967 ± 1.215
GluGly: 3.815 ± 2.009
GluHis: 2.12 ± 0.438
GluIle: 1.696 ± 1.655
GluLys: 2.12 ± 1.275
GluLeu: 4.239 ± 0.501
GluMet: 0.0 ± 0.0
GluAsn: 0.848 ± 0.73
GluPro: 2.967 ± 0.889
GluGln: 1.696 ± 0.353
GluArg: 3.815 ± 0.841
GluSer: 1.272 ± 0.673
GluThr: 2.543 ± 1.114
GluVal: 1.272 ± 0.227
GluTrp: 0.848 ± 0.73
GluTyr: 1.696 ± 0.803
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.391 ± 0.536
PheCys: 0.424 ± 0.369
PheAsp: 3.815 ± 0.681
PheGlu: 0.424 ± 0.387
PhePhe: 0.848 ± 0.774
PheGly: 3.391 ± 1.48
PheHis: 1.696 ± 0.492
PheIle: 1.696 ± 0.98
PheLys: 0.848 ± 0.446
PheLeu: 2.543 ± 1.358
PheMet: 0.848 ± 0.333
PheAsn: 0.424 ± 0.369
PhePro: 0.848 ± 0.774
PheGln: 0.0 ± 0.0
PheArg: 1.272 ± 0.227
PheSer: 2.543 ± 0.643
PheThr: 2.12 ± 0.657
PheVal: 4.239 ± 0.72
PheTrp: 0.424 ± 0.563
PheTyr: 1.272 ± 0.673
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.663 ± 1.262
GlyCys: 0.848 ± 0.738
GlyAsp: 5.511 ± 0.987
GlyGlu: 2.967 ± 0.733
GlyPhe: 0.848 ± 0.333
GlyGly: 6.783 ± 3.843
GlyHis: 2.12 ± 0.856
GlyIle: 2.12 ± 0.604
GlyLys: 0.848 ± 0.333
GlyLeu: 6.783 ± 2.396
GlyMet: 1.696 ± 0.42
GlyAsn: 2.543 ± 1.204
GlyPro: 6.359 ± 0.747
GlyGln: 1.696 ± 1.751
GlyArg: 6.359 ± 2.386
GlySer: 5.935 ± 2.471
GlyThr: 3.391 ± 0.984
GlyVal: 7.63 ± 0.998
GlyTrp: 0.424 ± 0.369
GlyTyr: 1.696 ± 0.803
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.12 ± 0.657
HisCys: 0.0 ± 0.0
HisAsp: 2.12 ± 1.37
HisGlu: 1.272 ± 0.673
HisPhe: 1.272 ± 0.604
HisGly: 4.239 ± 0.752
HisHis: 1.272 ± 0.227
HisIle: 0.848 ± 0.401
HisLys: 0.424 ± 0.387
HisLeu: 3.391 ± 0.536
HisMet: 0.848 ± 0.401
HisAsn: 0.848 ± 0.401
HisPro: 2.543 ± 0.643
HisGln: 0.424 ± 0.387
HisArg: 1.272 ± 0.227
HisSer: 1.272 ± 0.597
HisThr: 2.543 ± 1.287
HisVal: 2.12 ± 1.137
HisTrp: 0.0 ± 0.0
HisTyr: 1.272 ± 0.227
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.935 ± 2.003
IleCys: 0.424 ± 0.365
IleAsp: 3.391 ± 1.231
IleGlu: 1.272 ± 1.095
IlePhe: 1.272 ± 0.621
IleGly: 1.272 ± 0.679
IleHis: 0.848 ± 1.126
IleIle: 2.12 ± 0.856
IleLys: 2.543 ± 1.208
IleLeu: 2.967 ± 0.839
IleMet: 1.272 ± 0.586
IleAsn: 0.424 ± 0.365
IlePro: 1.696 ± 1.315
IleGln: 0.848 ± 1.126
IleArg: 0.424 ± 0.365
IleSer: 3.815 ± 1.209
IleThr: 2.967 ± 1.037
IleVal: 3.391 ± 0.702
IleTrp: 0.0 ± 0.0
IleTyr: 0.848 ± 0.446
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.815 ± 1.448
LysCys: 0.424 ± 0.387
LysAsp: 2.543 ± 1.474
LysGlu: 0.424 ± 0.369
LysPhe: 0.424 ± 0.369
LysGly: 3.815 ± 1.448
LysHis: 0.424 ± 0.369
LysIle: 1.696 ± 0.509
LysLys: 0.848 ± 0.738
LysLeu: 4.239 ± 2.728
LysMet: 0.0 ± 0.0
LysAsn: 1.272 ± 0.621
LysPro: 1.696 ± 1.109
LysGln: 0.0 ± 0.0
LysArg: 2.543 ± 1.032
LysSer: 1.696 ± 0.42
LysThr: 0.424 ± 0.563
LysVal: 2.967 ± 1.032
LysTrp: 0.848 ± 0.738
LysTyr: 0.848 ± 0.333
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.022 ± 0.254
LeuCys: 0.424 ± 0.365
LeuAsp: 3.391 ± 1.196
LeuGlu: 5.087 ± 0.871
LeuPhe: 3.391 ± 0.84
LeuGly: 5.935 ± 1.355
LeuHis: 3.815 ± 1.197
LeuIle: 3.391 ± 1.457
LeuLys: 3.391 ± 1.386
LeuLeu: 8.902 ± 2.691
LeuMet: 1.272 ± 0.679
LeuAsn: 2.967 ± 1.671
LeuPro: 3.815 ± 1.478
LeuGln: 1.696 ± 0.492
LeuArg: 7.206 ± 1.349
LeuSer: 7.63 ± 1.039
LeuThr: 6.359 ± 0.56
LeuVal: 9.326 ± 1.511
LeuTrp: 0.424 ± 0.369
LeuTyr: 2.12 ± 0.407
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.967 ± 1.757
MetCys: 0.848 ± 0.401
MetAsp: 0.848 ± 0.738
MetGlu: 0.848 ± 0.73
MetPhe: 2.12 ± 0.525
MetGly: 0.424 ± 0.369
MetHis: 0.424 ± 0.369
MetIle: 0.424 ± 0.365
MetLys: 0.848 ± 0.401
MetLeu: 1.272 ± 0.751
MetMet: 0.424 ± 0.369
MetAsn: 0.0 ± 0.0
MetPro: 2.12 ± 0.604
MetGln: 0.0 ± 0.0
MetArg: 1.272 ± 0.227
MetSer: 2.12 ± 1.469
MetThr: 0.0 ± 0.0
MetVal: 0.848 ± 0.401
MetTrp: 0.0 ± 0.0
MetTyr: 0.848 ± 0.333
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.967 ± 0.698
AsnCys: 0.0 ± 0.0
AsnAsp: 0.424 ± 0.369
AsnGlu: 1.272 ± 0.673
AsnPhe: 0.0 ± 0.0
AsnGly: 1.272 ± 0.717
AsnHis: 0.848 ± 0.401
AsnIle: 1.696 ± 1.047
AsnLys: 0.848 ± 0.401
AsnLeu: 2.543 ± 1.173
AsnMet: 0.848 ± 0.401
AsnAsn: 0.848 ± 0.401
AsnPro: 1.696 ± 0.803
AsnGln: 1.272 ± 0.673
AsnArg: 2.967 ± 1.002
AsnSer: 0.848 ± 0.608
AsnThr: 2.12 ± 0.856
AsnVal: 2.12 ± 0.856
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.848 ± 0.401
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.63 ± 2.386
ProCys: 0.424 ± 0.387
ProAsp: 4.239 ± 1.049
ProGlu: 3.391 ± 1.212
ProPhe: 0.848 ± 0.446
ProGly: 6.359 ± 0.823
ProHis: 2.12 ± 0.797
ProIle: 1.272 ± 0.833
ProLys: 1.272 ± 1.112
ProLeu: 3.815 ± 1.341
ProMet: 0.848 ± 0.666
ProAsn: 0.848 ± 0.401
ProPro: 5.935 ± 1.01
ProGln: 1.272 ± 0.227
ProArg: 5.511 ± 1.432
ProSer: 5.511 ± 2.381
ProThr: 7.63 ± 0.716
ProVal: 6.359 ± 1.429
ProTrp: 0.424 ± 0.365
ProTyr: 2.12 ± 0.58
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.391 ± 1.985
GlnCys: 0.424 ± 0.365
GlnAsp: 1.272 ± 0.717
GlnGlu: 1.272 ± 0.833
GlnPhe: 2.967 ± 1.288
GlnGly: 0.0 ± 0.0
GlnHis: 0.424 ± 0.365
GlnIle: 0.0 ± 0.0
GlnLys: 0.0 ± 0.0
GlnLeu: 2.967 ± 0.583
GlnMet: 0.0 ± 0.0
GlnAsn: 0.424 ± 0.365
GlnPro: 1.272 ± 0.621
GlnGln: 0.424 ± 0.563
GlnArg: 2.543 ± 1.242
GlnSer: 2.12 ± 0.389
GlnThr: 0.848 ± 0.333
GlnVal: 2.543 ± 0.643
GlnTrp: 0.424 ± 0.369
GlnTyr: 1.696 ± 1.215
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.478 ± 3.185
ArgCys: 0.0 ± 0.0
ArgAsp: 3.391 ± 0.934
ArgGlu: 2.12 ± 1.053
ArgPhe: 3.391 ± 1.293
ArgGly: 4.239 ± 1.251
ArgHis: 1.696 ± 1.005
ArgIle: 1.696 ± 1.071
ArgLys: 1.272 ± 0.586
ArgLeu: 8.902 ± 0.991
ArgMet: 2.12 ± 1.102
ArgAsn: 3.391 ± 1.352
ArgPro: 5.511 ± 2.835
ArgGln: 3.391 ± 0.256
ArgArg: 5.935 ± 1.217
ArgSer: 7.206 ± 0.618
ArgThr: 2.967 ± 1.434
ArgVal: 6.359 ± 2.448
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.12 ± 0.856
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.902 ± 2.492
SerCys: 0.0 ± 0.0
SerAsp: 4.239 ± 1.111
SerGlu: 2.967 ± 0.583
SerPhe: 3.815 ± 0.857
SerGly: 7.206 ± 2.566
SerHis: 1.696 ± 0.921
SerIle: 3.391 ± 0.796
SerLys: 2.12 ± 0.58
SerLeu: 10.598 ± 1.205
SerMet: 0.848 ± 0.401
SerAsn: 0.848 ± 0.333
SerPro: 4.663 ± 0.578
SerGln: 1.696 ± 0.666
SerArg: 3.391 ± 0.98
SerSer: 5.511 ± 1.92
SerThr: 4.239 ± 1.783
SerVal: 5.087 ± 1.931
SerTrp: 0.848 ± 0.401
SerTyr: 1.696 ± 1.136
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.783 ± 1.045
ThrCys: 0.424 ± 0.365
ThrAsp: 4.239 ± 0.518
ThrGlu: 2.543 ± 1.032
ThrPhe: 2.12 ± 0.438
ThrGly: 4.239 ± 0.875
ThrHis: 1.696 ± 0.921
ThrIle: 1.272 ± 1.162
ThrLys: 1.272 ± 0.604
ThrLeu: 5.087 ± 1.601
ThrMet: 1.696 ± 0.42
ThrAsn: 0.424 ± 0.563
ThrPro: 5.935 ± 1.036
ThrGln: 0.848 ± 0.401
ThrArg: 4.239 ± 0.752
ThrSer: 5.511 ± 0.408
ThrThr: 4.663 ± 1.267
ThrVal: 5.087 ± 0.966
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.272 ± 0.699
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.902 ± 1.72
ValCys: 0.848 ± 0.446
ValAsp: 10.174 ± 1.104
ValGlu: 4.239 ± 1.592
ValPhe: 1.272 ± 0.227
ValGly: 2.12 ± 0.407
ValHis: 2.967 ± 0.661
ValIle: 2.967 ± 0.726
ValLys: 4.239 ± 1.375
ValLeu: 5.935 ± 2.176
ValMet: 0.424 ± 0.365
ValAsn: 3.815 ± 0.877
ValPro: 5.511 ± 1.04
ValGln: 2.12 ± 1.354
ValArg: 6.783 ± 0.622
ValSer: 6.783 ± 0.511
ValThr: 3.391 ± 1.712
ValVal: 5.935 ± 0.979
ValTrp: 1.272 ± 0.979
ValTyr: 2.543 ± 0.837
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.424 ± 0.369
TrpCys: 0.848 ± 0.401
TrpAsp: 1.696 ± 0.353
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.848 ± 0.73
TrpHis: 0.0 ± 0.0
TrpIle: 0.424 ± 0.387
TrpLys: 0.0 ± 0.0
TrpLeu: 0.424 ± 0.365
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.848 ± 0.333
TrpArg: 0.424 ± 0.387
TrpSer: 0.424 ± 0.563
TrpThr: 0.0 ± 0.0
TrpVal: 0.848 ± 0.608
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.424 ± 0.369
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.935 ± 0.611
TyrCys: 0.424 ± 0.387
TyrAsp: 2.543 ± 0.454
TyrGlu: 1.696 ± 0.492
TyrPhe: 0.424 ± 0.369
TyrGly: 2.967 ± 0.97
TyrHis: 0.848 ± 0.519
TyrIle: 1.272 ± 0.699
TyrLys: 0.0 ± 0.0
TyrLeu: 0.848 ± 0.401
TyrMet: 0.0 ± 0.0
TyrAsn: 0.848 ± 0.446
TyrPro: 1.696 ± 0.7
TyrGln: 0.848 ± 0.608
TyrArg: 3.391 ± 0.84
TyrSer: 2.12 ± 1.046
TyrThr: 0.848 ± 0.401
TyrVal: 2.12 ± 0.757
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2360 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
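A protein-level bootstrap can be sketched as below. The exact procedure behind the ± values is not described here, so this assumes that whole proteins are resampled with replacement and that the error is the standard deviation of the frequency across replicates. The names `pair_permille` and `bootstrap_error` and the fixed seed are illustrative choices.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide, in one-letter code, over all proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of a dipeptide frequency under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        resampled = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(resampled, pair))
    # Assumption: the reported error is the standard deviation over replicates.
    return statistics.pstdev(estimates)


# Toy proteome of two very different proteins.
err = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With only a handful of proteins, as in this proteome, each replicate can differ substantially from the next, which is one reason the errors in the table are large relative to the values.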

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski