Amino acid dipepetide frequency for Yam virus X

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.515AlaAla: 7.515 ± 4.034
1.002AlaCys: 1.002 ± 0.526
0.501AlaAsp: 0.501 ± 1.127
5.01AlaGlu: 5.01 ± 1.388
3.507AlaPhe: 3.507 ± 1.675
4.509AlaGly: 4.509 ± 0.992
1.002AlaHis: 1.002 ± 0.744
4.509AlaIle: 4.509 ± 2.819
4.008AlaLys: 4.008 ± 1.364
10.02AlaLeu: 10.02 ± 2.277
1.503AlaMet: 1.503 ± 0.789
6.513AlaAsn: 6.513 ± 2.315
4.509AlaPro: 4.509 ± 2.591
4.509AlaGln: 4.509 ± 1.558
3.006AlaArg: 3.006 ± 1.727
5.511AlaSer: 5.511 ± 1.811
7.014AlaThr: 7.014 ± 3.936
3.006AlaVal: 3.006 ± 0.96
1.002AlaTrp: 1.002 ± 0.933
3.006AlaTyr: 3.006 ± 1.578
0.0AlaXaa: 0.0 ± 0.0
Cys
0.501CysAla: 0.501 ± 1.127
0.0CysCys: 0.0 ± 0.0
0.501CysAsp: 0.501 ± 0.263
1.503CysGlu: 1.503 ± 0.789
1.002CysPhe: 1.002 ± 0.744
1.002CysGly: 1.002 ± 0.526
0.0CysHis: 0.0 ± 0.0
1.002CysIle: 1.002 ± 0.933
1.503CysLys: 1.503 ± 1.469
1.002CysLeu: 1.002 ± 0.526
0.501CysMet: 0.501 ± 1.127
1.002CysAsn: 1.002 ± 0.744
1.503CysPro: 1.503 ± 0.728
1.503CysGln: 1.503 ± 0.789
0.501CysArg: 0.501 ± 0.263
2.505CysSer: 2.505 ± 0.955
2.505CysThr: 2.505 ± 1.487
0.501CysVal: 0.501 ± 0.845
0.0CysTrp: 0.0 ± 0.0
1.002CysTyr: 1.002 ± 0.933
0.0CysXaa: 0.0 ± 0.0
Asp
2.004AspAla: 2.004 ± 1.052
2.004AspCys: 2.004 ± 1.487
3.507AspAsp: 3.507 ± 1.52
3.006AspGlu: 3.006 ± 1.136
4.509AspPhe: 4.509 ± 1.693
3.006AspGly: 3.006 ± 1.538
0.501AspHis: 0.501 ± 1.022
2.505AspIle: 2.505 ± 1.315
3.507AspLys: 3.507 ± 1.51
3.006AspLeu: 3.006 ± 1.727
0.501AspMet: 0.501 ± 0.263
3.006AspAsn: 3.006 ± 1.808
3.006AspPro: 3.006 ± 0.999
1.503AspGln: 1.503 ± 0.864
2.505AspArg: 2.505 ± 0.948
2.004AspSer: 2.004 ± 0.832
3.507AspThr: 3.507 ± 0.572
3.507AspVal: 3.507 ± 1.52
3.006AspTrp: 3.006 ± 1.524
0.501AspTyr: 0.501 ± 0.263
0.0AspXaa: 0.0 ± 0.0
Glu
5.01GluAla: 5.01 ± 1.225
0.501GluCys: 0.501 ± 0.263
3.507GluAsp: 3.507 ± 2.183
4.509GluGlu: 4.509 ± 2.367
2.004GluPhe: 2.004 ± 1.44
2.505GluGly: 2.505 ± 0.948
0.0GluHis: 0.0 ± 0.0
3.507GluIle: 3.507 ± 0.572
4.509GluLys: 4.509 ± 1.815
8.016GluLeu: 8.016 ± 2.029
2.004GluMet: 2.004 ± 1.052
2.505GluAsn: 2.505 ± 1.315
3.006GluPro: 3.006 ± 1.524
2.505GluGln: 2.505 ± 0.881
3.006GluArg: 3.006 ± 1.304
2.505GluSer: 2.505 ± 0.694
3.006GluThr: 3.006 ± 1.578
4.008GluVal: 4.008 ± 1.578
0.501GluTrp: 0.501 ± 0.263
1.503GluTyr: 1.503 ± 0.728
0.0GluXaa: 0.0 ± 0.0
Phe
3.006PheAla: 3.006 ± 1.366
2.505PheCys: 2.505 ± 0.694
3.507PheAsp: 3.507 ± 1.166
3.507PheGlu: 3.507 ± 1.51
2.505PhePhe: 2.505 ± 1.235
1.503PheGly: 1.503 ± 0.728
2.004PheHis: 2.004 ± 1.052
2.004PheIle: 2.004 ± 0.832
3.006PheLys: 3.006 ± 0.999
7.515PheLeu: 7.515 ± 3.323
1.503PheMet: 1.503 ± 1.324
2.505PheAsn: 2.505 ± 1.448
2.004PhePro: 2.004 ± 0.803
2.505PheGln: 2.505 ± 0.948
3.006PheArg: 3.006 ± 1.787
2.505PheSer: 2.505 ± 1.315
6.012PheThr: 6.012 ± 1.568
1.503PheVal: 1.503 ± 0.789
1.002PheTrp: 1.002 ± 0.969
1.002PheTyr: 1.002 ± 0.969
0.0PheXaa: 0.0 ± 0.0
Gly
6.012GlyAla: 6.012 ± 2.702
2.505GlyCys: 2.505 ± 1.598
3.507GlyAsp: 3.507 ± 1.035
0.501GlyGlu: 0.501 ± 0.263
4.008GlyPhe: 4.008 ± 1.606
2.004GlyGly: 2.004 ± 1.35
2.004GlyHis: 2.004 ± 0.803
2.004GlyIle: 2.004 ± 1.052
2.004GlyLys: 2.004 ± 0.803
4.008GlyLeu: 4.008 ± 3.449
0.501GlyMet: 0.501 ± 0.263
2.505GlyAsn: 2.505 ± 0.948
0.501GlyPro: 0.501 ± 0.263
1.503GlyGln: 1.503 ± 0.864
1.503GlyArg: 1.503 ± 1.091
2.004GlySer: 2.004 ± 1.694
3.006GlyThr: 3.006 ± 1.791
3.006GlyVal: 3.006 ± 4.281
0.501GlyTrp: 0.501 ± 0.263
0.501GlyTyr: 0.501 ± 0.263
0.0GlyXaa: 0.0 ± 0.0
His
1.503HisAla: 1.503 ± 0.789
0.501HisCys: 0.501 ± 1.127
1.002HisAsp: 1.002 ± 0.526
1.503HisGlu: 1.503 ± 0.789
2.004HisPhe: 2.004 ± 0.803
2.004HisGly: 2.004 ± 1.019
1.503HisHis: 1.503 ± 0.914
1.503HisIle: 1.503 ± 0.728
1.503HisLys: 1.503 ± 1.091
2.505HisLeu: 2.505 ± 0.955
0.501HisMet: 0.501 ± 0.263
0.501HisAsn: 0.501 ± 0.263
1.503HisPro: 1.503 ± 0.789
1.002HisGln: 1.002 ± 0.744
1.002HisArg: 1.002 ± 0.744
2.004HisSer: 2.004 ± 0.969
1.503HisThr: 1.503 ± 0.789
1.002HisVal: 1.002 ± 0.933
0.0HisTrp: 0.0 ± 0.0
1.002HisTyr: 1.002 ± 0.744
0.0HisXaa: 0.0 ± 0.0
Ile
6.012IleAla: 6.012 ± 2.38
0.501IleCys: 0.501 ± 0.263
1.503IleAsp: 1.503 ± 0.789
4.008IleGlu: 4.008 ± 1.578
3.006IlePhe: 3.006 ± 1.169
1.503IleGly: 1.503 ± 0.864
2.505IleHis: 2.505 ± 0.955
2.004IleIle: 2.004 ± 1.052
5.01IleLys: 5.01 ± 1.455
8.016IleLeu: 8.016 ± 2.096
1.002IleMet: 1.002 ± 0.526
3.006IleAsn: 3.006 ± 1.578
3.006IlePro: 3.006 ± 1.455
3.006IleGln: 3.006 ± 1.136
3.507IleArg: 3.507 ± 1.161
4.008IleSer: 4.008 ± 3.635
4.008IleThr: 4.008 ± 3.282
3.507IleVal: 3.507 ± 2.085
0.0IleTrp: 0.0 ± 0.0
3.006IleTyr: 3.006 ± 1.791
0.0IleXaa: 0.0 ± 0.0
Lys
6.513LysAla: 6.513 ± 2.156
0.0LysCys: 0.0 ± 0.0
2.505LysAsp: 2.505 ± 0.948
3.006LysGlu: 3.006 ± 1.524
2.505LysPhe: 2.505 ± 0.948
1.002LysGly: 1.002 ± 0.526
1.503LysHis: 1.503 ± 0.728
4.008LysIle: 4.008 ± 1.041
3.006LysLys: 3.006 ± 1.578
6.513LysLeu: 6.513 ± 2.747
0.501LysMet: 0.501 ± 1.127
2.505LysAsn: 2.505 ± 1.315
3.006LysPro: 3.006 ± 1.136
0.501LysGln: 0.501 ± 0.263
2.004LysArg: 2.004 ± 0.875
6.012LysSer: 6.012 ± 2.559
7.515LysThr: 7.515 ± 1.656
4.509LysVal: 4.509 ± 1.58
0.0LysTrp: 0.0 ± 0.0
1.002LysTyr: 1.002 ± 0.933
0.0LysXaa: 0.0 ± 0.0
Leu
6.012LeuAla: 6.012 ± 3.21
1.503LeuCys: 1.503 ± 0.728
4.008LeuAsp: 4.008 ± 3.127
7.014LeuGlu: 7.014 ± 1.731
7.515LeuPhe: 7.515 ± 2.924
6.513LeuGly: 6.513 ± 2.322
1.002LeuHis: 1.002 ± 0.526
7.014LeuIle: 7.014 ± 2.621
6.513LeuLys: 6.513 ± 2.812
8.016LeuLeu: 8.016 ± 5.732
2.505LeuMet: 2.505 ± 1.025
5.511LeuAsn: 5.511 ± 2.5
6.513LeuPro: 6.513 ± 2.972
4.008LeuGln: 4.008 ± 1.371
4.509LeuArg: 4.509 ± 1.257
9.018LeuSer: 9.018 ± 3.07
9.018LeuThr: 9.018 ± 2.718
5.511LeuVal: 5.511 ± 3.08
2.004LeuTrp: 2.004 ± 1.052
1.503LeuTyr: 1.503 ± 0.789
0.0LeuXaa: 0.0 ± 0.0
Met
2.505MetAla: 2.505 ± 0.881
0.0MetCys: 0.0 ± 0.0
1.002MetAsp: 1.002 ± 0.526
0.501MetGlu: 0.501 ± 0.263
1.002MetPhe: 1.002 ± 0.969
0.501MetGly: 0.501 ± 0.263
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.002MetLys: 1.002 ± 0.526
3.006MetLeu: 3.006 ± 0.999
0.0MetMet: 0.0 ± 0.0
2.505MetAsn: 2.505 ± 1.315
1.002MetPro: 1.002 ± 0.933
1.002MetGln: 1.002 ± 1.542
2.004MetArg: 2.004 ± 1.052
0.501MetSer: 0.501 ± 0.845
1.002MetThr: 1.002 ± 0.526
1.503MetVal: 1.503 ± 0.789
0.0MetTrp: 0.0 ± 0.0
0.501MetTyr: 0.501 ± 0.263
0.0MetXaa: 0.0 ± 0.0
Asn
4.509AsnAla: 4.509 ± 1.312
1.503AsnCys: 1.503 ± 0.864
2.505AsnAsp: 2.505 ± 0.881
1.002AsnGlu: 1.002 ± 0.969
3.006AsnPhe: 3.006 ± 1.455
2.505AsnGly: 2.505 ± 0.948
0.501AsnHis: 0.501 ± 0.263
5.01AsnIle: 5.01 ± 2.051
3.006AsnLys: 3.006 ± 0.999
6.012AsnLeu: 6.012 ± 1.003
0.501AsnMet: 0.501 ± 0.263
2.004AsnAsn: 2.004 ± 2.381
4.509AsnPro: 4.509 ± 1.693
2.505AsnGln: 2.505 ± 0.881
1.002AsnArg: 1.002 ± 0.526
5.01AsnSer: 5.01 ± 0.995
4.008AsnThr: 4.008 ± 1.165
1.503AsnVal: 1.503 ± 0.789
0.501AsnTrp: 0.501 ± 1.127
1.503AsnTyr: 1.503 ± 0.864
0.0AsnXaa: 0.0 ± 0.0
Pro
5.01ProAla: 5.01 ± 4.689
1.002ProCys: 1.002 ± 1.542
3.006ProAsp: 3.006 ± 1.249
4.509ProGlu: 4.509 ± 1.257
2.004ProPhe: 2.004 ± 1.052
1.503ProGly: 1.503 ± 0.728
2.505ProHis: 2.505 ± 1.829
4.008ProIle: 4.008 ± 1.041
4.008ProLys: 4.008 ± 1.041
5.511ProLeu: 5.511 ± 2.927
0.501ProMet: 0.501 ± 0.736
1.002ProAsn: 1.002 ± 0.526
2.004ProPro: 2.004 ± 3.102
0.501ProGln: 0.501 ± 0.263
2.004ProArg: 2.004 ± 1.052
4.509ProSer: 4.509 ± 1.842
2.505ProThr: 2.505 ± 0.881
3.507ProVal: 3.507 ± 1.51
0.501ProTrp: 0.501 ± 0.263
1.503ProTyr: 1.503 ± 0.789
0.0ProXaa: 0.0 ± 0.0
Gln
2.505GlnAla: 2.505 ± 0.881
1.002GlnCys: 1.002 ± 0.526
3.006GlnAsp: 3.006 ± 0.579
2.004GlnGlu: 2.004 ± 1.44
1.503GlnPhe: 1.503 ± 0.864
2.505GlnGly: 2.505 ± 1.489
1.503GlnHis: 1.503 ± 0.728
3.006GlnIle: 3.006 ± 1.366
0.0GlnLys: 0.0 ± 0.0
8.016GlnLeu: 8.016 ± 2.546
1.503GlnMet: 1.503 ± 0.735
1.002GlnAsn: 1.002 ± 0.969
3.006GlnPro: 3.006 ± 1.701
0.501GlnGln: 0.501 ± 0.263
0.501GlnArg: 0.501 ± 0.263
3.006GlnSer: 3.006 ± 0.579
2.505GlnThr: 2.505 ± 0.694
1.002GlnVal: 1.002 ± 0.744
0.501GlnTrp: 0.501 ± 0.263
0.501GlnTyr: 0.501 ± 0.263
0.0GlnXaa: 0.0 ± 0.0
Arg
2.505ArgAla: 2.505 ± 0.881
1.002ArgCys: 1.002 ± 0.526
2.004ArgAsp: 2.004 ± 0.969
3.006ArgGlu: 3.006 ± 1.136
3.507ArgPhe: 3.507 ± 1.15
1.002ArgGly: 1.002 ± 1.542
1.503ArgHis: 1.503 ± 1.091
2.505ArgIle: 2.505 ± 2.023
2.505ArgLys: 2.505 ± 1.315
5.01ArgLeu: 5.01 ± 1.455
1.002ArgMet: 1.002 ± 0.526
2.505ArgAsn: 2.505 ± 0.948
1.503ArgPro: 1.503 ± 0.728
3.006ArgGln: 3.006 ± 1.69
1.002ArgArg: 1.002 ± 0.526
4.008ArgSer: 4.008 ± 2.701
2.505ArgThr: 2.505 ± 1.448
1.002ArgVal: 1.002 ± 0.969
0.0ArgTrp: 0.0 ± 0.0
3.507ArgTyr: 3.507 ± 1.035
0.0ArgXaa: 0.0 ± 0.0
Ser
3.507SerAla: 3.507 ± 2.012
1.503SerCys: 1.503 ± 0.914
4.509SerAsp: 4.509 ± 1.842
5.511SerGlu: 5.511 ± 2.075
3.006SerPhe: 3.006 ± 1.136
2.505SerGly: 2.505 ± 1.899
2.505SerHis: 2.505 ± 0.948
3.006SerIle: 3.006 ± 1.136
3.006SerLys: 3.006 ± 1.578
6.012SerLeu: 6.012 ± 3.908
1.002SerMet: 1.002 ± 0.526
5.511SerAsn: 5.511 ± 2.833
3.507SerPro: 3.507 ± 1.619
3.507SerGln: 3.507 ± 1.629
5.01SerArg: 5.01 ± 0.995
4.509SerSer: 4.509 ± 0.992
7.014SerThr: 7.014 ± 3.692
3.006SerVal: 3.006 ± 1.787
0.0SerTrp: 0.0 ± 0.0
2.004SerTyr: 2.004 ± 1.052
0.0SerXaa: 0.0 ± 0.0
Thr
5.511ThrAla: 5.511 ± 1.094
1.002ThrCys: 1.002 ± 0.744
5.01ThrAsp: 5.01 ± 1.064
4.509ThrGlu: 4.509 ± 1.278
7.014ThrPhe: 7.014 ± 1.423
4.008ThrGly: 4.008 ± 2.992
3.507ThrHis: 3.507 ± 1.166
6.012ThrIle: 6.012 ± 2.781
3.507ThrLys: 3.507 ± 1.441
6.012ThrLeu: 6.012 ± 3.576
1.503ThrMet: 1.503 ± 0.789
3.507ThrAsn: 3.507 ± 0.572
6.012ThrPro: 6.012 ± 2.528
1.002ThrGln: 1.002 ± 0.969
3.006ThrArg: 3.006 ± 1.808
3.507ThrSer: 3.507 ± 2.035
3.507ThrThr: 3.507 ± 3.212
5.511ThrVal: 5.511 ± 1.78
0.0ThrTrp: 0.0 ± 0.0
1.002ThrTyr: 1.002 ± 0.526
0.0ThrXaa: 0.0 ± 0.0
Val
5.511ValAla: 5.511 ± 4.152
0.501ValCys: 0.501 ± 0.263
3.006ValAsp: 3.006 ± 0.579
3.006ValGlu: 3.006 ± 1.136
1.503ValPhe: 1.503 ± 1.938
3.507ValGly: 3.507 ± 3.374
0.501ValHis: 0.501 ± 0.263
5.01ValIle: 5.01 ± 1.064
3.507ValLys: 3.507 ± 0.572
4.008ValLeu: 4.008 ± 3.072
1.503ValMet: 1.503 ± 0.789
1.503ValAsn: 1.503 ± 0.789
1.503ValPro: 1.503 ± 0.864
3.006ValGln: 3.006 ± 1.136
3.006ValArg: 3.006 ± 0.999
3.507ValSer: 3.507 ± 1.349
2.505ValThr: 2.505 ± 2.023
2.004ValVal: 2.004 ± 1.487
0.501ValTrp: 0.501 ± 1.127
2.004ValTyr: 2.004 ± 0.803
0.0ValXaa: 0.0 ± 0.0
Trp
1.002TrpAla: 1.002 ± 0.933
0.0TrpCys: 0.0 ± 0.0
1.002TrpAsp: 1.002 ± 0.969
0.501TrpGlu: 0.501 ± 0.263
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.002TrpIle: 1.002 ± 0.526
1.503TrpLys: 1.503 ± 0.789
1.002TrpLeu: 1.002 ± 1.542
0.0TrpMet: 0.0 ± 0.0
1.503TrpAsn: 1.503 ± 0.864
0.0TrpPro: 0.0 ± 0.0
0.501TrpGln: 0.501 ± 1.127
1.002TrpArg: 1.002 ± 0.526
0.0TrpSer: 0.0 ± 0.0
0.501TrpThr: 0.501 ± 0.263
0.501TrpVal: 0.501 ± 0.263
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.509TyrAla: 4.509 ± 1.312
1.002TyrCys: 1.002 ± 0.933
1.503TyrAsp: 1.503 ± 0.789
0.501TyrGlu: 0.501 ± 0.263
0.0TyrPhe: 0.0 ± 0.0
0.501TyrGly: 0.501 ± 0.263
1.002TyrHis: 1.002 ± 0.744
2.505TyrIle: 2.505 ± 1.315
1.503TyrLys: 1.503 ± 1.469
2.004TyrLeu: 2.004 ± 0.832
0.501TyrMet: 0.501 ± 0.263
2.004TyrAsn: 2.004 ± 1.35
0.0TyrPro: 0.0 ± 0.0
1.002TyrGln: 1.002 ± 0.744
1.503TyrArg: 1.503 ± 0.914
3.507TyrSer: 3.507 ± 1.349
1.503TyrThr: 1.503 ± 0.789
1.503TyrVal: 1.503 ± 0.728
0.0TyrTrp: 0.0 ± 0.0
0.501TyrTyr: 0.501 ± 0.263
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1997 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski