Amino acid dipepetide frequency for Panicum mosaic virus (strain United States/Kansas 109S) (PMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.344AlaAla: 5.344 ± 1.168
0.668AlaCys: 0.668 ± 0.362
2.004AlaAsp: 2.004 ± 1.603
4.676AlaGlu: 4.676 ± 1.784
2.672AlaPhe: 2.672 ± 1.458
4.008AlaGly: 4.008 ± 2.744
1.336AlaHis: 1.336 ± 0.723
4.008AlaIle: 4.008 ± 1.468
4.008AlaLys: 4.008 ± 1.588
5.344AlaLeu: 5.344 ± 1.317
1.336AlaMet: 1.336 ± 0.595
3.34AlaAsn: 3.34 ± 1.808
4.008AlaPro: 4.008 ± 1.558
2.672AlaGln: 2.672 ± 2.945
6.012AlaArg: 6.012 ± 2.716
3.34AlaSer: 3.34 ± 1.143
7.348AlaThr: 7.348 ± 2.12
6.012AlaVal: 6.012 ± 1.712
1.336AlaTrp: 1.336 ± 1.127
1.336AlaTyr: 1.336 ± 0.723
0.0AlaXaa: 0.0 ± 0.0
Cys
0.668CysAla: 0.668 ± 0.362
0.0CysCys: 0.0 ± 0.0
1.336CysAsp: 1.336 ± 0.723
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.668CysGly: 0.668 ± 0.362
0.668CysHis: 0.668 ± 0.362
1.336CysIle: 1.336 ± 0.723
1.336CysLys: 1.336 ± 0.723
2.672CysLeu: 2.672 ± 1.859
0.668CysMet: 0.668 ± 0.362
0.0CysAsn: 0.0 ± 0.0
2.672CysPro: 2.672 ± 1.454
1.336CysGln: 1.336 ± 1.127
1.336CysArg: 1.336 ± 0.723
1.336CysSer: 1.336 ± 0.723
0.0CysThr: 0.0 ± 0.0
1.336CysVal: 1.336 ± 1.127
0.0CysTrp: 0.0 ± 0.0
1.336CysTyr: 1.336 ± 1.594
0.0CysXaa: 0.0 ± 0.0
Asp
4.008AspAla: 4.008 ± 1.108
0.668AspCys: 0.668 ± 1.335
2.672AspAsp: 2.672 ± 1.446
3.34AspGlu: 3.34 ± 1.157
0.668AspPhe: 0.668 ± 0.362
4.676AspGly: 4.676 ± 1.624
1.336AspHis: 1.336 ± 1.127
0.668AspIle: 0.668 ± 0.362
4.676AspLys: 4.676 ± 1.43
2.672AspLeu: 2.672 ± 0.899
0.668AspMet: 0.668 ± 0.362
1.336AspAsn: 1.336 ± 0.723
4.008AspPro: 4.008 ± 1.655
2.672AspGln: 2.672 ± 1.446
5.344AspArg: 5.344 ± 2.889
4.008AspSer: 4.008 ± 1.46
3.34AspThr: 3.34 ± 1.189
0.668AspVal: 0.668 ± 1.335
0.0AspTrp: 0.0 ± 0.0
1.336AspTyr: 1.336 ± 0.729
0.0AspXaa: 0.0 ± 0.0
Glu
3.34GluAla: 3.34 ± 1.808
0.668GluCys: 0.668 ± 0.362
3.34GluAsp: 3.34 ± 3.177
3.34GluGlu: 3.34 ± 1.157
3.34GluPhe: 3.34 ± 1.808
1.336GluGly: 1.336 ± 0.723
2.004GluHis: 2.004 ± 1.01
4.008GluIle: 4.008 ± 1.558
2.004GluLys: 2.004 ± 1.559
4.008GluLeu: 4.008 ± 0.845
1.336GluMet: 1.336 ± 0.729
0.668GluAsn: 0.668 ± 1.66
2.004GluPro: 2.004 ± 1.358
2.004GluGln: 2.004 ± 1.085
4.008GluArg: 4.008 ± 1.46
4.676GluSer: 4.676 ± 1.784
4.008GluThr: 4.008 ± 2.366
4.008GluVal: 4.008 ± 2.021
1.336GluTrp: 1.336 ± 0.729
0.668GluTyr: 0.668 ± 0.362
0.0GluXaa: 0.0 ± 0.0
Phe
2.004PheAla: 2.004 ± 0.734
1.336PheCys: 1.336 ± 0.723
2.672PheAsp: 2.672 ± 1.446
2.004PheGlu: 2.004 ± 0.734
0.668PhePhe: 0.668 ± 0.362
2.004PheGly: 2.004 ± 1.085
0.0PheHis: 0.0 ± 0.0
2.004PheIle: 2.004 ± 1.085
4.008PheLys: 4.008 ± 1.357
1.336PheLeu: 1.336 ± 0.723
0.668PheMet: 0.668 ± 1.965
1.336PheAsn: 1.336 ± 3.32
0.668PhePro: 0.668 ± 0.362
0.0PheGln: 0.0 ± 0.0
1.336PheArg: 1.336 ± 0.723
2.672PheSer: 2.672 ± 2.634
0.668PheThr: 0.668 ± 1.335
2.004PheVal: 2.004 ± 1.085
0.0PheTrp: 0.0 ± 0.0
2.672PheTyr: 2.672 ± 0.899
0.0PheXaa: 0.0 ± 0.0
Gly
5.344GlyAla: 5.344 ± 3.401
0.668GlyCys: 0.668 ± 0.362
3.34GlyAsp: 3.34 ± 1.417
2.004GlyGlu: 2.004 ± 1.01
3.34GlyPhe: 3.34 ± 1.808
2.004GlyGly: 2.004 ± 1.085
0.0GlyHis: 0.0 ± 0.0
4.008GlyIle: 4.008 ± 1.562
6.012GlyLys: 6.012 ± 1.991
6.68GlyLeu: 6.68 ± 2.813
1.336GlyMet: 1.336 ± 0.723
2.672GlyAsn: 2.672 ± 1.458
2.004GlyPro: 2.004 ± 1.615
2.004GlyGln: 2.004 ± 1.582
4.008GlyArg: 4.008 ± 2.366
3.34GlySer: 3.34 ± 3.872
1.336GlyThr: 1.336 ± 0.723
4.008GlyVal: 4.008 ± 1.46
1.336GlyTrp: 1.336 ± 0.729
2.004GlyTyr: 2.004 ± 1.085
0.0GlyXaa: 0.0 ± 0.0
His
1.336HisAla: 1.336 ± 1.127
0.0HisCys: 0.0 ± 0.0
0.668HisAsp: 0.668 ± 0.362
1.336HisGlu: 1.336 ± 1.835
0.668HisPhe: 0.668 ± 0.362
3.34HisGly: 3.34 ± 1.808
2.004HisHis: 2.004 ± 2.052
1.336HisIle: 1.336 ± 0.723
1.336HisLys: 1.336 ± 0.723
1.336HisLeu: 1.336 ± 0.729
0.668HisMet: 0.668 ± 0.362
1.336HisAsn: 1.336 ± 0.723
2.004HisPro: 2.004 ± 1.085
0.668HisGln: 0.668 ± 0.362
1.336HisArg: 1.336 ± 1.652
2.004HisSer: 2.004 ± 1.01
3.34HisThr: 3.34 ± 1.722
1.336HisVal: 1.336 ± 0.723
0.668HisTrp: 0.668 ± 0.362
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.668IleAla: 0.668 ± 0.362
0.668IleCys: 0.668 ± 0.362
3.34IleAsp: 3.34 ± 1.157
4.008IleGlu: 4.008 ± 2.185
2.004IlePhe: 2.004 ± 1.085
1.336IleGly: 1.336 ± 0.729
4.008IleHis: 4.008 ± 1.46
1.336IleIle: 1.336 ± 1.772
2.004IleLys: 2.004 ± 1.085
3.34IleLeu: 3.34 ± 1.371
0.0IleMet: 0.0 ± 0.0
0.668IleAsn: 0.668 ± 0.362
4.008IlePro: 4.008 ± 3.163
3.34IleGln: 3.34 ± 1.835
0.668IleArg: 0.668 ± 0.362
4.676IleSer: 4.676 ± 1.784
4.008IleThr: 4.008 ± 2.17
2.672IleVal: 2.672 ± 1.334
1.336IleTrp: 1.336 ± 1.127
2.672IleTyr: 2.672 ± 0.899
0.0IleXaa: 0.0 ± 0.0
Lys
4.008LysAla: 4.008 ± 2.17
1.336LysCys: 1.336 ± 1.594
3.34LysAsp: 3.34 ± 1.722
4.008LysGlu: 4.008 ± 1.562
0.0LysPhe: 0.0 ± 0.0
3.34LysGly: 3.34 ± 1.157
0.668LysHis: 0.668 ± 0.362
2.672LysIle: 2.672 ± 1.364
2.672LysLys: 2.672 ± 1.446
3.34LysLeu: 3.34 ± 1.157
1.336LysMet: 1.336 ± 1.506
2.004LysAsn: 2.004 ± 1.085
4.008LysPro: 4.008 ± 1.46
2.672LysGln: 2.672 ± 0.899
2.672LysArg: 2.672 ± 0.899
7.348LysSer: 7.348 ± 1.805
4.676LysThr: 4.676 ± 1.43
3.34LysVal: 3.34 ± 1.808
2.672LysTrp: 2.672 ± 1.446
4.008LysTyr: 4.008 ± 1.916
0.668LysXaa: 0.668 ± 0.362
Leu
6.68LeuAla: 6.68 ± 1.834
1.336LeuCys: 1.336 ± 1.594
2.004LeuAsp: 2.004 ± 1.01
4.676LeuGlu: 4.676 ± 2.531
0.668LeuPhe: 0.668 ± 1.707
6.012LeuGly: 6.012 ± 1.299
2.672LeuHis: 2.672 ± 1.446
2.672LeuIle: 2.672 ± 1.607
4.008LeuLys: 4.008 ± 2.17
12.024LeuLeu: 12.024 ± 7.31
1.336LeuMet: 1.336 ± 0.782
1.336LeuAsn: 1.336 ± 0.723
5.344LeuPro: 5.344 ± 3.11
4.676LeuGln: 4.676 ± 2.436
6.68LeuArg: 6.68 ± 2.712
12.692LeuSer: 12.692 ± 2.782
5.344LeuThr: 5.344 ± 1.13
10.02LeuVal: 10.02 ± 2.608
2.004LeuTrp: 2.004 ± 2.444
3.34LeuTyr: 3.34 ± 2.11
0.0LeuXaa: 0.0 ± 0.0
Met
1.336MetAla: 1.336 ± 1.594
0.668MetCys: 0.668 ± 0.362
0.668MetAsp: 0.668 ± 0.886
2.004MetGlu: 2.004 ± 1.01
0.668MetPhe: 0.668 ± 0.362
1.336MetGly: 1.336 ± 0.729
0.0MetHis: 0.0 ± 0.0
0.668MetIle: 0.668 ± 0.362
2.004MetLys: 2.004 ± 1.085
1.336MetLeu: 1.336 ± 1.127
0.0MetMet: 0.0 ± 0.0
2.672MetAsn: 2.672 ± 0.899
1.336MetPro: 1.336 ± 0.723
0.668MetGln: 0.668 ± 0.362
1.336MetArg: 1.336 ± 1.772
2.672MetSer: 2.672 ± 1.334
0.0MetThr: 0.0 ± 0.0
0.668MetVal: 0.668 ± 0.362
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.336AsnCys: 1.336 ± 0.723
1.336AsnAsp: 1.336 ± 0.723
2.004AsnGlu: 2.004 ± 1.085
1.336AsnPhe: 1.336 ± 3.32
2.672AsnGly: 2.672 ± 1.095
0.668AsnHis: 0.668 ± 0.362
2.004AsnIle: 2.004 ± 1.615
1.336AsnLys: 1.336 ± 1.473
3.34AsnLeu: 3.34 ± 0.909
0.0AsnMet: 0.0 ± 0.0
1.336AsnAsn: 1.336 ± 0.723
4.676AsnPro: 4.676 ± 1.601
2.004AsnGln: 2.004 ± 1.085
0.668AsnArg: 0.668 ± 0.886
2.004AsnSer: 2.004 ± 0.734
1.336AsnThr: 1.336 ± 1.473
2.672AsnVal: 2.672 ± 1.607
0.668AsnTrp: 0.668 ± 0.362
1.336AsnTyr: 1.336 ± 0.729
0.0AsnXaa: 0.0 ± 0.0
Pro
4.676ProAla: 4.676 ± 1.39
1.336ProCys: 1.336 ± 0.723
2.004ProAsp: 2.004 ± 1.085
4.676ProGlu: 4.676 ± 1.347
2.004ProPhe: 2.004 ± 1.01
1.336ProGly: 1.336 ± 0.729
0.668ProHis: 0.668 ± 0.886
4.008ProIle: 4.008 ± 1.468
4.676ProLys: 4.676 ± 1.601
6.012ProLeu: 6.012 ± 3.509
0.0ProMet: 0.0 ± 0.0
2.004ProAsn: 2.004 ± 2.429
2.672ProPro: 2.672 ± 3.304
1.336ProGln: 1.336 ± 0.723
7.348ProArg: 7.348 ± 1.641
7.348ProSer: 7.348 ± 2.589
4.676ProThr: 4.676 ± 1.301
4.008ProVal: 4.008 ± 1.916
1.336ProTrp: 1.336 ± 0.723
0.668ProTyr: 0.668 ± 1.335
0.0ProXaa: 0.0 ± 0.0
Gln
4.008GlnAla: 4.008 ± 1.108
2.004GlnCys: 2.004 ± 1.01
2.672GlnAsp: 2.672 ± 1.334
1.336GlnGlu: 1.336 ± 2.422
0.668GlnPhe: 0.668 ± 0.362
2.672GlnGly: 2.672 ± 1.458
2.004GlnHis: 2.004 ± 1.01
2.004GlnIle: 2.004 ± 1.085
4.008GlnLys: 4.008 ± 1.382
4.676GlnLeu: 4.676 ± 1.912
0.668GlnMet: 0.668 ± 0.362
0.668GlnAsn: 0.668 ± 1.707
3.34GlnPro: 3.34 ± 1.808
1.336GlnGln: 1.336 ± 1.473
2.004GlnArg: 2.004 ± 1.358
0.668GlnSer: 0.668 ± 1.335
2.004GlnThr: 2.004 ± 1.603
0.668GlnVal: 0.668 ± 0.886
1.336GlnTrp: 1.336 ± 0.723
3.34GlnTyr: 3.34 ± 0.909
0.0GlnXaa: 0.0 ± 0.0
Arg
2.672ArgAla: 2.672 ± 2.458
1.336ArgCys: 1.336 ± 0.723
2.004ArgAsp: 2.004 ± 2.153
2.672ArgGlu: 2.672 ± 3.773
4.676ArgPhe: 4.676 ± 1.141
5.344ArgGly: 5.344 ± 4.916
2.004ArgHis: 2.004 ± 1.085
1.336ArgIle: 1.336 ± 0.723
2.672ArgLys: 2.672 ± 1.016
6.012ArgLeu: 6.012 ± 1.4
3.34ArgMet: 3.34 ± 1.417
2.672ArgAsn: 2.672 ± 1.458
3.34ArgPro: 3.34 ± 1.417
2.672ArgGln: 2.672 ± 3.266
2.672ArgArg: 2.672 ± 1.458
4.008ArgSer: 4.008 ± 2.681
4.676ArgThr: 4.676 ± 1.39
6.012ArgVal: 6.012 ± 2.255
0.668ArgTrp: 0.668 ± 0.886
4.008ArgTyr: 4.008 ± 1.46
0.0ArgXaa: 0.0 ± 0.0
Ser
8.016SerAla: 8.016 ± 3.354
1.336SerCys: 1.336 ± 0.729
4.676SerAsp: 4.676 ± 1.784
2.004SerGlu: 2.004 ± 0.734
1.336SerPhe: 1.336 ± 0.729
4.008SerGly: 4.008 ± 2.503
2.004SerHis: 2.004 ± 0.734
4.676SerIle: 4.676 ± 1.784
4.008SerLys: 4.008 ± 1.46
10.688SerLeu: 10.688 ± 2.945
1.336SerMet: 1.336 ± 1.127
1.336SerAsn: 1.336 ± 0.723
5.344SerPro: 5.344 ± 4.508
1.336SerGln: 1.336 ± 1.473
2.672SerArg: 2.672 ± 1.454
3.34SerSer: 3.34 ± 1.717
4.008SerThr: 4.008 ± 4.537
6.68SerVal: 6.68 ± 2.031
3.34SerTrp: 3.34 ± 1.731
5.344SerTyr: 5.344 ± 3.593
0.0SerXaa: 0.0 ± 0.0
Thr
6.68ThrAla: 6.68 ± 2.378
1.336ThrCys: 1.336 ± 1.127
4.008ThrAsp: 4.008 ± 0.845
3.34ThrGlu: 3.34 ± 1.157
2.004ThrPhe: 2.004 ± 1.358
2.672ThrGly: 2.672 ± 2.458
1.336ThrHis: 1.336 ± 0.723
4.008ThrIle: 4.008 ± 2.186
2.672ThrLys: 2.672 ± 1.334
4.008ThrLeu: 4.008 ± 2.366
2.004ThrMet: 2.004 ± 1.085
1.336ThrAsn: 1.336 ± 0.723
6.68ThrPro: 6.68 ± 3.096
2.004ThrGln: 2.004 ± 1.085
5.344ThrArg: 5.344 ± 3.639
3.34ThrSer: 3.34 ± 0.909
5.344ThrThr: 5.344 ± 2.136
2.672ThrVal: 2.672 ± 2.458
0.0ThrTrp: 0.0 ± 0.0
2.004ThrTyr: 2.004 ± 0.734
0.0ThrXaa: 0.0 ± 0.0
Val
7.348ValAla: 7.348 ± 2.252
0.668ValCys: 0.668 ± 0.362
4.008ValAsp: 4.008 ± 1.357
3.34ValGlu: 3.34 ± 1.189
1.336ValPhe: 1.336 ± 0.723
5.344ValGly: 5.344 ± 1.568
2.004ValHis: 2.004 ± 1.01
2.672ValIle: 2.672 ± 0.899
3.34ValLys: 3.34 ± 1.808
8.684ValLeu: 8.684 ± 1.464
0.0ValMet: 0.0 ± 0.0
2.672ValAsn: 2.672 ± 1.095
2.672ValPro: 2.672 ± 2.061
3.34ValGln: 3.34 ± 1.722
4.676ValArg: 4.676 ± 1.624
2.004ValSer: 2.004 ± 0.734
4.008ValThr: 4.008 ± 1.46
7.348ValVal: 7.348 ± 2.729
0.668ValTrp: 0.668 ± 0.362
2.672ValTyr: 2.672 ± 1.607
0.0ValXaa: 0.0 ± 0.0
Trp
2.004TrpAla: 2.004 ± 2.444
0.668TrpCys: 0.668 ± 0.362
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
2.004TrpPhe: 2.004 ± 1.01
0.668TrpGly: 0.668 ± 0.362
0.0TrpHis: 0.0 ± 0.0
0.668TrpIle: 0.668 ± 0.886
0.0TrpLys: 0.0 ± 0.0
2.672TrpLeu: 2.672 ± 1.446
2.004TrpMet: 2.004 ± 1.085
2.004TrpAsn: 2.004 ± 2.052
0.0TrpPro: 0.0 ± 0.0
2.672TrpGln: 2.672 ± 0.899
1.336TrpArg: 1.336 ± 0.729
0.0TrpSer: 0.0 ± 0.0
1.336TrpThr: 1.336 ± 0.723
0.668TrpVal: 0.668 ± 0.362
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.668TyrAla: 0.668 ± 0.362
0.668TyrCys: 0.668 ± 1.707
2.672TyrAsp: 2.672 ± 1.446
1.336TyrGlu: 1.336 ± 0.723
0.668TyrPhe: 0.668 ± 1.335
2.004TyrGly: 2.004 ± 1.085
1.336TyrHis: 1.336 ± 3.414
0.668TyrIle: 0.668 ± 0.362
4.008TyrLys: 4.008 ± 2.17
5.344TyrLeu: 5.344 ± 1.922
0.668TyrMet: 0.668 ± 0.362
1.336TyrAsn: 1.336 ± 0.723
2.004TyrPro: 2.004 ± 1.354
2.672TyrGln: 2.672 ± 1.859
3.34TyrArg: 3.34 ± 2.329
6.012TyrSer: 6.012 ± 2.892
1.336TyrThr: 1.336 ± 0.723
2.004TyrVal: 2.004 ± 0.734
0.0TyrTrp: 0.0 ± 0.0
2.672TyrTyr: 2.672 ± 1.446
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.668XaaGly: 0.668 ± 0.362
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1498 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski