Amino acid dipeptide frequency for Jatropha mosaic India virus-[Lucknow]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
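The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact convention used for this page (for example, whether pairs spanning two proteins are counted, or which denominator is used) is not stated here, so the sketch assumes overlapping pairs counted within each protein.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands in for Xaa (non-standard or unknown)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; assumption: never crossing
        # the boundary between two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input, not real viral proteins.
toy = ["MASSR", "MSSB"]
freqs = dipeptide_per_mille(toy)
uniform = 1000.0 / 400  # 2.5 per mille if all 400 standard dipeptides were equally likely
```

With real data, `proteins` would be the list of protein sequences of the proteome, and each value in `freqs` could be compared against `uniform` to see whether a dipeptide is over- or underrepresented.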

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
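As a small worked example of reading the table, the sketch below converts two values from the table (SerSer at 11.938‰ and AlaCys at 0.918‰) into fractions and compares them with the 2.5‰ uniform expectation. The helper names are hypothetical, and the simple greater-than/less-than rule is an assumption; the threshold actually used for the red and blue marking is not stated on this page.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value):
    """Assumed rule: compare a per-mille value with the uniform expectation."""
    if value > UNIFORM_PER_MILLE:
        return "over"
    if value < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


ser_ser = 11.938  # SerSer, taken from the table
ala_cys = 0.918   # AlaCys, taken from the table
print(per_mille_to_fraction(ser_ser), classify(ser_ser))
print(per_mille_to_fraction(ala_cys), classify(ala_cys))
```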

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.51AlaAla: 5.51 ± 1.767
0.918AlaCys: 0.918 ± 0.727
0.918AlaAsp: 0.918 ± 0.602
0.0AlaGlu: 0.0 ± 0.0
0.918AlaPhe: 0.918 ± 0.857
1.837AlaGly: 1.837 ± 1.224
1.837AlaHis: 1.837 ± 0.942
2.755AlaIle: 2.755 ± 1.306
2.755AlaLys: 2.755 ± 1.306
6.428AlaLeu: 6.428 ± 1.353
0.918AlaMet: 0.918 ± 0.915
2.755AlaAsn: 2.755 ± 1.227
1.837AlaPro: 1.837 ± 0.983
2.755AlaGln: 2.755 ± 1.117
4.591AlaArg: 4.591 ± 1.566
5.51AlaSer: 5.51 ± 2.887
4.591AlaThr: 4.591 ± 2.048
2.755AlaVal: 2.755 ± 1.618
2.755AlaTrp: 2.755 ± 1.084
0.918AlaTyr: 0.918 ± 0.602
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.918CysGlu: 0.918 ± 0.727
0.918CysPhe: 0.918 ± 0.947
1.837CysGly: 1.837 ± 0.925
0.918CysHis: 0.918 ± 0.857
1.837CysIle: 1.837 ± 0.97
0.918CysLys: 0.918 ± 0.727
0.918CysLeu: 0.918 ± 0.947
1.837CysMet: 1.837 ± 1.398
0.918CysAsn: 0.918 ± 0.602
0.918CysPro: 0.918 ± 0.947
0.918CysGln: 0.918 ± 0.602
0.918CysArg: 0.918 ± 0.602
4.591CysSer: 4.591 ± 1.992
0.918CysThr: 0.918 ± 0.727
0.918CysVal: 0.918 ± 0.727
0.918CysTrp: 0.918 ± 0.915
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.837AspAla: 1.837 ± 1.205
0.0AspCys: 0.0 ± 0.0
2.755AspAsp: 2.755 ± 1.679
2.755AspGlu: 2.755 ± 0.799
1.837AspPhe: 1.837 ± 0.97
4.591AspGly: 4.591 ± 2.182
1.837AspHis: 1.837 ± 1.155
1.837AspIle: 1.837 ± 0.97
3.673AspLys: 3.673 ± 1.116
7.346AspLeu: 7.346 ± 3.473
0.918AspMet: 0.918 ± 0.915
0.918AspAsn: 0.918 ± 0.727
2.755AspPro: 2.755 ± 1.266
0.918AspGln: 0.918 ± 0.602
2.755AspArg: 2.755 ± 1.292
6.428AspSer: 6.428 ± 1.714
2.755AspThr: 2.755 ± 1.156
6.428AspVal: 6.428 ± 1.984
0.918AspTrp: 0.918 ± 0.602
0.918AspTyr: 0.918 ± 0.602
0.0AspXaa: 0.0 ± 0.0
Glu
4.591GluAla: 4.591 ± 1.845
0.0GluCys: 0.0 ± 0.0
0.0GluAsp: 0.0 ± 0.0
2.755GluGlu: 2.755 ± 1.306
3.673GluPhe: 3.673 ± 1.744
4.591GluGly: 4.591 ± 1.055
0.918GluHis: 0.918 ± 0.915
0.918GluIle: 0.918 ± 1.073
0.0GluLys: 0.0 ± 0.0
4.591GluLeu: 4.591 ± 1.801
0.0GluMet: 0.0 ± 0.0
5.51GluAsn: 5.51 ± 1.817
2.755GluPro: 2.755 ± 1.084
1.837GluGln: 1.837 ± 0.97
0.0GluArg: 0.0 ± 0.0
5.51GluSer: 5.51 ± 1.889
2.755GluThr: 2.755 ± 1.807
1.837GluVal: 1.837 ± 0.97
1.837GluTrp: 1.837 ± 0.925
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
1.837PheCys: 1.837 ± 1.16
3.673PheAsp: 3.673 ± 1.398
0.918PheGlu: 0.918 ± 0.727
2.755PhePhe: 2.755 ± 1.292
2.755PheGly: 2.755 ± 0.877
2.755PheHis: 2.755 ± 1.807
2.755PheIle: 2.755 ± 1.807
5.51PheLys: 5.51 ± 3.715
5.51PheLeu: 5.51 ± 1.427
0.918PheMet: 0.918 ± 0.602
2.755PheAsn: 2.755 ± 1.189
0.918PhePro: 0.918 ± 0.947
2.755PheGln: 2.755 ± 1.679
3.673PheArg: 3.673 ± 1.882
0.0PheSer: 0.0 ± 0.0
2.755PheThr: 2.755 ± 0.799
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
0.918PheTyr: 0.918 ± 0.727
0.0PheXaa: 0.0 ± 0.0
Gly
1.837GlyAla: 1.837 ± 1.205
1.837GlyCys: 1.837 ± 0.958
3.673GlyAsp: 3.673 ± 1.042
5.51GlyGlu: 5.51 ± 1.853
2.755GlyPhe: 2.755 ± 1.936
3.673GlyGly: 3.673 ± 1.042
2.755GlyHis: 2.755 ± 1.679
1.837GlyIle: 1.837 ± 0.925
6.428GlyLys: 6.428 ± 2.624
3.673GlyLeu: 3.673 ± 1.105
0.918GlyMet: 0.918 ± 0.57
0.918GlyAsn: 0.918 ± 1.073
3.673GlyPro: 3.673 ± 1.609
2.755GlyGln: 2.755 ± 0.954
0.918GlyArg: 0.918 ± 0.602
1.837GlySer: 1.837 ± 0.925
4.591GlyThr: 4.591 ± 2.336
2.755GlyVal: 2.755 ± 1.74
0.0GlyTrp: 0.0 ± 0.0
0.918GlyTyr: 0.918 ± 0.947
0.0GlyXaa: 0.0 ± 0.0
His
0.918HisAla: 0.918 ± 0.727
1.837HisCys: 1.837 ± 1.26
0.918HisAsp: 0.918 ± 0.727
2.755HisGlu: 2.755 ± 1.266
2.755HisPhe: 2.755 ± 1.266
2.755HisGly: 2.755 ± 1.658
1.837HisHis: 1.837 ± 1.26
2.755HisIle: 2.755 ± 1.411
0.0HisLys: 0.0 ± 0.0
1.837HisLeu: 1.837 ± 1.205
0.0HisMet: 0.0 ± 0.0
4.591HisAsn: 4.591 ± 1.877
0.918HisPro: 0.918 ± 0.602
1.837HisGln: 1.837 ± 1.179
4.591HisArg: 4.591 ± 2.579
1.837HisSer: 1.837 ± 1.155
2.755HisThr: 2.755 ± 1.703
2.755HisVal: 2.755 ± 1.857
0.0HisTrp: 0.0 ± 0.0
0.918HisTyr: 0.918 ± 0.602
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
2.755IleCys: 2.755 ± 1.304
2.755IleAsp: 2.755 ± 1.423
1.837IleGlu: 1.837 ± 1.034
2.755IlePhe: 2.755 ± 1.084
0.918IleGly: 0.918 ± 0.727
0.0IleHis: 0.0 ± 0.0
4.591IleIle: 4.591 ± 1.749
5.51IleLys: 5.51 ± 1.427
2.755IleLeu: 2.755 ± 0.954
0.918IleMet: 0.918 ± 0.78
3.673IleAsn: 3.673 ± 1.429
0.918IlePro: 0.918 ± 0.602
2.755IleGln: 2.755 ± 1.181
6.428IleArg: 6.428 ± 1.718
4.591IleSer: 4.591 ± 1.477
1.837IleThr: 1.837 ± 1.831
0.918IleVal: 0.918 ± 0.602
0.918IleTrp: 0.918 ± 0.915
2.755IleTyr: 2.755 ± 0.877
0.0IleXaa: 0.0 ± 0.0
Lys
1.837LysAla: 1.837 ± 1.034
0.918LysCys: 0.918 ± 0.602
2.755LysAsp: 2.755 ± 1.807
3.673LysGlu: 3.673 ± 1.609
2.755LysPhe: 2.755 ± 1.423
0.918LysGly: 0.918 ± 0.602
1.837LysHis: 1.837 ± 0.925
6.428LysIle: 6.428 ± 2.768
0.0LysLys: 0.0 ± 0.0
0.0LysLeu: 0.0 ± 0.0
0.0LysMet: 0.0 ± 0.0
4.591LysAsn: 4.591 ± 1.722
3.673LysPro: 3.673 ± 1.427
0.918LysGln: 0.918 ± 0.947
5.51LysArg: 5.51 ± 1.735
5.51LysSer: 5.51 ± 2.038
3.673LysThr: 3.673 ± 0.932
4.591LysVal: 4.591 ± 2.793
0.918LysTrp: 0.918 ± 0.727
3.673LysTyr: 3.673 ± 0.932
0.0LysXaa: 0.0 ± 0.0
Leu
2.755LeuAla: 2.755 ± 1.156
1.837LeuCys: 1.837 ± 1.205
8.264LeuAsp: 8.264 ± 2.811
4.591LeuGlu: 4.591 ± 1.369
0.918LeuPhe: 0.918 ± 0.857
5.51LeuGly: 5.51 ± 1.453
4.591LeuHis: 4.591 ± 1.348
6.428LeuIle: 6.428 ± 3.089
5.51LeuLys: 5.51 ± 1.399
1.837LeuLeu: 1.837 ± 1.398
0.918LeuMet: 0.918 ± 0.727
3.673LeuAsn: 3.673 ± 1.396
0.0LeuPro: 0.0 ± 0.0
1.837LeuGln: 1.837 ± 0.942
8.264LeuArg: 8.264 ± 3.226
3.673LeuSer: 3.673 ± 2.41
5.51LeuThr: 5.51 ± 1.728
4.591LeuVal: 4.591 ± 1.395
0.918LeuTrp: 0.918 ± 1.073
5.51LeuTyr: 5.51 ± 1.546
0.0LeuXaa: 0.0 ± 0.0
Met
0.918MetAla: 0.918 ± 0.727
1.837MetCys: 1.837 ± 1.224
4.591MetAsp: 4.591 ± 2.521
0.0MetGlu: 0.0 ± 0.0
2.755MetPhe: 2.755 ± 2.18
1.837MetGly: 1.837 ± 0.983
1.837MetHis: 1.837 ± 1.224
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
3.673MetLeu: 3.673 ± 1.234
0.918MetMet: 0.918 ± 1.073
1.837MetAsn: 1.837 ± 0.97
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.918MetArg: 0.918 ± 0.857
1.837MetSer: 1.837 ± 0.699
0.0MetThr: 0.0 ± 0.0
0.918MetVal: 0.918 ± 0.947
1.837MetTrp: 1.837 ± 0.942
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.591AsnAla: 4.591 ± 1.383
1.837AsnCys: 1.837 ± 1.713
2.755AsnAsp: 2.755 ± 1.306
1.837AsnGlu: 1.837 ± 1.16
0.0AsnPhe: 0.0 ± 0.0
2.755AsnGly: 2.755 ± 1.304
3.673AsnHis: 3.673 ± 1.381
1.837AsnIle: 1.837 ± 0.699
0.918AsnLys: 0.918 ± 0.602
6.428AsnLeu: 6.428 ± 2.243
2.755AsnMet: 2.755 ± 2.045
1.837AsnAsn: 1.837 ± 0.958
4.591AsnPro: 4.591 ± 1.108
0.918AsnGln: 0.918 ± 0.727
2.755AsnArg: 2.755 ± 1.469
5.51AsnSer: 5.51 ± 1.762
2.755AsnThr: 2.755 ± 1.423
1.837AsnVal: 1.837 ± 1.205
0.0AsnTrp: 0.0 ± 0.0
2.755AsnTyr: 2.755 ± 1.084
0.0AsnXaa: 0.0 ± 0.0
Pro
3.673ProAla: 3.673 ± 1.079
1.837ProCys: 1.837 ± 1.16
3.673ProAsp: 3.673 ± 2.065
2.755ProGlu: 2.755 ± 1.306
1.837ProPhe: 1.837 ± 1.034
2.755ProGly: 2.755 ± 0.954
2.755ProHis: 2.755 ± 1.807
0.918ProIle: 0.918 ± 0.857
4.591ProLys: 4.591 ± 3.012
4.591ProLeu: 4.591 ± 1.342
0.918ProMet: 0.918 ± 0.727
2.755ProAsn: 2.755 ± 1.423
1.837ProPro: 1.837 ± 0.983
5.51ProGln: 5.51 ± 2.726
5.51ProArg: 5.51 ± 1.932
3.673ProSer: 3.673 ± 1.873
4.591ProThr: 4.591 ± 2.141
3.673ProVal: 3.673 ± 1.215
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.673GlnAla: 3.673 ± 1.579
0.918GlnCys: 0.918 ± 0.915
2.755GlnAsp: 2.755 ± 1.301
0.918GlnGlu: 0.918 ± 0.727
0.918GlnPhe: 0.918 ± 0.602
1.837GlnGly: 1.837 ± 1.205
0.918GlnHis: 0.918 ± 1.073
2.755GlnIle: 2.755 ± 1.423
2.755GlnLys: 2.755 ± 1.791
2.755GlnLeu: 2.755 ± 2.133
1.837GlnMet: 1.837 ± 1.398
1.837GlnAsn: 1.837 ± 0.925
4.591GlnPro: 4.591 ± 2.956
3.673GlnGln: 3.673 ± 1.02
0.918GlnArg: 0.918 ± 1.073
3.673GlnSer: 3.673 ± 1.151
1.837GlnThr: 1.837 ± 1.421
1.837GlnVal: 1.837 ± 0.699
0.0GlnTrp: 0.0 ± 0.0
1.837GlnTyr: 1.837 ± 0.699
0.0GlnXaa: 0.0 ± 0.0
Arg
5.51ArgAla: 5.51 ± 1.774
2.755ArgCys: 2.755 ± 1.791
3.673ArgAsp: 3.673 ± 1.882
2.755ArgGlu: 2.755 ± 1.227
5.51ArgPhe: 5.51 ± 1.755
3.673ArgGly: 3.673 ± 1.215
1.837ArgHis: 1.837 ± 1.16
1.837ArgIle: 1.837 ± 1.224
2.755ArgLys: 2.755 ± 1.449
3.673ArgLeu: 3.673 ± 1.634
0.918ArgMet: 0.918 ± 0.727
1.837ArgAsn: 1.837 ± 0.925
9.183ArgPro: 9.183 ± 2.17
2.755ArgGln: 2.755 ± 1.49
8.264ArgArg: 8.264 ± 3.885
5.51ArgSer: 5.51 ± 1.762
2.755ArgThr: 2.755 ± 1.304
6.428ArgVal: 6.428 ± 1.44
0.918ArgTrp: 0.918 ± 0.727
1.837ArgTyr: 1.837 ± 1.16
0.0ArgXaa: 0.0 ± 0.0
Ser
4.591SerAla: 4.591 ± 2.174
0.0SerCys: 0.0 ± 0.0
3.673SerAsp: 3.673 ± 1.02
1.837SerGlu: 1.837 ± 0.942
0.918SerPhe: 0.918 ± 0.727
1.837SerGly: 1.837 ± 0.925
0.0SerHis: 0.0 ± 0.0
2.755SerIle: 2.755 ± 1.304
5.51SerLys: 5.51 ± 1.735
4.591SerLeu: 4.591 ± 1.012
1.837SerMet: 1.837 ± 1.488
5.51SerAsn: 5.51 ± 1.184
9.183SerPro: 9.183 ± 2.028
1.837SerGln: 1.837 ± 0.983
7.346SerArg: 7.346 ± 2.333
11.938SerSer: 11.938 ± 4.75
9.183SerThr: 9.183 ± 4.273
7.346SerVal: 7.346 ± 2.981
0.0SerTrp: 0.0 ± 0.0
3.673SerTyr: 3.673 ± 1.346
0.0SerXaa: 0.0 ± 0.0
Thr
6.428ThrAla: 6.428 ± 2.423
0.0ThrCys: 0.0 ± 0.0
0.918ThrAsp: 0.918 ± 0.915
1.837ThrGlu: 1.837 ± 1.224
2.755ThrPhe: 2.755 ± 1.227
4.591ThrGly: 4.591 ± 1.549
4.591ThrHis: 4.591 ± 1.785
1.837ThrIle: 1.837 ± 1.205
2.755ThrLys: 2.755 ± 1.292
5.51ThrLeu: 5.51 ± 1.59
1.837ThrMet: 1.837 ± 1.034
2.755ThrAsn: 2.755 ± 1.292
4.591ThrPro: 4.591 ± 1.031
0.0ThrGln: 0.0 ± 0.0
4.591ThrArg: 4.591 ± 1.972
4.591ThrSer: 4.591 ± 2.944
3.673ThrThr: 3.673 ± 1.862
5.51ThrVal: 5.51 ± 2.186
1.837ThrTrp: 1.837 ± 1.421
1.837ThrTyr: 1.837 ± 0.942
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.0ValCys: 0.0 ± 0.0
3.673ValAsp: 3.673 ± 1.042
2.755ValGlu: 2.755 ± 1.934
2.755ValPhe: 2.755 ± 1.449
2.755ValGly: 2.755 ± 1.469
2.755ValHis: 2.755 ± 1.301
3.673ValIle: 3.673 ± 1.353
3.673ValLys: 3.673 ± 1.215
6.428ValLeu: 6.428 ± 3.438
2.755ValMet: 2.755 ± 1.292
0.918ValAsn: 0.918 ± 0.727
3.673ValPro: 3.673 ± 0.826
6.428ValGln: 6.428 ± 1.468
4.591ValArg: 4.591 ± 2.793
2.755ValSer: 2.755 ± 0.954
2.755ValThr: 2.755 ± 2.18
1.837ValVal: 1.837 ± 1.453
1.837ValTrp: 1.837 ± 0.699
4.591ValTyr: 4.591 ± 1.827
0.0ValXaa: 0.0 ± 0.0
Trp
1.837TrpAla: 1.837 ± 1.205
0.0TrpCys: 0.0 ± 0.0
0.918TrpAsp: 0.918 ± 0.947
0.918TrpGlu: 0.918 ± 0.915
0.918TrpPhe: 0.918 ± 0.602
0.918TrpGly: 0.918 ± 0.602
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
1.837TrpMet: 1.837 ± 0.97
0.918TrpAsn: 0.918 ± 1.073
0.0TrpPro: 0.0 ± 0.0
0.918TrpGln: 0.918 ± 0.602
0.918TrpArg: 0.918 ± 0.857
0.918TrpSer: 0.918 ± 0.727
1.837TrpThr: 1.837 ± 0.97
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
2.755TrpTyr: 2.755 ± 0.954
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.755TyrAla: 2.755 ± 1.292
0.0TyrCys: 0.0 ± 0.0
1.837TyrAsp: 1.837 ± 1.16
2.755TyrGlu: 2.755 ± 1.084
2.755TyrPhe: 2.755 ± 0.877
0.918TyrGly: 0.918 ± 0.602
0.918TyrHis: 0.918 ± 0.947
1.837TyrIle: 1.837 ± 0.699
0.918TyrLys: 0.918 ± 0.602
4.591TyrLeu: 4.591 ± 1.342
1.837TyrMet: 1.837 ± 0.889
1.837TyrAsn: 1.837 ± 0.699
1.837TyrPro: 1.837 ± 0.983
0.918TyrGln: 0.918 ± 0.727
0.918TyrArg: 0.918 ± 0.727
4.591TyrSer: 4.591 ± 1.025
0.918TyrThr: 0.918 ± 0.915
3.673TyrVal: 3.673 ± 1.02
0.0TyrTrp: 0.0 ± 0.0
0.918TyrTyr: 0.918 ± 0.857
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1090 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
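A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the code behind this page: the function names, the fixed seed, and the use of the sample standard deviation as the reported error are all assumptions.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread of the frequency across bootstrap replicates.

    Assumption: the error is the sample standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input, not real viral proteins.
toy = ["MASSR", "MSSKSS", "MAAK"]
error_ss = bootstrap_error(toy, "SS")
```

Because whole proteins are resampled, a dipeptide that occurs in none of the proteins has a frequency of 0.0 in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.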

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski