Amino acid dipeptide frequency for Cacao yellow vein banding virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
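The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it uses one-letter residue codes rather than the three-letter codes in the table, the sequences in `toy` are made up and are not the viral proteome, and normalising by the total number of overlapping pairs within each protein is an assumption that may differ from the exact normalisation used by the database. The function name `dipeptide_per_mille` is hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides: 0.25 % = 2.5 per mille.
EXPECTED_UNIFORM = 1000.0 / 400

# Made-up example sequences (assumption, for illustration only).
toy = ["MKKLA", "AKKL"]
freqs = dipeptide_per_mille(toy)

# Dipeptides above the uniform expectation would be the "red" cells.
over = sorted(pair for pair, f in freqs.items() if f > EXPECTED_UNIFORM)
print(freqs)
print(over)
```

With such a tiny input every observed pair exceeds 2.5 per mille; on a real proteome the same comparison separates overrepresented from underrepresented dipeptides.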

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.77 ± 1.875
AlaCys: 0.838 ± 0.417
AlaAsp: 2.933 ± 3.535
AlaGlu: 2.933 ± 0.905
AlaPhe: 2.514 ± 1.25
AlaGly: 3.77 ± 2.983
AlaHis: 1.676 ± 0.833
AlaIle: 6.284 ± 2.143
AlaLys: 3.77 ± 1.139
AlaLeu: 7.541 ± 4.896
AlaMet: 2.514 ± 1.25
AlaAsn: 2.933 ± 1.458
AlaPro: 3.351 ± 1.666
AlaGln: 3.351 ± 1.007
AlaArg: 3.77 ± 1.184
AlaSer: 5.865 ± 3.202
AlaThr: 5.446 ± 3.846
AlaVal: 4.608 ± 2.291
AlaTrp: 0.419 ± 0.208
AlaTyr: 2.095 ± 0.829
AlaXaa: 0.419 ± 0.208
Cys
CysAla: 0.838 ± 0.417
CysCys: 0.419 ± 0.208
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.257 ± 0.962
CysGly: 0.838 ± 0.417
CysHis: 0.419 ± 0.208
CysIle: 1.257 ± 0.95
CysLys: 2.095 ± 1.042
CysLeu: 2.095 ± 0.829
CysMet: 0.0 ± 0.0
CysAsn: 1.676 ± 0.833
CysPro: 0.419 ± 0.208
CysGln: 1.257 ± 0.962
CysArg: 0.838 ± 0.417
CysSer: 0.838 ± 0.417
CysThr: 0.419 ± 0.208
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.419 ± 0.208
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.933 ± 2.11
AspCys: 1.257 ± 0.95
AspAsp: 3.77 ± 1.875
AspGlu: 1.676 ± 0.833
AspPhe: 1.676 ± 0.833
AspGly: 2.514 ± 1.25
AspHis: 2.514 ± 1.9
AspIle: 2.933 ± 1.458
AspLys: 2.095 ± 0.965
AspLeu: 3.77 ± 2.033
AspMet: 1.676 ± 0.867
AspAsn: 3.77 ± 0.929
AspPro: 4.608 ± 1.947
AspGln: 3.77 ± 1.184
AspArg: 2.514 ± 5.408
AspSer: 3.351 ± 1.666
AspThr: 1.676 ± 0.833
AspVal: 0.838 ± 0.417
AspTrp: 1.257 ± 0.625
AspTyr: 2.933 ± 0.997
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.027 ± 3.588
GluCys: 0.419 ± 0.208
GluAsp: 4.189 ± 1.958
GluGlu: 13.825 ± 2.787
GluPhe: 2.514 ± 1.329
GluGly: 3.77 ± 1.184
GluHis: 1.676 ± 0.833
GluIle: 5.865 ± 1.045
GluLys: 5.865 ± 3.098
GluLeu: 7.122 ± 1.934
GluMet: 0.838 ± 0.417
GluAsn: 2.095 ± 0.79
GluPro: 3.351 ± 1.666
GluGln: 2.095 ± 0.79
GluArg: 5.865 ± 1.539
GluSer: 2.933 ± 1.458
GluThr: 2.514 ± 0.778
GluVal: 2.514 ± 2.227
GluTrp: 0.419 ± 0.208
GluTyr: 2.933 ± 1.174
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.676 ± 0.833
PheCys: 0.419 ± 0.208
PheAsp: 1.676 ± 2.195
PheGlu: 1.257 ± 1.103
PhePhe: 1.257 ± 0.625
PheGly: 0.838 ± 0.417
PheHis: 1.257 ± 2.347
PheIle: 2.514 ± 0.842
PheLys: 2.095 ± 0.79
PheLeu: 1.676 ± 0.855
PheMet: 1.257 ± 0.595
PheAsn: 1.257 ± 0.962
PhePro: 2.933 ± 1.458
PheGln: 2.095 ± 0.79
PheArg: 2.095 ± 0.829
PheSer: 2.514 ± 1.25
PheThr: 2.933 ± 1.458
PheVal: 2.514 ± 0.959
PheTrp: 0.838 ± 0.417
PheTyr: 2.933 ± 0.997
PheXaa: 0.419 ± 1.211
Gly
GlyAla: 4.608 ± 2.15
GlyCys: 0.838 ± 0.417
GlyAsp: 2.095 ± 0.829
GlyGlu: 3.77 ± 1.11
GlyPhe: 1.257 ± 1.103
GlyGly: 1.257 ± 0.625
GlyHis: 1.257 ± 1.858
GlyIle: 4.189 ± 1.318
GlyLys: 3.351 ± 1.075
GlyLeu: 2.933 ± 1.458
GlyMet: 0.419 ± 0.208
GlyAsn: 2.933 ± 1.458
GlyPro: 1.676 ± 0.833
GlyGln: 2.514 ± 1.25
GlyArg: 2.933 ± 1.458
GlySer: 2.933 ± 1.458
GlyThr: 4.608 ± 1.469
GlyVal: 4.189 ± 2.083
GlyTrp: 1.676 ± 0.833
GlyTyr: 0.838 ± 0.417
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.514 ± 1.25
HisCys: 1.257 ± 0.962
HisAsp: 0.838 ± 0.417
HisGlu: 0.419 ± 0.208
HisPhe: 2.095 ± 0.79
HisGly: 1.257 ± 0.625
HisHis: 0.0 ± 0.0
HisIle: 1.257 ± 0.625
HisLys: 2.095 ± 1.497
HisLeu: 1.257 ± 0.962
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.419 ± 0.208
HisGln: 2.095 ± 1.606
HisArg: 2.933 ± 1.174
HisSer: 0.419 ± 0.208
HisThr: 0.838 ± 1.098
HisVal: 2.095 ± 0.829
HisTrp: 1.257 ± 0.625
HisTyr: 0.838 ± 0.417
HisXaa: 0.419 ± 1.211
Ile
IleAla: 4.189 ± 1.18
IleCys: 2.095 ± 1.042
IleAsp: 3.77 ± 1.032
IleGlu: 5.865 ± 1.639
IlePhe: 2.095 ± 1.497
IleGly: 2.514 ± 1.25
IleHis: 1.676 ± 0.833
IleIle: 2.933 ± 1.152
IleLys: 3.351 ± 1.007
IleLeu: 4.608 ± 1.199
IleMet: 1.257 ± 0.625
IleAsn: 3.351 ± 1.666
IlePro: 4.189 ± 2.083
IleGln: 2.514 ± 2.206
IleArg: 3.77 ± 1.875
IleSer: 3.351 ± 3.177
IleThr: 4.608 ± 1.913
IleVal: 4.189 ± 2.083
IleTrp: 0.0 ± 0.0
IleTyr: 0.419 ± 0.208
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.027 ± 4.614
LysCys: 2.095 ± 0.829
LysAsp: 4.608 ± 1.199
LysGlu: 6.703 ± 1.469
LysPhe: 1.676 ± 0.867
LysGly: 4.608 ± 1.038
LysHis: 2.933 ± 3.149
LysIle: 3.77 ± 1.875
LysLys: 5.446 ± 2.982
LysLeu: 8.379 ± 3.106
LysMet: 2.095 ± 0.874
LysAsn: 2.095 ± 1.042
LysPro: 3.351 ± 1.075
LysGln: 2.514 ± 1.454
LysArg: 2.514 ± 1.923
LysSer: 2.095 ± 1.042
LysThr: 2.514 ± 0.959
LysVal: 5.027 ± 3.271
LysTrp: 0.838 ± 1.22
LysTyr: 0.838 ± 0.417
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.446 ± 0.798
LeuCys: 0.838 ± 1.098
LeuAsp: 3.77 ± 0.861
LeuGlu: 9.636 ± 8.926
LeuPhe: 1.257 ± 0.625
LeuGly: 7.541 ± 2.369
LeuHis: 2.514 ± 1.454
LeuIle: 3.351 ± 3.013
LeuLys: 7.122 ± 7.255
LeuLeu: 4.189 ± 1.964
LeuMet: 0.0 ± 0.0
LeuAsn: 4.189 ± 5.371
LeuPro: 6.703 ± 1.469
LeuGln: 5.446 ± 2.101
LeuArg: 4.608 ± 1.457
LeuSer: 5.865 ± 2.989
LeuThr: 4.608 ± 3.19
LeuVal: 6.703 ± 1.156
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.77 ± 1.032
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.676 ± 0.833
MetCys: 0.0 ± 0.0
MetAsp: 1.676 ± 0.833
MetGlu: 2.514 ± 0.842
MetPhe: 0.838 ± 0.417
MetGly: 0.419 ± 0.208
MetHis: 0.419 ± 0.208
MetIle: 1.257 ± 0.625
MetLys: 0.838 ± 0.417
MetLeu: 2.095 ± 0.829
MetMet: 0.419 ± 0.208
MetAsn: 0.419 ± 0.208
MetPro: 1.676 ± 0.833
MetGln: 1.257 ± 0.95
MetArg: 0.838 ± 0.417
MetSer: 1.257 ± 1.881
MetThr: 0.838 ± 0.417
MetVal: 2.514 ± 0.842
MetTrp: 0.419 ± 0.208
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.351 ± 1.666
AsnCys: 0.838 ± 0.417
AsnAsp: 1.257 ± 0.625
AsnGlu: 2.514 ± 0.842
AsnPhe: 2.933 ± 1.458
AsnGly: 1.676 ± 0.833
AsnHis: 0.419 ± 0.208
AsnIle: 1.676 ± 0.833
AsnLys: 2.514 ± 0.959
AsnLeu: 7.541 ± 7.543
AsnMet: 1.257 ± 0.625
AsnAsn: 3.351 ± 0.909
AsnPro: 2.095 ± 1.042
AsnGln: 2.095 ± 1.042
AsnArg: 1.676 ± 0.855
AsnSer: 4.189 ± 1.18
AsnThr: 2.514 ± 1.25
AsnVal: 2.095 ± 0.965
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.514 ± 1.329
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.122 ± 2.585
ProCys: 0.0 ± 0.0
ProAsp: 3.77 ± 1.875
ProGlu: 3.351 ± 1.666
ProPhe: 2.095 ± 1.042
ProGly: 2.933 ± 1.458
ProHis: 0.838 ± 0.417
ProIle: 2.095 ± 1.042
ProLys: 4.189 ± 0.761
ProLeu: 2.095 ± 1.042
ProMet: 0.838 ± 0.776
ProAsn: 1.257 ± 0.625
ProPro: 5.446 ± 2.708
ProGln: 2.095 ± 1.497
ProArg: 1.676 ± 0.855
ProSer: 4.189 ± 1.318
ProThr: 3.77 ± 1.032
ProVal: 2.514 ± 0.959
ProTrp: 0.838 ± 0.417
ProTyr: 0.419 ± 0.208
ProXaa: 1.257 ± 0.95
Gln
GlnAla: 3.351 ± 1.199
GlnCys: 0.0 ± 0.0
GlnAsp: 0.838 ± 1.068
GlnGlu: 5.027 ± 2.215
GlnPhe: 2.095 ± 1.042
GlnGly: 2.095 ± 0.829
GlnHis: 0.419 ± 0.208
GlnIle: 4.608 ± 3.122
GlnLys: 4.608 ± 1.947
GlnLeu: 9.217 ± 4.836
GlnMet: 1.676 ± 0.833
GlnAsn: 1.676 ± 0.833
GlnPro: 0.838 ± 0.417
GlnGln: 4.608 ± 0.834
GlnArg: 3.77 ± 1.184
GlnSer: 2.514 ± 0.959
GlnThr: 2.514 ± 0.778
GlnVal: 1.676 ± 0.833
GlnTrp: 0.838 ± 1.068
GlnTyr: 1.257 ± 0.625
GlnXaa: 0.419 ± 0.208
Arg
ArgAla: 5.027 ± 1.633
ArgCys: 0.0 ± 0.0
ArgAsp: 2.933 ± 1.458
ArgGlu: 2.933 ± 1.458
ArgPhe: 2.514 ± 1.923
ArgGly: 2.095 ± 0.829
ArgHis: 2.095 ± 1.042
ArgIle: 3.351 ± 1.007
ArgLys: 3.77 ± 1.875
ArgLeu: 4.608 ± 1.703
ArgMet: 2.095 ± 0.829
ArgAsn: 2.933 ± 1.458
ArgPro: 2.933 ± 2.448
ArgGln: 1.257 ± 0.625
ArgArg: 3.351 ± 1.666
ArgSer: 3.77 ± 2.101
ArgThr: 3.351 ± 1.007
ArgVal: 4.608 ± 3.057
ArgTrp: 2.933 ± 1.458
ArgTyr: 1.676 ± 0.833
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.77 ± 1.032
SerCys: 0.419 ± 0.208
SerAsp: 3.351 ± 2.209
SerGlu: 3.77 ± 1.875
SerPhe: 2.514 ± 0.959
SerGly: 3.77 ± 1.184
SerHis: 0.419 ± 1.211
SerIle: 2.514 ± 0.778
SerLys: 4.189 ± 1.403
SerLeu: 4.608 ± 4.818
SerMet: 0.419 ± 0.208
SerAsn: 3.77 ± 1.608
SerPro: 1.676 ± 0.833
SerGln: 5.865 ± 1.617
SerArg: 5.027 ± 2.5
SerSer: 6.703 ± 1.616
SerThr: 5.865 ± 1.224
SerVal: 2.933 ± 0.82
SerTrp: 1.257 ± 0.625
SerTyr: 1.257 ± 0.95
SerXaa: 0.419 ± 0.208
Thr
ThrAla: 6.284 ± 5.228
ThrCys: 1.257 ± 0.625
ThrAsp: 2.933 ± 0.997
ThrGlu: 4.608 ± 1.554
ThrPhe: 1.257 ± 0.962
ThrGly: 4.189 ± 2.083
ThrHis: 0.838 ± 0.417
ThrIle: 4.189 ± 1.318
ThrLys: 5.027 ± 2.416
ThrLeu: 4.608 ± 2.66
ThrMet: 1.676 ± 0.867
ThrAsn: 2.095 ± 1.042
ThrPro: 0.838 ± 1.068
ThrGln: 2.933 ± 1.152
ThrArg: 2.933 ± 0.905
ThrSer: 3.77 ± 1.875
ThrThr: 2.933 ± 0.997
ThrVal: 2.095 ± 0.829
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.095 ± 1.042
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.257 ± 0.625
ValCys: 0.838 ± 0.417
ValAsp: 2.933 ± 1.316
ValGlu: 2.514 ± 1.323
ValPhe: 3.351 ± 1.666
ValGly: 1.676 ± 0.833
ValHis: 1.676 ± 0.833
ValIle: 4.189 ± 2.083
ValLys: 2.933 ± 0.997
ValLeu: 4.189 ± 1.054
ValMet: 2.095 ± 0.743
ValAsn: 2.095 ± 0.965
ValPro: 3.77 ± 1.875
ValGln: 3.351 ± 0.909
ValArg: 4.189 ± 1.318
ValSer: 3.77 ± 2.827
ValThr: 2.933 ± 2.448
ValVal: 3.351 ± 1.075
ValTrp: 0.419 ± 0.208
ValTyr: 3.77 ± 2.238
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.838 ± 0.417
TrpCys: 0.0 ± 0.0
TrpAsp: 1.257 ± 0.625
TrpGlu: 1.257 ± 0.625
TrpPhe: 0.419 ± 0.208
TrpGly: 1.257 ± 0.625
TrpHis: 0.419 ± 0.208
TrpIle: 0.419 ± 0.208
TrpLys: 0.419 ± 0.208
TrpLeu: 1.257 ± 0.625
TrpMet: 0.0 ± 0.0
TrpAsn: 0.838 ± 0.417
TrpPro: 0.419 ± 0.208
TrpGln: 0.838 ± 1.22
TrpArg: 0.838 ± 0.417
TrpSer: 1.257 ± 0.625
TrpThr: 0.838 ± 0.417
TrpVal: 0.419 ± 0.208
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.419 ± 1.211
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.095 ± 2.011
TyrCys: 1.257 ± 0.625
TyrAsp: 2.933 ± 1.458
TyrGlu: 0.838 ± 0.417
TyrPhe: 0.838 ± 2.076
TyrGly: 1.257 ± 0.625
TyrHis: 0.838 ± 0.417
TyrIle: 1.676 ± 0.833
TyrLys: 3.351 ± 1.814
TyrLeu: 3.351 ± 1.038
TyrMet: 0.419 ± 0.208
TyrAsn: 3.351 ± 0.909
TyrPro: 2.095 ± 1.042
TyrGln: 2.095 ± 0.965
TyrArg: 1.676 ± 0.833
TyrSer: 2.933 ± 0.997
TyrThr: 0.419 ± 1.211
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.838 ± 1.068
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.419 ± 1.211
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.419 ± 0.208
XaaLys: 0.0 ± 0.0
XaaLeu: 0.419 ± 1.211
XaaMet: 0.0 ± 0.0
XaaAsn: 0.419 ± 1.211
XaaPro: 0.419 ± 0.208
XaaGln: 0.0 ± 0.0
XaaArg: 0.419 ± 0.208
XaaSer: 0.0 ± 0.0
XaaThr: 0.419 ± 0.208
XaaVal: 0.419 ± 0.208
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2388 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
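As a rough illustration of what a protein-level bootstrap could look like, the sketch below resamples whole proteins with replacement 100 times and reports the spread of the resulting dipeptide frequency. Several points are assumptions rather than the database's documented procedure: the use of the population standard deviation as the error, the one-letter toy sequences, and the hypothetical helper names `pair_freq` and `bootstrap_error`.

```python
import random
import statistics
from collections import Counter


def pair_freq(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs in all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resampling unit is the protein, not the individual residue or pair.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_freq(sample, pair))
    # Assumption: the error is summarised as a standard deviation.
    return statistics.pstdev(estimates)


# Made-up example sequences (assumption, for illustration only).
toy = ["MKKLA", "AKKL", "MLLAK", "KKKA"]
print(pair_freq(toy, "KK"), bootstrap_error(toy, "KK"))
```

With only a handful of proteins, as in a small viral proteome, each resample can differ strongly from the original set, which is consistent with errors that are large relative to the frequencies themselves.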

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski