Amino acid dipepetide frequency for Chickpea yellow dwarf virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.015AlaAla: 1.015 ± 0.716
2.03AlaCys: 2.03 ± 1.941
2.03AlaAsp: 2.03 ± 0.918
3.046AlaGlu: 3.046 ± 0.519
0.0AlaPhe: 0.0 ± 0.0
3.046AlaGly: 3.046 ± 2.904
1.015AlaHis: 1.015 ± 0.971
5.076AlaIle: 5.076 ± 1.048
6.091AlaLys: 6.091 ± 2.537
5.076AlaLeu: 5.076 ± 3.98
0.0AlaMet: 0.0 ± 0.0
0.0AlaAsn: 0.0 ± 0.0
3.046AlaPro: 3.046 ± 1.701
0.0AlaGln: 0.0 ± 0.0
1.015AlaArg: 1.015 ± 0.716
5.076AlaSer: 5.076 ± 1.481
9.137AlaThr: 9.137 ± 2.695
1.015AlaVal: 1.015 ± 0.971
0.0AlaTrp: 0.0 ± 0.0
2.03AlaTyr: 2.03 ± 1.63
0.0AlaXaa: 0.0 ± 0.0
Cys
1.015CysAla: 1.015 ± 0.971
0.0CysCys: 0.0 ± 0.0
2.03CysAsp: 2.03 ± 0.867
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.015CysGly: 1.015 ± 0.716
2.03CysHis: 2.03 ± 0.867
1.015CysIle: 1.015 ± 0.716
1.015CysLys: 1.015 ± 0.716
2.03CysLeu: 2.03 ± 0.867
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.03CysPro: 2.03 ± 0.898
0.0CysGln: 0.0 ± 0.0
1.015CysArg: 1.015 ± 0.971
1.015CysSer: 1.015 ± 0.971
0.0CysThr: 0.0 ± 0.0
1.015CysVal: 1.015 ± 0.971
1.015CysTrp: 1.015 ± 0.971
1.015CysTyr: 1.015 ± 1.425
0.0CysXaa: 0.0 ± 0.0
Asp
1.015AspAla: 1.015 ± 1.425
0.0AspCys: 0.0 ± 0.0
2.03AspAsp: 2.03 ± 0.867
5.076AspGlu: 5.076 ± 2.165
8.122AspPhe: 8.122 ± 3.51
4.061AspGly: 4.061 ± 1.734
0.0AspHis: 0.0 ± 0.0
6.091AspIle: 6.091 ± 1.461
4.061AspLys: 4.061 ± 1.292
4.061AspLeu: 4.061 ± 1.667
0.0AspMet: 0.0 ± 0.0
0.0AspAsn: 0.0 ± 0.0
6.091AspPro: 6.091 ± 2.054
2.03AspGln: 2.03 ± 0.918
0.0AspArg: 0.0 ± 0.0
2.03AspSer: 2.03 ± 0.867
2.03AspThr: 2.03 ± 0.918
1.015AspVal: 1.015 ± 0.716
2.03AspTrp: 2.03 ± 0.898
2.03AspTyr: 2.03 ± 0.918
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
0.0GluCys: 0.0 ± 0.0
5.076GluAsp: 5.076 ± 2.277
5.076GluGlu: 5.076 ± 1.048
4.061GluPhe: 4.061 ± 1.734
4.061GluGly: 4.061 ± 1.734
0.0GluHis: 0.0 ± 0.0
3.046GluIle: 3.046 ± 1.317
0.0GluLys: 0.0 ± 0.0
4.061GluLeu: 4.061 ± 1.292
2.03GluMet: 2.03 ± 0.867
4.061GluAsn: 4.061 ± 0.968
3.046GluPro: 3.046 ± 1.36
1.015GluGln: 1.015 ± 0.716
3.046GluArg: 3.046 ± 1.532
1.015GluSer: 1.015 ± 0.716
3.046GluThr: 3.046 ± 1.317
3.046GluVal: 3.046 ± 1.418
4.061GluTrp: 4.061 ± 1.734
8.122GluTyr: 8.122 ± 2.248
0.0GluXaa: 0.0 ± 0.0
Phe
1.015PheAla: 1.015 ± 1.425
1.015PheCys: 1.015 ± 0.716
3.046PheAsp: 3.046 ± 0.519
2.03PheGlu: 2.03 ± 0.867
5.076PhePhe: 5.076 ± 1.048
1.015PheGly: 1.015 ± 1.425
0.0PheHis: 0.0 ± 0.0
3.046PheIle: 3.046 ± 1.36
3.046PheLys: 3.046 ± 1.728
6.091PheLeu: 6.091 ± 2.6
0.0PheMet: 0.0 ± 0.0
2.03PheAsn: 2.03 ± 1.429
7.107PhePro: 7.107 ± 2.377
1.015PheGln: 1.015 ± 0.971
6.091PheArg: 6.091 ± 1.037
1.015PheSer: 1.015 ± 0.971
3.046PheThr: 3.046 ± 1.302
3.046PheVal: 3.046 ± 2.904
2.03PheTrp: 2.03 ± 1.941
2.03PheTyr: 2.03 ± 0.898
0.0PheXaa: 0.0 ± 0.0
Gly
2.03GlyAla: 2.03 ± 1.941
1.015GlyCys: 1.015 ± 0.971
2.03GlyAsp: 2.03 ± 0.867
5.076GlyGlu: 5.076 ± 2.38
1.015GlyPhe: 1.015 ± 1.425
6.091GlyGly: 6.091 ± 3.455
0.0GlyHis: 0.0 ± 0.0
1.015GlyIle: 1.015 ± 0.716
4.061GlyLys: 4.061 ± 1.796
0.0GlyLeu: 0.0 ± 0.0
1.015GlyMet: 1.015 ± 0.971
7.107GlyAsn: 7.107 ± 0.618
5.076GlyPro: 5.076 ± 2.226
0.0GlyGln: 0.0 ± 0.0
3.046GlyArg: 3.046 ± 1.422
7.107GlySer: 7.107 ± 1.381
4.061GlyThr: 4.061 ± 3.092
4.061GlyVal: 4.061 ± 1.971
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
4.061HisCys: 4.061 ± 1.734
0.0HisAsp: 0.0 ± 0.0
4.061HisGlu: 4.061 ± 1.734
2.03HisPhe: 2.03 ± 1.941
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
2.03HisIle: 2.03 ± 0.898
0.0HisLys: 0.0 ± 0.0
3.046HisLeu: 3.046 ± 1.532
2.03HisMet: 2.03 ± 0.93
2.03HisAsn: 2.03 ± 1.429
3.046HisPro: 3.046 ± 1.532
2.03HisGln: 2.03 ± 0.867
2.03HisArg: 2.03 ± 0.867
3.046HisSer: 3.046 ± 0.519
2.03HisThr: 2.03 ± 1.647
0.0HisVal: 0.0 ± 0.0
3.046HisTrp: 3.046 ± 0.519
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.076IleAla: 5.076 ± 2.165
1.015IleCys: 1.015 ± 0.716
1.015IleAsp: 1.015 ± 0.716
2.03IleGlu: 2.03 ± 0.867
4.061IlePhe: 4.061 ± 1.292
1.015IleGly: 1.015 ± 1.425
1.015IleHis: 1.015 ± 0.716
7.107IleIle: 7.107 ± 2.048
4.061IleLys: 4.061 ± 1.292
3.046IleLeu: 3.046 ± 2.904
1.015IleMet: 1.015 ± 0.716
0.0IleAsn: 0.0 ± 0.0
2.03IlePro: 2.03 ± 1.431
6.091IleGln: 6.091 ± 2.6
2.03IleArg: 2.03 ± 0.867
4.061IleSer: 4.061 ± 1.667
4.061IleThr: 4.061 ± 1.734
2.03IleVal: 2.03 ± 1.429
1.015IleTrp: 1.015 ± 0.823
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.046LysAla: 3.046 ± 2.273
1.015LysCys: 1.015 ± 0.971
5.076LysAsp: 5.076 ± 0.942
3.046LysGlu: 3.046 ± 0.519
2.03LysPhe: 2.03 ± 0.898
2.03LysGly: 2.03 ± 0.898
2.03LysHis: 2.03 ± 1.941
1.015LysIle: 1.015 ± 0.716
6.091LysLys: 6.091 ± 2.594
6.091LysLeu: 6.091 ± 2.6
1.015LysMet: 1.015 ± 0.971
5.076LysAsn: 5.076 ± 2.226
0.0LysPro: 0.0 ± 0.0
6.091LysGln: 6.091 ± 2.031
4.061LysArg: 4.061 ± 0.848
4.061LysSer: 4.061 ± 1.292
2.03LysThr: 2.03 ± 0.898
2.03LysVal: 2.03 ± 1.63
0.0LysTrp: 0.0 ± 0.0
6.091LysTyr: 6.091 ± 1.152
0.0LysXaa: 0.0 ± 0.0
Leu
4.061LeuAla: 4.061 ± 0.968
0.0LeuCys: 0.0 ± 0.0
3.046LeuAsp: 3.046 ± 0.519
9.137LeuGlu: 9.137 ± 2.695
3.046LeuPhe: 3.046 ± 1.317
3.046LeuGly: 3.046 ± 1.728
10.152LeuHis: 10.152 ± 4.698
6.091LeuIle: 6.091 ± 3.978
1.015LeuLys: 1.015 ± 1.425
3.046LeuLeu: 3.046 ± 2.775
0.0LeuMet: 0.0 ± 0.0
3.046LeuAsn: 3.046 ± 1.36
2.03LeuPro: 2.03 ± 1.63
4.061LeuGln: 4.061 ± 0.848
3.046LeuArg: 3.046 ± 1.36
1.015LeuSer: 1.015 ± 0.823
4.061LeuThr: 4.061 ± 1.734
4.061LeuVal: 4.061 ± 2.655
2.03LeuTrp: 2.03 ± 0.867
5.076LeuTyr: 5.076 ± 1.214
0.0LeuXaa: 0.0 ± 0.0
Met
1.015MetAla: 1.015 ± 0.971
0.0MetCys: 0.0 ± 0.0
1.015MetAsp: 1.015 ± 0.823
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
3.046MetIle: 3.046 ± 0.519
0.0MetLys: 0.0 ± 0.0
1.015MetLeu: 1.015 ± 1.425
0.0MetMet: 0.0 ± 0.0
1.015MetAsn: 1.015 ± 0.971
3.046MetPro: 3.046 ± 0.519
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
2.03MetSer: 2.03 ± 1.431
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
2.03MetTyr: 2.03 ± 1.941
0.0MetXaa: 0.0 ± 0.0
Asn
2.03AsnAla: 2.03 ± 0.898
2.03AsnCys: 2.03 ± 0.898
1.015AsnAsp: 1.015 ± 0.716
1.015AsnGlu: 1.015 ± 0.971
0.0AsnPhe: 0.0 ± 0.0
3.046AsnGly: 3.046 ± 1.728
4.061AsnHis: 4.061 ± 1.734
4.061AsnIle: 4.061 ± 1.734
2.03AsnLys: 2.03 ± 1.63
2.03AsnLeu: 2.03 ± 0.867
0.0AsnMet: 0.0 ± 0.0
6.091AsnAsn: 6.091 ± 2.6
3.046AsnPro: 3.046 ± 0.519
3.046AsnGln: 3.046 ± 2.912
2.03AsnArg: 2.03 ± 0.867
9.137AsnSer: 9.137 ± 2.652
3.046AsnThr: 3.046 ± 2.273
2.03AsnVal: 2.03 ± 0.898
1.015AsnTrp: 1.015 ± 0.716
2.03AsnTyr: 2.03 ± 1.429
0.0AsnXaa: 0.0 ± 0.0
Pro
6.091ProAla: 6.091 ± 2.235
1.015ProCys: 1.015 ± 0.716
4.061ProAsp: 4.061 ± 1.734
3.046ProGlu: 3.046 ± 1.532
4.061ProPhe: 4.061 ± 0.848
7.107ProGly: 7.107 ± 5.34
7.107ProHis: 7.107 ± 3.192
0.0ProIle: 0.0 ± 0.0
1.015ProLys: 1.015 ± 0.716
0.0ProLeu: 0.0 ± 0.0
0.0ProMet: 0.0 ± 0.0
5.076ProAsn: 5.076 ± 1.715
2.03ProPro: 2.03 ± 0.867
1.015ProGln: 1.015 ± 0.971
3.046ProArg: 3.046 ± 0.519
6.091ProSer: 6.091 ± 1.683
7.107ProThr: 7.107 ± 2.32
4.061ProVal: 4.061 ± 2.612
0.0ProTrp: 0.0 ± 0.0
2.03ProTyr: 2.03 ± 0.867
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.0GlnCys: 0.0 ± 0.0
5.076GlnAsp: 5.076 ± 2.226
4.061GlnGlu: 4.061 ± 2.603
0.0GlnPhe: 0.0 ± 0.0
1.015GlnGly: 1.015 ± 0.971
1.015GlnHis: 1.015 ± 0.823
0.0GlnIle: 0.0 ± 0.0
2.03GlnLys: 2.03 ± 0.898
6.091GlnLeu: 6.091 ± 2.72
1.015GlnMet: 1.015 ± 0.81
0.0GlnAsn: 0.0 ± 0.0
2.03GlnPro: 2.03 ± 0.867
0.0GlnGln: 0.0 ± 0.0
2.03GlnArg: 2.03 ± 1.429
2.03GlnSer: 2.03 ± 0.867
5.076GlnThr: 5.076 ± 1.048
5.076GlnVal: 5.076 ± 1.099
1.015GlnTrp: 1.015 ± 0.971
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
0.0ArgAla: 0.0 ± 0.0
2.03ArgCys: 2.03 ± 0.867
6.091ArgAsp: 6.091 ± 2.6
1.015ArgGlu: 1.015 ± 0.716
2.03ArgPhe: 2.03 ± 1.941
2.03ArgGly: 2.03 ± 0.867
2.03ArgHis: 2.03 ± 0.898
0.0ArgIle: 0.0 ± 0.0
1.015ArgLys: 1.015 ± 0.971
3.046ArgLeu: 3.046 ± 0.519
1.015ArgMet: 1.015 ± 0.971
4.061ArgAsn: 4.061 ± 1.043
3.046ArgPro: 3.046 ± 1.418
1.015ArgGln: 1.015 ± 1.425
6.091ArgArg: 6.091 ± 1.365
3.046ArgSer: 3.046 ± 1.302
7.107ArgThr: 7.107 ± 1.121
4.061ArgVal: 4.061 ± 0.848
2.03ArgTrp: 2.03 ± 0.867
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.091SerAla: 6.091 ± 1.683
0.0SerCys: 0.0 ± 0.0
3.046SerAsp: 3.046 ± 1.36
1.015SerGlu: 1.015 ± 0.716
4.061SerPhe: 4.061 ± 1.734
6.091SerGly: 6.091 ± 3.587
2.03SerHis: 2.03 ± 1.429
1.015SerIle: 1.015 ± 0.823
10.152SerLys: 10.152 ± 4.334
6.091SerLeu: 6.091 ± 2.6
1.015SerMet: 1.015 ± 0.883
4.061SerAsn: 4.061 ± 2.612
7.107SerPro: 7.107 ± 3.192
2.03SerGln: 2.03 ± 1.431
5.076SerArg: 5.076 ± 2.763
10.152SerSer: 10.152 ± 4.285
7.107SerThr: 7.107 ± 3.417
3.046SerVal: 3.046 ± 1.418
1.015SerTrp: 1.015 ± 0.716
3.046SerTyr: 3.046 ± 2.775
0.0SerXaa: 0.0 ± 0.0
Thr
6.091ThrAla: 6.091 ± 1.683
0.0ThrCys: 0.0 ± 0.0
2.03ThrAsp: 2.03 ± 1.941
7.107ThrGlu: 7.107 ± 2.868
4.061ThrPhe: 4.061 ± 2.701
3.046ThrGly: 3.046 ± 0.519
0.0ThrHis: 0.0 ± 0.0
3.046ThrIle: 3.046 ± 0.519
6.091ThrLys: 6.091 ± 2.6
2.03ThrLeu: 2.03 ± 1.941
1.015ThrMet: 1.015 ± 0.971
4.061ThrAsn: 4.061 ± 1.292
3.046ThrPro: 3.046 ± 2.904
2.03ThrGln: 2.03 ± 0.918
3.046ThrArg: 3.046 ± 1.532
11.168ThrSer: 11.168 ± 5.031
9.137ThrThr: 9.137 ± 3.562
3.046ThrVal: 3.046 ± 1.422
1.015ThrTrp: 1.015 ± 0.971
6.091ThrTyr: 6.091 ± 1.517
0.0ThrXaa: 0.0 ± 0.0
Val
5.076ValAla: 5.076 ± 3.98
2.03ValCys: 2.03 ± 1.63
2.03ValAsp: 2.03 ± 0.867
1.015ValGlu: 1.015 ± 0.716
5.076ValPhe: 5.076 ± 2.38
5.076ValGly: 5.076 ± 1.423
1.015ValHis: 1.015 ± 0.971
2.03ValIle: 2.03 ± 1.429
4.061ValLys: 4.061 ± 1.971
3.046ValLeu: 3.046 ± 1.317
1.015ValMet: 1.015 ± 1.135
4.061ValAsn: 4.061 ± 1.9
3.046ValPro: 3.046 ± 1.418
0.0ValGln: 0.0 ± 0.0
2.03ValArg: 2.03 ± 1.941
5.076ValSer: 5.076 ± 1.308
1.015ValThr: 1.015 ± 0.971
4.061ValVal: 4.061 ± 1.836
1.015ValTrp: 1.015 ± 0.823
1.015ValTyr: 1.015 ± 0.971
0.0ValXaa: 0.0 ± 0.0
Trp
3.046TrpAla: 3.046 ± 1.36
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.015TrpGlu: 1.015 ± 0.823
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
1.015TrpHis: 1.015 ± 0.971
0.0TrpIle: 0.0 ± 0.0
3.046TrpLys: 3.046 ± 1.728
6.091TrpLeu: 6.091 ± 1.461
1.015TrpMet: 1.015 ± 0.823
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
2.03TrpArg: 2.03 ± 0.867
2.03TrpSer: 2.03 ± 0.867
2.03TrpThr: 2.03 ± 1.941
1.015TrpVal: 1.015 ± 0.971
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.03TyrAla: 2.03 ± 1.941
0.0TyrCys: 0.0 ± 0.0
2.03TyrAsp: 2.03 ± 0.898
0.0TyrGlu: 0.0 ± 0.0
4.061TyrPhe: 4.061 ± 1.796
1.015TyrGly: 1.015 ± 0.716
0.0TyrHis: 0.0 ± 0.0
3.046TyrIle: 3.046 ± 1.36
3.046TyrLys: 3.046 ± 2.912
5.076TyrLeu: 5.076 ± 0.942
0.0TyrMet: 0.0 ± 0.0
1.015TyrAsn: 1.015 ± 0.716
4.061TyrPro: 4.061 ± 1.292
6.091TyrGln: 6.091 ± 2.321
0.0TyrArg: 0.0 ± 0.0
4.061TyrSer: 4.061 ± 0.968
2.03TyrThr: 2.03 ± 1.469
5.076TyrVal: 5.076 ± 1.715
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (986 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski