Amino acid dipeptide frequency for Citrus chlorotic dwarf associated virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
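As an illustration, a dipeptide frequency can be computed by counting overlapping pairs of adjacent residues. The sketch below is a minimal example, not the procedure used to build this page: the function name `dipeptide_permille`, the toy sequences, and the choice not to count pairs across protein boundaries are all assumptions. It uses one-letter codes, with `X` standing in for Xaa.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from this proteome.
toy = ["MKKA", "MKU"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Here the non-standard residue `U` is folded into `X`, so the second toy sequence contributes the pairs `MK` and `KX`.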

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
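The arithmetic behind the expected value, and the over/under comparison, can be sketched as follows. This is only an illustration: the helper `classify` is hypothetical, and it assumes a plain comparison against the uniform expectation that ignores the reported error. The page does not state the exact rule used for colouring. The three example values are taken from the table.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

n_pairs = N_STANDARD ** 2        # 400 ordered pairs
n_pairs_xaa = N_WITH_XAA ** 2    # 441 ordered pairs when Xaa is included

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected_permille = 1000.0 / n_pairs  # 2.5 per mille, i.e. 0.25%

def classify(observed_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (hypothetical helper)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Values copied from the table (in per mille).
examples = {"LysSer": 9.074, "AlaPro": 0.0, "GluGlu": 2.722}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```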

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.63 ± 1.456
AlaCys: 0.907 ± 0.771
AlaAsp: 3.63 ± 1.367
AlaGlu: 3.63 ± 2.125
AlaPhe: 6.352 ± 1.604
AlaGly: 1.815 ± 0.792
AlaHis: 0.907 ± 0.771
AlaIle: 3.63 ± 1.406
AlaLys: 3.63 ± 1.113
AlaLeu: 4.537 ± 2.211
AlaMet: 0.907 ± 0.864
AlaAsn: 1.815 ± 1.375
AlaPro: 0.0 ± 0.0
AlaGln: 7.26 ± 2.136
AlaArg: 7.26 ± 1.325
AlaSer: 5.445 ± 1.614
AlaThr: 1.815 ± 0.832
AlaVal: 1.815 ± 0.792
AlaTrp: 0.907 ± 0.915
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.907 ± 0.769
CysAsp: 0.907 ± 0.769
CysGlu: 1.815 ± 1.043
CysPhe: 0.0 ± 0.0
CysGly: 2.722 ± 1.147
CysHis: 0.907 ± 0.879
CysIle: 1.815 ± 1.043
CysLys: 2.722 ± 1.408
CysLeu: 0.0 ± 0.0
CysMet: 0.907 ± 0.687
CysAsn: 0.907 ± 0.879
CysPro: 2.722 ± 2.306
CysGln: 1.815 ± 1.375
CysArg: 3.63 ± 1.696
CysSer: 2.722 ± 0.999
CysThr: 0.907 ± 0.769
CysVal: 4.537 ± 1.345
CysTrp: 0.907 ± 0.879
CysTyr: 0.907 ± 0.769
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.722 ± 1.524
AspCys: 1.815 ± 1.758
AspAsp: 1.815 ± 1.187
AspGlu: 6.352 ± 1.575
AspPhe: 2.722 ± 0.585
AspGly: 3.63 ± 1.093
AspHis: 1.815 ± 0.966
AspIle: 0.907 ± 0.915
AspLys: 1.815 ± 1.036
AspLeu: 5.445 ± 1.022
AspMet: 1.815 ± 1.187
AspAsn: 1.815 ± 0.824
AspPro: 6.352 ± 1.999
AspGln: 2.722 ± 1.438
AspArg: 5.445 ± 2.638
AspSer: 4.537 ± 1.597
AspThr: 0.0 ± 0.0
AspVal: 5.445 ± 2.13
AspTrp: 2.722 ± 1.919
AspTyr: 3.63 ± 1.292
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.537 ± 1.889
GluCys: 0.907 ± 0.769
GluAsp: 1.815 ± 0.792
GluGlu: 2.722 ± 1.608
GluPhe: 0.907 ± 0.769
GluGly: 4.537 ± 0.929
GluHis: 2.722 ± 2.306
GluIle: 2.722 ± 1.268
GluLys: 4.537 ± 2.475
GluLeu: 6.352 ± 2.059
GluMet: 0.907 ± 0.879
GluAsn: 0.0 ± 0.0
GluPro: 3.63 ± 1.581
GluGln: 0.907 ± 0.879
GluArg: 4.537 ± 1.928
GluSer: 3.63 ± 1.456
GluThr: 2.722 ± 1.405
GluVal: 3.63 ± 1.233
GluTrp: 1.815 ± 1.036
GluTyr: 2.722 ± 1.524
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.815 ± 0.824
PheCys: 0.907 ± 0.687
PheAsp: 0.907 ± 0.769
PheGlu: 0.907 ± 0.769
PhePhe: 0.907 ± 0.769
PheGly: 0.907 ± 0.879
PheHis: 1.815 ± 1.074
PheIle: 2.722 ± 1.408
PheLys: 1.815 ± 0.792
PheLeu: 3.63 ± 3.074
PheMet: 0.907 ± 0.687
PheAsn: 2.722 ± 0.585
PhePro: 2.722 ± 1.309
PheGln: 0.0 ± 0.0
PheArg: 2.722 ± 0.585
PheSer: 2.722 ± 0.585
PheThr: 2.722 ± 1.628
PheVal: 2.722 ± 1.408
PheTrp: 0.907 ± 0.769
PheTyr: 1.815 ± 1.187
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.722 ± 1.315
GlyCys: 1.815 ± 1.065
GlyAsp: 6.352 ± 2.133
GlyGlu: 2.722 ± 1.524
GlyPhe: 0.907 ± 0.687
GlyGly: 4.537 ± 1.518
GlyHis: 0.907 ± 0.879
GlyIle: 3.63 ± 1.292
GlyLys: 3.63 ± 1.804
GlyLeu: 2.722 ± 1.147
GlyMet: 1.815 ± 0.792
GlyAsn: 3.63 ± 1.696
GlyPro: 7.26 ± 0.999
GlyGln: 3.63 ± 1.585
GlyArg: 3.63 ± 1.421
GlySer: 6.352 ± 2.3
GlyThr: 1.815 ± 1.541
GlyVal: 1.815 ± 1.065
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.815 ± 1.074
HisCys: 0.907 ± 0.687
HisAsp: 2.722 ± 1.032
HisGlu: 2.722 ± 1.11
HisPhe: 1.815 ± 0.824
HisGly: 0.907 ± 0.771
HisHis: 0.907 ± 0.769
HisIle: 0.0 ± 0.0
HisLys: 1.815 ± 1.036
HisLeu: 1.815 ± 1.537
HisMet: 0.0 ± 0.0
HisAsn: 0.907 ± 0.915
HisPro: 0.907 ± 0.769
HisGln: 0.0 ± 0.0
HisArg: 0.907 ± 0.769
HisSer: 0.907 ± 0.687
HisThr: 2.722 ± 2.312
HisVal: 1.815 ± 0.824
HisTrp: 0.0 ± 0.0
HisTyr: 1.815 ± 1.179
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.815 ± 1.541
IleCys: 2.722 ± 1.919
IleAsp: 3.63 ± 1.292
IleGlu: 2.722 ± 1.309
IlePhe: 3.63 ± 1.665
IleGly: 2.722 ± 2.062
IleHis: 0.0 ± 0.0
IleIle: 2.722 ± 1.032
IleLys: 5.445 ± 3.526
IleLeu: 3.63 ± 1.367
IleMet: 0.0 ± 0.0
IleAsn: 2.722 ± 0.585
IlePro: 2.722 ± 0.93
IleGln: 2.722 ± 1.11
IleArg: 6.352 ± 2.267
IleSer: 3.63 ± 0.711
IleThr: 2.722 ± 1.11
IleVal: 0.0 ± 0.0
IleTrp: 2.722 ± 1.639
IleTyr: 1.815 ± 1.541
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.63 ± 0.968
LysCys: 1.815 ± 1.043
LysAsp: 5.445 ± 1.209
LysGlu: 5.445 ± 2.294
LysPhe: 1.815 ± 0.966
LysGly: 6.352 ± 1.895
LysHis: 0.0 ± 0.0
LysIle: 6.352 ± 2.806
LysLys: 4.537 ± 0.929
LysLeu: 3.63 ± 1.921
LysMet: 0.0 ± 0.0
LysAsn: 1.815 ± 1.074
LysPro: 4.537 ± 1.417
LysGln: 0.907 ± 0.769
LysArg: 1.815 ± 0.832
LysSer: 9.074 ± 3.317
LysThr: 2.722 ± 1.779
LysVal: 1.815 ± 1.065
LysTrp: 0.0 ± 0.0
LysTyr: 1.815 ± 1.074
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.537 ± 1.574
LeuCys: 2.722 ± 1.438
LeuAsp: 2.722 ± 2.312
LeuGlu: 1.815 ± 0.824
LeuPhe: 1.815 ± 1.065
LeuGly: 4.537 ± 1.284
LeuHis: 4.537 ± 1.735
LeuIle: 1.815 ± 1.758
LeuLys: 4.537 ± 1.918
LeuLeu: 1.815 ± 1.537
LeuMet: 4.537 ± 1.693
LeuAsn: 5.445 ± 3.238
LeuPro: 2.722 ± 0.98
LeuGln: 3.63 ± 2.153
LeuArg: 4.537 ± 2.896
LeuSer: 6.352 ± 1.879
LeuThr: 3.63 ± 1.406
LeuVal: 4.537 ± 0.85
LeuTrp: 0.907 ± 0.769
LeuTyr: 2.722 ± 1.974
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.907 ± 0.769
MetCys: 1.815 ± 1.043
MetAsp: 4.537 ± 1.845
MetGlu: 1.815 ± 1.036
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 1.815 ± 0.966
MetIle: 2.722 ± 1.408
MetLys: 3.63 ± 1.233
MetLeu: 0.907 ± 0.879
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.907 ± 0.687
MetArg: 0.0 ± 0.0
MetSer: 2.722 ± 1.315
MetThr: 0.907 ± 0.771
MetVal: 1.815 ± 1.065
MetTrp: 0.907 ± 0.771
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.537 ± 1.575
AsnCys: 0.0 ± 0.0
AsnAsp: 2.722 ± 1.147
AsnGlu: 0.907 ± 0.687
AsnPhe: 0.0 ± 0.0
AsnGly: 1.815 ± 1.065
AsnHis: 0.0 ± 0.0
AsnIle: 4.537 ± 2.043
AsnLys: 0.907 ± 0.687
AsnLeu: 5.445 ± 1.773
AsnMet: 0.907 ± 0.634
AsnAsn: 0.907 ± 0.771
AsnPro: 3.63 ± 1.628
AsnGln: 2.722 ± 0.93
AsnArg: 0.0 ± 0.373
AsnSer: 6.352 ± 2.897
AsnThr: 0.0 ± 0.0
AsnVal: 2.722 ± 1.752
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.63 ± 1.406
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.537 ± 2.101
ProCys: 3.63 ± 1.402
ProAsp: 4.537 ± 3.517
ProGlu: 2.722 ± 1.608
ProPhe: 1.815 ± 1.537
ProGly: 4.537 ± 1.798
ProHis: 0.907 ± 0.769
ProIle: 2.722 ± 1.309
ProLys: 5.445 ± 0.979
ProLeu: 1.815 ± 0.792
ProMet: 0.907 ± 0.771
ProAsn: 1.815 ± 1.537
ProPro: 1.815 ± 0.832
ProGln: 1.815 ± 0.824
ProArg: 5.445 ± 2.542
ProSer: 6.352 ± 2.256
ProThr: 2.722 ± 1.841
ProVal: 2.722 ± 0.999
ProTrp: 0.907 ± 0.687
ProTyr: 0.907 ± 0.687
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.445 ± 1.773
GlnCys: 1.815 ± 0.824
GlnAsp: 3.63 ± 2.073
GlnGlu: 3.63 ± 1.803
GlnPhe: 0.907 ± 0.687
GlnGly: 0.0 ± 0.0
GlnHis: 0.907 ± 0.769
GlnIle: 2.722 ± 1.268
GlnLys: 1.815 ± 1.074
GlnLeu: 0.907 ± 0.687
GlnMet: 0.0 ± 0.0
GlnAsn: 3.63 ± 1.495
GlnPro: 0.0 ± 0.0
GlnGln: 1.815 ± 0.824
GlnArg: 1.815 ± 1.065
GlnSer: 0.0 ± 0.0
GlnThr: 1.815 ± 0.832
GlnVal: 2.722 ± 1.524
GlnTrp: 0.907 ± 0.769
GlnTyr: 2.722 ± 1.11
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.815 ± 1.074
ArgCys: 2.722 ± 1.794
ArgAsp: 4.537 ± 1.03
ArgGlu: 4.537 ± 2.038
ArgPhe: 4.537 ± 2.842
ArgGly: 5.445 ± 2.377
ArgHis: 0.0 ± 0.0
ArgIle: 1.815 ± 1.036
ArgLys: 4.537 ± 2.065
ArgLeu: 5.445 ± 1.049
ArgMet: 1.815 ± 0.792
ArgAsn: 0.907 ± 0.769
ArgPro: 6.352 ± 2.267
ArgGln: 0.907 ± 0.771
ArgArg: 9.074 ± 2.887
ArgSer: 7.26 ± 1.822
ArgThr: 1.815 ± 1.179
ArgVal: 7.26 ± 1.705
ArgTrp: 1.815 ± 0.832
ArgTyr: 2.722 ± 1.794
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.445 ± 1.773
SerCys: 1.815 ± 0.824
SerAsp: 4.537 ± 1.798
SerGlu: 2.722 ± 1.268
SerPhe: 2.722 ± 2.306
SerGly: 6.352 ± 0.766
SerHis: 2.722 ± 1.032
SerIle: 1.815 ± 0.966
SerLys: 4.537 ± 0.85
SerLeu: 7.26 ± 2.588
SerMet: 2.722 ± 1.524
SerAsn: 5.445 ± 2.377
SerPro: 4.537 ± 2.066
SerGln: 0.907 ± 0.915
SerArg: 5.445 ± 1.209
SerSer: 7.26 ± 2.004
SerThr: 6.352 ± 1.445
SerVal: 9.074 ± 2.72
SerTrp: 1.815 ± 1.375
SerTyr: 0.907 ± 0.687
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.815 ± 1.187
ThrCys: 0.0 ± 0.0
ThrAsp: 1.815 ± 1.074
ThrGlu: 2.722 ± 1.405
ThrPhe: 0.907 ± 0.687
ThrGly: 1.815 ± 1.074
ThrHis: 0.907 ± 0.771
ThrIle: 5.445 ± 2.07
ThrLys: 2.722 ± 1.11
ThrLeu: 1.815 ± 0.832
ThrMet: 0.907 ± 0.771
ThrAsn: 0.0 ± 0.0
ThrPro: 0.907 ± 0.769
ThrGln: 0.907 ± 0.769
ThrArg: 2.722 ± 1.268
ThrSer: 4.537 ± 1.265
ThrThr: 1.815 ± 0.792
ThrVal: 5.445 ± 2.93
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.722 ± 1.628
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.63 ± 1.453
ValCys: 0.907 ± 0.769
ValAsp: 4.537 ± 2.867
ValGlu: 5.445 ± 1.959
ValPhe: 1.815 ± 1.187
ValGly: 5.445 ± 2.691
ValHis: 0.907 ± 0.687
ValIle: 4.537 ± 1.793
ValLys: 4.537 ± 1.162
ValLeu: 9.074 ± 3.122
ValMet: 1.815 ± 0.985
ValAsn: 4.537 ± 1.763
ValPro: 3.63 ± 1.585
ValGln: 1.815 ± 1.065
ValArg: 4.537 ± 2.232
ValSer: 2.722 ± 1.408
ValThr: 0.907 ± 0.771
ValVal: 2.722 ± 2.062
ValTrp: 0.0 ± 0.0
ValTyr: 0.907 ± 0.687
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.63 ± 0.711
TrpCys: 1.815 ± 0.824
TrpAsp: 0.907 ± 0.879
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.907 ± 0.879
TrpHis: 0.907 ± 0.687
TrpIle: 0.907 ± 0.771
TrpLys: 0.0 ± 0.0
TrpLeu: 1.815 ± 1.187
TrpMet: 0.0 ± 0.0
TrpAsn: 0.907 ± 0.915
TrpPro: 1.815 ± 0.832
TrpGln: 0.907 ± 0.687
TrpArg: 1.815 ± 1.043
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.907 ± 0.879
TrpTrp: 0.907 ± 0.915
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.815 ± 1.065
TyrCys: 1.815 ± 1.043
TyrAsp: 1.815 ± 0.792
TyrGlu: 0.907 ± 0.769
TyrPhe: 2.722 ± 1.032
TyrGly: 0.907 ± 0.771
TyrHis: 1.815 ± 0.792
TyrIle: 0.907 ± 0.915
TyrLys: 0.907 ± 0.687
TyrLeu: 1.815 ± 1.043
TyrMet: 3.63 ± 2.237
TyrAsn: 1.815 ± 1.831
TyrPro: 1.815 ± 0.824
TyrGln: 0.907 ± 0.879
TyrArg: 3.63 ± 1.47
TyrSer: 1.815 ± 0.824
TyrThr: 1.815 ± 0.966
TyrVal: 0.907 ± 0.769
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.722 ± 1.268
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1103 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
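A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. This is a sketch under stated assumptions, not the code behind this page: the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all illustrative choices.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the per-mille estimate across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is taken as the standard deviation of the estimates.
    return statistics.pstdev(estimates)

# Hypothetical toy sequences in one-letter code, not from this proteome.
toy = ["MKKA", "MKAA", "AAKK"]
err = bootstrap_error(toy, "KK")
print(f"KK: {pair_permille(toy, 'KK'):.1f} ± {err:.1f}")
```

With only a handful of proteins, as here, each resample can differ substantially from the original set, which is consistent with the relatively large errors seen next to many values in the table.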

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski