Amino acid dipeptide frequency for Tomato apical leaf curl virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
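As a rough illustration of what such a frequency is, the sketch below counts overlapping pairs of adjacent residues in a few sequences and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical. The code assumes one-letter residue codes and that pairs are not counted across protein boundaries; how Proteome-pI handles these details is not stated here, so its numbers may be computed differently.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping pairs inside one protein (assumption: a pair
        # never spans the end of one protein and the start of the next).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome described on this page.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
```

For the toy input there are six pairs in total, two of which are KK, so `freqs["KK"]` is one third of 1000.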

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
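The unit conversion and the comparison against the uniform expectation can be sketched as follows. The names `per_mille_to_fraction` and `classify` are hypothetical, and the rule used here (a plain comparison with 1/400 = 2.5‰, ignoring Xaa) is an assumption; the exact threshold behind the red and blue marking is not stated on this page.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Two values taken from the table: GlyVal and AlaAla.
gly_val = classify(9.917)
ala_ala = classify(0.826)
```

With this rule GlyVal (9.917‰) comes out as overrepresented and AlaAla (0.826‰) as underrepresented.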

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid; columns: second amino acid; cells: frequency ± error (‰)

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 0.826 ± 0.69 | 0.0 ± 0.0 | 7.438 ± 1.245 | 3.306 ± 0.882 | 2.479 ± 1.48 | 1.653 ± 0.962 | 0.826 ± 0.792 | 4.132 ± 1.593 | 3.306 ± 0.77 | 3.306 ± 2.758 | 0.0 ± 0.0 | 4.132 ± 1.313 | 0.0 ± 0.0 | 4.132 ± 1.313 | 3.306 ± 1.031 | 2.479 ± 0.814 | 2.479 ± 0.71 | 2.479 ± 1.502 | 1.653 ± 1.164 | 0.826 ± 0.792 | 0.0 ± 0.0
Cys | 0.826 ± 0.781 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 1.103 | 0.0 ± 0.0 | 0.826 ± 0.616 | 0.826 ± 0.69 | 0.826 ± 0.792 | 0.826 ± 0.616 | 1.653 ± 0.772 | 0.0 ± 0.0 | 0.826 ± 0.616 | 1.653 ± 0.772 | 4.132 ± 1.908 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.479 ± 1.329 | 1.653 ± 0.772 | 0.0 ± 0.0 | 0.826 ± 0.69 | 0.0 ± 0.0
Asp | 2.479 ± 0.814 | 0.0 ± 0.0 | 1.653 ± 0.773 | 0.826 ± 0.781 | 5.785 ± 2.859 | 4.132 ± 0.961 | 0.0 ± 0.0 | 4.959 ± 1.793 | 0.826 ± 0.69 | 3.306 ± 0.882 | 0.826 ± 0.69 | 0.826 ± 0.734 | 2.479 ± 1.189 | 0.826 ± 0.69 | 2.479 ± 1.351 | 4.959 ± 1.06 | 1.653 ± 0.772 | 3.306 ± 1.314 | 1.653 ± 1.198 | 4.959 ± 1.045 | 0.0 ± 0.0
Glu | 0.826 ± 0.616 | 0.0 ± 0.0 | 3.306 ± 1.131 | 7.438 ± 2.132 | 3.306 ± 1.504 | 3.306 ± 1.229 | 0.0 ± 0.0 | 2.479 ± 0.905 | 4.132 ± 1.346 | 4.959 ± 2.238 | 0.826 ± 0.656 | 2.479 ± 0.814 | 1.653 ± 0.772 | 4.132 ± 2.106 | 3.306 ± 1.407 | 4.959 ± 2.048 | 4.132 ± 1.63 | 0.0 ± 0.0 | 0.826 ± 0.781 | 5.785 ± 1.25 | 0.0 ± 0.0
Phe | 1.653 ± 1.379 | 0.826 ± 0.616 | 2.479 ± 1.189 | 1.653 ± 1.562 | 2.479 ± 0.92 | 0.826 ± 0.616 | 0.0 ± 0.0 | 3.306 ± 0.882 | 2.479 ± 0.71 | 4.959 ± 1.839 | 0.826 ± 0.545 | 4.132 ± 2.148 | 0.0 ± 0.0 | 5.785 ± 1.28 | 2.479 ± 0.92 | 4.959 ± 1.419 | 5.785 ± 1.804 | 2.479 ± 1.074 | 0.826 ± 0.781 | 0.826 ± 0.734 | 0.0 ± 0.0
Gly | 4.959 ± 3.313 | 0.826 ± 0.781 | 0.0 ± 0.0 | 4.132 ± 1.387 | 0.826 ± 0.69 | 6.612 ± 2.032 | 0.0 ± 0.0 | 1.653 ± 0.821 | 4.959 ± 1.649 | 4.132 ± 1.289 | 0.826 ± 0.74 | 0.826 ± 0.69 | 3.306 ± 1.167 | 0.826 ± 0.792 | 3.306 ± 3.471 | 5.785 ± 1.924 | 1.653 ± 0.982 | 9.917 ± 2.074 | 0.0 ± 0.0 | 0.826 ± 0.69 | 0.0 ± 0.0
His | 0.826 ± 0.616 | 1.653 ± 0.772 | 0.826 ± 0.69 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.826 ± 0.781 | 0.826 ± 0.734 | 1.653 ± 1.562 | 3.306 ± 0.882 | 0.826 ± 0.792 | 1.653 ± 0.821 | 2.479 ± 0.814 | 1.653 ± 0.772 | 0.826 ± 0.734 | 0.826 ± 0.69 | 1.653 ± 1.583 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 3.306 ± 1.167 | 1.653 ± 0.772 | 2.479 ± 1.189 | 4.959 ± 1.772 | 4.132 ± 1.313 | 1.653 ± 1.389 | 0.826 ± 0.792 | 3.306 ± 0.882 | 5.785 ± 0.896 | 4.959 ± 2.272 | 0.0 ± 0.0 | 0.826 ± 0.69 | 7.438 ± 2.23 | 1.653 ± 1.389 | 1.653 ± 0.839 | 7.438 ± 3.98 | 4.959 ± 1.957 | 2.479 ± 1.216 | 1.653 ± 0.772 | 3.306 ± 0.77 | 0.0 ± 0.0
Lys | 2.479 ± 1.611 | 3.306 ± 1.031 | 0.826 ± 0.616 | 1.653 ± 1.232 | 4.959 ± 1.419 | 4.132 ± 1.289 | 0.0 ± 0.0 | 4.959 ± 1.326 | 9.917 ± 1.928 | 4.959 ± 2.235 | 2.479 ± 1.395 | 2.479 ± 0.71 | 3.306 ± 1.198 | 0.0 ± 0.0 | 1.653 ± 0.962 | 7.438 ± 1.969 | 2.479 ± 0.905 | 1.653 ± 0.962 | 1.653 ± 0.772 | 1.653 ± 0.873 | 0.0 ± 0.0
Leu | 0.826 ± 1.114 | 0.826 ± 0.616 | 2.479 ± 1.582 | 5.785 ± 1.613 | 5.785 ± 1.73 | 4.959 ± 1.911 | 5.785 ± 1.799 | 6.612 ± 1.848 | 2.479 ± 1.646 | 5.785 ± 1.623 | 0.0 ± 0.0 | 4.959 ± 2.435 | 1.653 ± 0.873 | 5.785 ± 2.114 | 9.091 ± 2.732 | 2.479 ± 0.71 | 4.959 ± 1.627 | 4.132 ± 1.379 | 0.0 ± 0.0 | 4.132 ± 1.131 | 0.0 ± 0.0
Met | 3.306 ± 1.314 | 0.0 ± 0.0 | 0.826 ± 1.114 | 0.826 ± 0.781 | 0.0 ± 0.0 | 1.653 ± 0.982 | 0.826 ± 0.69 | 0.826 ± 0.792 | 0.826 ± 0.69 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.653 ± 0.772 | 0.826 ± 0.792 | 0.826 ± 0.792 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 4.959 ± 1.494 | 2.479 ± 1.502 | 3.306 ± 2.065 | 2.479 ± 1.491 | 3.306 ± 1.22 | 4.132 ± 1.164 | 0.0 ± 0.0 | 2.479 ± 1.375 | 4.132 ± 0.961 | 4.959 ± 1.289 | 0.0 ± 0.0 | 1.653 ± 0.982 | 5.785 ± 1.325 | 2.479 ± 1.502 | 4.132 ± 0.977 | 1.653 ± 0.982 | 3.306 ± 0.657 | 2.479 ± 1.216 | 3.306 ± 1.071 | 0.826 ± 0.616 | 0.0 ± 0.0
Pro | 0.826 ± 0.69 | 0.826 ± 0.616 | 3.306 ± 1.031 | 3.306 ± 1.031 | 1.653 ± 0.873 | 1.653 ± 0.839 | 1.653 ± 0.772 | 1.653 ± 0.772 | 8.264 ± 2.598 | 5.785 ± 1.799 | 0.0 ± 0.998 | 7.438 ± 1.381 | 0.0 ± 0.0 | 0.826 ± 0.734 | 8.264 ± 1.565 | 4.132 ± 1.224 | 2.479 ± 1.308 | 0.826 ± 0.781 | 0.0 ± 0.0 | 2.479 ± 0.71 | 0.0 ± 0.0
Gln | 0.826 ± 0.69 | 0.0 ± 0.0 | 3.306 ± 1.407 | 4.959 ± 1.753 | 0.826 ± 0.781 | 4.132 ± 0.876 | 0.826 ± 0.792 | 3.306 ± 1.22 | 1.653 ± 0.873 | 8.264 ± 2.669 | 0.0 ± 0.0 | 3.306 ± 1.314 | 2.479 ± 1.074 | 0.0 ± 0.0 | 3.306 ± 0.882 | 1.653 ± 0.773 | 3.306 ± 2.488 | 1.653 ± 1.389 | 0.0 ± 0.0 | 1.653 ± 0.821 | 0.0 ± 0.0
Arg | 4.132 ± 0.708 | 1.653 ± 0.772 | 6.612 ± 1.697 | 4.132 ± 1.375 | 3.306 ± 0.657 | 3.306 ± 0.882 | 0.826 ± 0.781 | 2.479 ± 2.391 | 3.306 ± 1.504 | 4.132 ± 1.136 | 0.0 ± 0.0 | 6.612 ± 1.894 | 2.479 ± 1.074 | 0.0 ± 0.0 | 8.264 ± 1.531 | 6.612 ± 1.494 | 4.959 ± 0.854 | 5.785 ± 4.063 | 0.0 ± 0.0 | 0.826 ± 0.69 | 0.0 ± 0.0
Ser | 4.132 ± 0.876 | 1.653 ± 0.982 | 3.306 ± 1.031 | 2.479 ± 1.963 | 0.826 ± 0.616 | 4.959 ± 1.962 | 1.653 ± 1.392 | 6.612 ± 2.494 | 3.306 ± 1.924 | 1.653 ± 0.821 | 2.479 ± 1.12 | 3.306 ± 1.472 | 4.132 ± 0.876 | 1.653 ± 1.389 | 6.612 ± 2.381 | 9.091 ± 4.236 | 9.091 ± 1.092 | 3.306 ± 1.947 | 1.653 ± 1.379 | 4.132 ± 1.946 | 0.0 ± 0.0
Thr | 3.306 ± 1.337 | 0.826 ± 0.781 | 2.479 ± 2.069 | 0.826 ± 0.616 | 4.132 ± 0.876 | 3.306 ± 1.476 | 1.653 ± 0.772 | 5.785 ± 0.954 | 0.826 ± 0.792 | 3.306 ± 2.242 | 0.0 ± 0.0 | 2.479 ± 1.375 | 5.785 ± 1.309 | 4.959 ± 1.753 | 4.132 ± 1.289 | 5.785 ± 3.044 | 2.479 ± 1.502 | 4.132 ± 3.376 | 2.479 ± 1.181 | 6.612 ± 3.089 | 0.0 ± 0.0
Val | 3.306 ± 0.882 | 1.653 ± 0.772 | 1.653 ± 0.773 | 4.132 ± 0.961 | 3.306 ± 3.28 | 0.826 ± 1.114 | 0.826 ± 0.734 | 4.132 ± 1.164 | 0.826 ± 0.792 | 3.306 ± 1.501 | 0.0 ± 0.0 | 4.132 ± 1.131 | 3.306 ± 1.424 | 3.306 ± 1.131 | 4.132 ± 2.146 | 2.479 ± 2.209 | 2.479 ± 1.404 | 0.826 ± 0.734 | 2.479 ± 0.905 | 4.132 ± 0.977 | 0.0 ± 0.0
Trp | 0.826 ± 0.616 | 0.826 ± 0.616 | 0.0 ± 0.0 | 0.826 ± 0.781 | 0.0 ± 0.0 | 2.479 ± 0.71 | 0.0 ± 0.0 | 0.826 ± 0.781 | 0.0 ± 0.0 | 0.826 ± 0.69 | 0.0 ± 0.0 | 0.826 ± 0.69 | 2.479 ± 0.814 | 1.653 ± 1.562 | 0.826 ± 0.792 | 0.0 ± 0.0 | 2.479 ± 1.137 | 2.479 ± 1.074 | 0.0 ± 0.0 | 0.826 ± 0.69 | 0.0 ± 0.0
Tyr | 4.132 ± 1.313 | 0.0 ± 0.0 | 1.653 ± 0.839 | 2.479 ± 1.189 | 1.653 ± 0.873 | 0.826 ± 0.792 | 2.479 ± 0.71 | 3.306 ± 1.167 | 2.479 ± 1.329 | 4.959 ± 0.854 | 2.479 ± 0.814 | 4.959 ± 1.416 | 4.132 ± 0.708 | 0.826 ± 0.781 | 0.826 ± 0.616 | 3.306 ± 1.745 | 2.479 ± 0.71 | 1.653 ± 0.773 | 0.0 ± 0.0 | 1.653 ± 1.379 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 6 proteins (1211 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
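Bootstrapping at the protein level can be sketched roughly as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is taken as the error. The function names, the toy sequences, the fixed seed and the use of the sample standard deviation as the error measure are all assumptions made for illustration; the exact estimator used by Proteome-pI is not described on this page.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over resampled protein sets (assumed: std dev)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resampled, pair))
    return statistics.stdev(estimates)


# Toy sequences (one-letter codes), not the six proteins of this proteome.
toy_proteins = ["MKKLA", "AKKG", "GGSGG", "LLKK"]
kk_error = bootstrap_error(toy_proteins, "KK")
ww_error = bootstrap_error(toy_proteins, "WW")
```

A dipeptide that occurs in none of the proteins, such as WW in the toy set, gets a zero error under this scheme, because every resample yields a frequency of zero.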

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski