Amino acid dipeptide frequency for Tomato yellow leaf curl Shuangbai virus - [Y4536]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
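As a minimal sketch of what such a calculation might look like, overlapping residue pairs can be counted within each protein and normalized by the total number of pairs. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and how Proteome-pI itself handles protein boundaries is not stated on this page, so counting within each protein only is an assumption.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping residue pairs.

    Assumption: pairs are counted within each sequence only, so no pair
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}

# Toy input in one-letter code (hypothetical, not taken from this proteome)
freqs = dipeptide_frequencies(["MSSA", "SSK"])
```

Here the two toy sequences contain five overlapping pairs in total, two of which are SS, so `freqs["SS"]` is 2/5.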

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly with equal probability in proteins, each would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
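The expected value and the color rule can be illustrated with a short sketch. It assumes the 400-dipeptide baseline and a plain above/below comparison; the exact threshold used for the coloring is not stated on this page, and the name `classify` is hypothetical. The three sample values are copied from the table, which reports per mille, so the 0.25% baseline corresponds to 2.5‰.

```python
# Expected frequency if all 400 dipeptides were equally likely
N_DIPEPTIDES = 400
EXPECTED_PER_MILLE = 1000 / N_DIPEPTIDES  # 2.5 per mille, i.e. 0.25%

def classify(per_mille):
    """Label a table value relative to the uniform expectation."""
    if per_mille > EXPECTED_PER_MILLE:
        return "over"   # assumed to correspond to red
    if per_mille < EXPECTED_PER_MILLE:
        return "under"  # assumed to correspond to blue
    return "expected"

# Values copied from the table (per mille)
sample = {"SerSer": 10.87, "AlaAla": 4.529, "AlaAsp": 0.906}
labels = {pair: classify(value) for pair, value in sample.items()}
```

Under these assumptions SerSer and AlaAla come out as overrepresented and AlaAsp as underrepresented.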

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
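The unit conversion is simple arithmetic; the helper names below are hypothetical and serve only to make the scaling explicit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10

# AlaAla value copied from the table (per mille)
ala_ala_fraction = per_mille_to_fraction(4.529)
ala_ala_percent = per_mille_to_percent(4.529)
```

For the AlaAla entry this gives a fraction of roughly 0.0045, or about 0.45%.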

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.529 ± 1.79
AlaCys: 1.812 ± 1.109
AlaAsp: 0.906 ± 0.737
AlaGlu: 3.623 ± 1.82
AlaPhe: 0.906 ± 0.795
AlaGly: 0.906 ± 0.737
AlaHis: 1.812 ± 1.12
AlaIle: 0.906 ± 0.646
AlaLys: 3.623 ± 1.756
AlaLeu: 7.246 ± 1.556
AlaMet: 0.0 ± 0.0
AlaAsn: 2.717 ± 0.754
AlaPro: 5.435 ± 1.315
AlaGln: 1.812 ± 1.12
AlaArg: 6.341 ± 2.096
AlaSer: 3.623 ± 1.985
AlaThr: 5.435 ± 1.727
AlaVal: 0.906 ± 0.97
AlaTrp: 1.812 ± 0.711
AlaTyr: 0.906 ± 0.646
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.906 ± 0.795
CysCys: 1.812 ± 1.941
CysAsp: 0.0 ± 0.0
CysGlu: 2.717 ± 1.222
CysPhe: 0.906 ± 0.88
CysGly: 1.812 ± 0.783
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.906 ± 0.737
CysLeu: 0.906 ± 0.935
CysMet: 1.812 ± 1.26
CysAsn: 2.717 ± 1.016
CysPro: 3.623 ± 2.929
CysGln: 0.0 ± 0.0
CysArg: 0.906 ± 0.646
CysSer: 1.812 ± 0.783
CysThr: 2.717 ± 0.754
CysVal: 1.812 ± 1.474
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.717 ± 1.29
AspCys: 0.906 ± 0.795
AspAsp: 1.812 ± 0.783
AspGlu: 0.906 ± 0.737
AspPhe: 1.812 ± 1.109
AspGly: 2.717 ± 1.939
AspHis: 0.0 ± 0.0
AspIle: 2.717 ± 1.664
AspLys: 2.717 ± 1.142
AspLeu: 6.341 ± 2.608
AspMet: 0.0 ± 0.0
AspAsn: 2.717 ± 1.379
AspPro: 2.717 ± 1.112
AspGln: 1.812 ± 0.783
AspArg: 3.623 ± 1.318
AspSer: 4.529 ± 1.379
AspThr: 1.812 ± 1.12
AspVal: 6.341 ± 1.568
AspTrp: 1.812 ± 1.293
AspTyr: 0.906 ± 0.646
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.529 ± 1.737
GluCys: 0.0 ± 0.0
GluAsp: 2.717 ± 1.29
GluGlu: 3.623 ± 1.756
GluPhe: 3.623 ± 2.094
GluGly: 4.529 ± 1.148
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 1.812 ± 1.12
GluLeu: 4.529 ± 1.576
GluMet: 0.0 ± 0.0
GluAsn: 3.623 ± 1.985
GluPro: 1.812 ± 0.967
GluGln: 2.717 ± 1.792
GluArg: 0.906 ± 0.88
GluSer: 4.529 ± 1.169
GluThr: 2.717 ± 1.58
GluVal: 2.717 ± 0.867
GluTrp: 1.812 ± 0.783
GluTyr: 1.812 ± 0.814
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.906 ± 0.646
PheCys: 0.906 ± 0.737
PheAsp: 3.623 ± 1.423
PheGlu: 2.717 ± 0.891
PhePhe: 1.812 ± 0.711
PheGly: 1.812 ± 1.02
PheHis: 1.812 ± 1.12
PheIle: 0.906 ± 0.646
PheLys: 1.812 ± 0.814
PheLeu: 6.341 ± 1.49
PheMet: 0.906 ± 0.646
PheAsn: 3.623 ± 2.692
PhePro: 0.906 ± 0.97
PheGln: 3.623 ± 1.149
PheArg: 4.529 ± 2.697
PheSer: 1.812 ± 1.59
PheThr: 0.906 ± 0.795
PheVal: 1.812 ± 0.711
PheTrp: 0.0 ± 0.0
PheTyr: 0.906 ± 0.737
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.717 ± 1.29
GlyCys: 1.812 ± 1.02
GlyAsp: 1.812 ± 1.293
GlyGlu: 4.529 ± 1.039
GlyPhe: 1.812 ± 1.166
GlyGly: 2.717 ± 1.142
GlyHis: 1.812 ± 1.12
GlyIle: 3.623 ± 1.632
GlyLys: 6.341 ± 2.644
GlyLeu: 0.906 ± 0.737
GlyMet: 0.0 ± 0.0
GlyAsn: 0.906 ± 0.795
GlyPro: 2.717 ± 1.142
GlyGln: 2.717 ± 0.867
GlyArg: 0.906 ± 0.646
GlySer: 1.812 ± 1.293
GlyThr: 2.717 ± 0.891
GlyVal: 2.717 ± 1.986
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.717 ± 1.42
HisCys: 2.717 ± 1.743
HisAsp: 2.717 ± 1.222
HisGlu: 2.717 ± 1.171
HisPhe: 3.623 ± 1.411
HisGly: 1.812 ± 1.166
HisHis: 1.812 ± 1.257
HisIle: 2.717 ± 1.379
HisLys: 0.0 ± 0.0
HisLeu: 2.717 ± 1.29
HisMet: 0.0 ± 0.0
HisAsn: 3.623 ± 1.149
HisPro: 0.906 ± 0.646
HisGln: 2.717 ± 1.476
HisArg: 3.623 ± 2.041
HisSer: 1.812 ± 1.109
HisThr: 0.906 ± 0.737
HisVal: 1.812 ± 0.814
HisTrp: 0.0 ± 0.0
HisTyr: 1.812 ± 1.293
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.906 ± 0.795
IleCys: 1.812 ± 1.12
IleAsp: 2.717 ± 1.196
IleGlu: 1.812 ± 1.293
IlePhe: 3.623 ± 1.149
IleGly: 0.906 ± 0.737
IleHis: 0.906 ± 0.88
IleIle: 3.623 ± 2.217
IleLys: 6.341 ± 1.49
IleLeu: 0.0 ± 0.0
IleMet: 0.906 ± 0.743
IleAsn: 1.812 ± 0.814
IlePro: 0.906 ± 0.646
IleGln: 7.246 ± 1.738
IleArg: 5.435 ± 2.614
IleSer: 4.529 ± 2.153
IleThr: 4.529 ± 1.909
IleVal: 1.812 ± 0.711
IleTrp: 2.717 ± 1.379
IleTyr: 0.906 ± 0.737
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.623 ± 0.907
LysCys: 0.0 ± 0.0
LysAsp: 2.717 ± 1.939
LysGlu: 4.529 ± 2.326
LysPhe: 1.812 ± 1.109
LysGly: 0.906 ± 0.646
LysHis: 2.717 ± 1.178
LysIle: 3.623 ± 1.322
LysLys: 1.812 ± 0.711
LysLeu: 0.0 ± 0.0
LysMet: 0.0 ± 0.0
LysAsn: 4.529 ± 1.25
LysPro: 2.717 ± 1.296
LysGln: 0.0 ± 0.0
LysArg: 3.623 ± 1.268
LysSer: 8.152 ± 2.27
LysThr: 2.717 ± 0.891
LysVal: 5.435 ± 1.96
LysTrp: 1.812 ± 0.711
LysTyr: 5.435 ± 1.41
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.717 ± 1.112
LeuCys: 1.812 ± 1.293
LeuAsp: 5.435 ± 1.789
LeuGlu: 1.812 ± 1.293
LeuPhe: 0.0 ± 0.0
LeuGly: 4.529 ± 1.549
LeuHis: 1.812 ± 1.293
LeuIle: 4.529 ± 2.106
LeuLys: 4.529 ± 1.459
LeuLeu: 0.906 ± 0.88
LeuMet: 0.906 ± 0.737
LeuAsn: 6.341 ± 2.682
LeuPro: 0.906 ± 0.795
LeuGln: 3.623 ± 1.411
LeuArg: 5.435 ± 2.815
LeuSer: 4.529 ± 2.148
LeuThr: 6.341 ± 1.593
LeuVal: 5.435 ± 1.783
LeuTrp: 0.906 ± 0.88
LeuTyr: 3.623 ± 1.539
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.812 ± 0.711
MetCys: 0.906 ± 0.737
MetAsp: 1.812 ± 1.109
MetGlu: 0.906 ± 0.935
MetPhe: 1.812 ± 1.474
MetGly: 2.717 ± 1.309
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.906 ± 0.88
MetLeu: 1.812 ± 0.967
MetMet: 0.0 ± 0.0
MetAsn: 0.906 ± 0.737
MetPro: 0.0 ± 0.0
MetGln: 0.906 ± 0.795
MetArg: 0.906 ± 0.795
MetSer: 0.906 ± 0.737
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 1.812 ± 1.12
MetTyr: 1.812 ± 1.474
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.717 ± 1.142
AsnCys: 0.0 ± 0.0
AsnAsp: 2.717 ± 1.196
AsnGlu: 2.717 ± 0.891
AsnPhe: 0.906 ± 0.737
AsnGly: 0.906 ± 0.935
AsnHis: 6.341 ± 2.956
AsnIle: 1.812 ± 0.711
AsnLys: 0.906 ± 0.646
AsnLeu: 6.341 ± 2.41
AsnMet: 2.717 ± 1.474
AsnAsn: 4.529 ± 0.85
AsnPro: 4.529 ± 0.85
AsnGln: 3.623 ± 1.025
AsnArg: 3.623 ± 1.318
AsnSer: 6.341 ± 2.044
AsnThr: 4.529 ± 2.192
AsnVal: 4.529 ± 1.427
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.717 ± 0.891
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.717 ± 1.634
ProCys: 1.812 ± 0.967
ProAsp: 2.717 ± 1.104
ProGlu: 2.717 ± 1.112
ProPhe: 1.812 ± 0.814
ProGly: 1.812 ± 0.923
ProHis: 3.623 ± 2.094
ProIle: 6.341 ± 1.371
ProLys: 3.623 ± 2.585
ProLeu: 2.717 ± 1.171
ProMet: 1.812 ± 0.914
ProAsn: 1.812 ± 0.783
ProPro: 2.717 ± 1.55
ProGln: 4.529 ± 1.169
ProArg: 5.435 ± 2.14
ProSer: 3.623 ± 1.472
ProThr: 5.435 ± 2.58
ProVal: 3.623 ± 2.041
ProTrp: 0.906 ± 0.646
ProTyr: 0.906 ± 0.737
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.623 ± 0.801
GlnCys: 3.623 ± 2.408
GlnAsp: 3.623 ± 1.812
GlnGlu: 2.717 ± 0.867
GlnPhe: 1.812 ± 0.814
GlnGly: 0.906 ± 0.646
GlnHis: 3.623 ± 1.991
GlnIle: 3.623 ± 1.628
GlnLys: 2.717 ± 1.992
GlnLeu: 3.623 ± 2.24
GlnMet: 0.0 ± 0.0
GlnAsn: 3.623 ± 1.149
GlnPro: 5.435 ± 3.31
GlnGln: 1.812 ± 0.711
GlnArg: 0.906 ± 0.646
GlnSer: 4.529 ± 1.737
GlnThr: 0.906 ± 0.88
GlnVal: 2.717 ± 0.754
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.812 ± 0.711
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.623 ± 1.576
ArgCys: 2.717 ± 1.994
ArgAsp: 3.623 ± 1.192
ArgGlu: 3.623 ± 1.727
ArgPhe: 3.623 ± 1.22
ArgGly: 3.623 ± 1.268
ArgHis: 3.623 ± 1.472
ArgIle: 3.623 ± 1.025
ArgLys: 2.717 ± 1.664
ArgLeu: 2.717 ± 1.379
ArgMet: 1.812 ± 1.474
ArgAsn: 1.812 ± 1.357
ArgPro: 7.246 ± 1.996
ArgGln: 1.812 ± 1.357
ArgArg: 7.246 ± 3.828
ArgSer: 4.529 ± 1.517
ArgThr: 3.623 ± 1.803
ArgVal: 6.341 ± 3.142
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.812 ± 0.967
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.435 ± 3.878
SerCys: 0.0 ± 0.0
SerAsp: 2.717 ± 0.754
SerGlu: 1.812 ± 0.711
SerPhe: 3.623 ± 1.025
SerGly: 2.717 ± 1.178
SerHis: 1.812 ± 1.257
SerIle: 3.623 ± 1.3
SerLys: 5.435 ± 1.739
SerLeu: 1.812 ± 1.293
SerMet: 3.623 ± 1.417
SerAsn: 5.435 ± 1.315
SerPro: 8.152 ± 1.164
SerGln: 3.623 ± 2.094
SerArg: 4.529 ± 1.744
SerSer: 10.87 ± 4.201
SerThr: 6.341 ± 3.996
SerVal: 4.529 ± 2.049
SerTrp: 0.0 ± 0.0
SerTyr: 2.717 ± 1.196
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.717 ± 1.379
ThrCys: 1.812 ± 1.321
ThrAsp: 1.812 ± 1.87
ThrGlu: 1.812 ± 1.279
ThrPhe: 0.906 ± 0.935
ThrGly: 3.623 ± 1.322
ThrHis: 4.529 ± 2.26
ThrIle: 5.435 ± 2.031
ThrLys: 4.529 ± 1.25
ThrLeu: 3.623 ± 1.032
ThrMet: 0.906 ± 0.646
ThrAsn: 5.435 ± 1.488
ThrPro: 5.435 ± 1.846
ThrGln: 2.717 ± 1.016
ThrArg: 2.717 ± 0.814
ThrSer: 3.623 ± 1.39
ThrThr: 1.812 ± 1.321
ThrVal: 2.717 ± 1.42
ThrTrp: 2.717 ± 1.986
ThrTyr: 2.717 ± 1.309
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.906 ± 0.646
ValAsp: 2.717 ± 0.754
ValGlu: 1.812 ± 1.941
ValPhe: 3.623 ± 0.948
ValGly: 1.812 ± 1.02
ValHis: 1.812 ± 0.967
ValIle: 4.529 ± 1.768
ValLys: 4.529 ± 1.769
ValLeu: 6.341 ± 1.665
ValMet: 1.812 ± 1.474
ValAsn: 3.623 ± 2.027
ValPro: 2.717 ± 0.754
ValGln: 4.529 ± 1.971
ValArg: 4.529 ± 2.964
ValSer: 4.529 ± 1.281
ValThr: 5.435 ± 2.679
ValVal: 1.812 ± 0.711
ValTrp: 0.0 ± 0.0
ValTyr: 5.435 ± 1.972
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.623 ± 1.714
TrpCys: 0.0 ± 0.0
TrpAsp: 1.812 ± 1.26
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.906 ± 0.646
TrpHis: 1.812 ± 1.109
TrpIle: 0.906 ± 0.935
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.906 ± 0.737
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.906 ± 0.646
TrpArg: 1.812 ± 0.783
TrpSer: 0.0 ± 0.0
TrpThr: 1.812 ± 1.76
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.812 ± 0.923
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.717 ± 1.296
TyrCys: 0.0 ± 0.0
TyrAsp: 0.906 ± 0.737
TyrGlu: 0.906 ± 0.737
TyrPhe: 3.623 ± 0.948
TyrGly: 0.906 ± 0.646
TyrHis: 0.906 ± 0.646
TyrIle: 0.906 ± 0.646
TyrLys: 0.906 ± 0.646
TyrLeu: 7.246 ± 1.985
TyrMet: 0.906 ± 1.049
TyrAsn: 2.717 ± 0.891
TyrPro: 1.812 ± 0.923
TyrGln: 0.906 ± 0.737
TyrArg: 2.717 ± 1.296
TyrSer: 2.717 ± 1.55
TyrThr: 0.906 ± 0.88
TyrVal: 5.435 ± 1.727
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.906 ± 0.795
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1105 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
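A protein-level bootstrap can be sketched as follows. This is an illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement 100 times, the statistic is recomputed on each resample, and the standard deviation across resamples is taken as the error. The function names, the fixed seed, the choice of standard deviation as the error measure, and the toy sequences are all hypothetical.

```python
import random
import statistics

def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    if total == 0:
        return 0.0
    hits = sum(1 for s in sequences for a, b in zip(s, s[1:]) if a + b == pair)
    return 1000 * hits / total

def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the statistic over protein-level resamples."""
    rng = random.Random(seed)
    values = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        resample = rng.choices(sequences, k=len(sequences))
        values.append(dipeptide_per_mille(resample, pair))
    return statistics.pstdev(values)

# Toy proteins in one-letter code (hypothetical, not from this proteome)
proteins = ["MSSAK", "SSKLV", "MAKLSS"]
estimate = dipeptide_per_mille(proteins, "SS")
error = bootstrap_error(proteins, "SS")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, which is one plausible reading of "at the protein level". A dipeptide that never occurs in any protein gets an error of exactly zero under this scheme, which is consistent with the 0.0 ± 0.0 entries in the table.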

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski