Amino acid dipeptide frequency for Pepper leaf curl Lahore virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
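The counting described above can be sketched in Python. This is only a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count pairs within each protein only (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions of this sketch: pairs are counted inside each protein only,
    and every non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Sliding window of width 2, so overlapping pairs are all counted.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_UNIFORM = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

toy = ["MAAL", "AALS"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_per_mille(toy)
```

The result always has 441 entries (21 × 21), and the frequencies sum to 1000‰ whenever at least one dipeptide was counted.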

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 8.257 ± 2.356 | 0.917 ± 0.885 | 1.835 ± 0.814 | 1.835 ± 1.355 | 0.917 ± 1.189 | 1.835 ± 0.814 | 0.917 ± 1.139 | 1.835 ± 1.355 | 2.752 ± 1.555 | 8.257 ± 1.465 | 0.917 ± 1.028 | 0.917 ± 0.678 | 0.917 ± 0.885 | 4.587 ± 1.564 | 4.587 ± 1.706 | 4.587 ± 3.4 | 4.587 ± 2.344 | 3.67 ± 1.482 | 2.752 ± 1.209 | 0.0 ± 0.0 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 1.835 ± 2.279 | 0.917 ± 0.678 | 0.917 ± 0.885 | 0.917 ± 1.028 | 1.835 ± 1.207 | 0.917 ± 1.189 | 2.752 ± 1.515 | 0.917 ± 0.885 | 0.0 ± 0.0 | 1.835 ± 1.179 | 0.917 ± 0.678 | 3.67 ± 2.339 | 0.917 ± 1.139 | 0.0 ± 0.0 | 2.752 ± 2.469 | 0.917 ± 0.885 | 1.835 ± 0.814 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 2.752 ± 1.555 | 0.0 ± 0.0 | 1.835 ± 1.207 | 2.752 ± 1.03 | 2.752 ± 1.03 | 3.67 ± 2.073 | 1.835 ± 1.506 | 2.752 ± 1.515 | 1.835 ± 0.814 | 5.505 ± 3.077 | 0.917 ± 1.028 | 0.917 ± 0.885 | 2.752 ± 1.535 | 1.835 ± 1.355 | 3.67 ± 1.597 | 4.587 ± 1.557 | 1.835 ± 1.511 | 7.339 ± 2.539 | 1.835 ± 1.355 | 0.917 ± 0.678 | 0.0 ± 0.0
Glu | 3.67 ± 1.259 | 0.0 ± 0.0 | 0.917 ± 1.148 | 5.505 ± 2.591 | 2.752 ± 1.535 | 3.67 ± 1.071 | 1.835 ± 2.056 | 0.917 ± 0.678 | 3.67 ± 2.064 | 2.752 ± 1.335 | 0.0 ± 0.0 | 3.67 ± 2.403 | 1.835 ± 0.814 | 5.505 ± 1.963 | 0.0 ± 0.0 | 1.835 ± 1.264 | 1.835 ± 1.694 | 3.67 ± 1.19 | 1.835 ± 1.207 | 0.0 ± 0.0 | 0.0 ± 0.0
Phe | 0.0 ± 0.0 | 1.835 ± 0.814 | 3.67 ± 1.629 | 0.917 ± 0.885 | 0.917 ± 0.885 | 0.917 ± 0.885 | 1.835 ± 1.169 | 2.752 ± 2.033 | 3.67 ± 2.184 | 7.339 ± 2.357 | 1.835 ± 1.092 | 0.917 ± 0.885 | 0.917 ± 1.139 | 3.67 ± 2.073 | 3.67 ± 1.635 | 0.917 ± 1.189 | 2.752 ± 2.298 | 0.917 ± 0.885 | 0.0 ± 0.0 | 0.917 ± 0.885 | 0.0 ± 0.0
Gly | 1.835 ± 1.355 | 1.835 ± 1.264 | 2.752 ± 2.298 | 2.752 ± 1.468 | 2.752 ± 2.469 | 3.67 ± 1.201 | 2.752 ± 1.392 | 0.917 ± 0.678 | 6.422 ± 3.117 | 2.752 ± 1.734 | 0.917 ± 1.028 | 0.0 ± 0.0 | 3.67 ± 1.783 | 3.67 ± 1.259 | 0.917 ± 0.678 | 4.587 ± 2.664 | 4.587 ± 2.313 | 2.752 ± 2.227 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
His | 2.752 ± 1.56 | 1.835 ± 1.511 | 3.67 ± 2.32 | 0.917 ± 1.139 | 2.752 ± 1.535 | 1.835 ± 1.511 | 0.917 ± 1.189 | 1.835 ± 0.814 | 1.835 ± 1.444 | 0.917 ± 0.678 | 0.0 ± 0.0 | 4.587 ± 2.414 | 1.835 ± 1.355 | 0.917 ± 0.678 | 4.587 ± 3.502 | 1.835 ± 1.506 | 1.835 ± 1.77 | 1.835 ± 2.056 | 0.0 ± 0.0 | 0.917 ± 0.678 | 0.0 ± 0.0
Ile | 0.0 ± 0.0 | 0.917 ± 1.028 | 3.67 ± 2.054 | 1.835 ± 1.355 | 3.67 ± 2.711 | 0.917 ± 0.885 | 0.917 ± 0.678 | 4.587 ± 2.177 | 3.67 ± 1.128 | 0.917 ± 0.678 | 0.0 ± 0.0 | 1.835 ± 1.092 | 1.835 ± 1.355 | 0.917 ± 0.678 | 6.422 ± 2.114 | 5.505 ± 2.54 | 3.67 ± 2.886 | 0.917 ± 0.678 | 1.835 ± 1.234 | 1.835 ± 1.234 | 0.0 ± 0.0
Lys | 2.752 ± 1.038 | 2.752 ± 1.499 | 1.835 ± 1.355 | 3.67 ± 1.783 | 2.752 ± 1.038 | 3.67 ± 2.339 | 1.835 ± 1.355 | 3.67 ± 2.467 | 1.835 ± 1.264 | 1.835 ± 1.355 | 0.0 ± 0.0 | 3.67 ± 1.629 | 3.67 ± 1.597 | 0.0 ± 0.0 | 2.752 ± 1.83 | 4.587 ± 1.656 | 2.752 ± 1.084 | 4.587 ± 2.699 | 0.917 ± 0.885 | 2.752 ± 1.084 | 0.0 ± 0.0
Leu | 2.752 ± 1.392 | 3.67 ± 1.748 | 5.505 ± 2.58 | 4.587 ± 1.804 | 1.835 ± 1.207 | 5.505 ± 1.773 | 0.917 ± 0.678 | 5.505 ± 2.86 | 3.67 ± 1.783 | 1.835 ± 1.694 | 4.587 ± 2.61 | 5.505 ± 1.876 | 1.835 ± 1.207 | 1.835 ± 1.169 | 6.422 ± 3.8 | 2.752 ± 1.511 | 3.67 ± 1.19 | 3.67 ± 1.947 | 0.0 ± 0.0 | 4.587 ± 1.897 | 0.0 ± 0.0
Met | 0.917 ± 0.885 | 0.917 ± 0.885 | 1.835 ± 1.234 | 1.835 ± 1.443 | 2.752 ± 1.83 | 2.752 ± 1.582 | 1.835 ± 1.234 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.67 ± 1.149 | 0.917 ± 1.148 | 1.835 ± 1.234 | 1.835 ± 1.158 | 0.917 ± 1.148 | 0.917 ± 1.189 | 3.67 ± 2.135 | 0.917 ± 1.028 | 0.917 ± 0.678 | 1.835 ± 1.169 | 1.835 ± 1.77 | 0.0 ± 0.0
Asn | 3.67 ± 1.783 | 2.752 ± 2.298 | 1.835 ± 1.355 | 1.835 ± 1.299 | 1.835 ± 0.814 | 0.917 ± 1.028 | 5.505 ± 2.331 | 1.835 ± 0.814 | 0.0 ± 0.0 | 4.587 ± 2.224 | 2.752 ± 2.523 | 2.752 ± 1.03 | 3.67 ± 1.19 | 0.917 ± 0.885 | 4.587 ± 2.703 | 4.587 ± 1.714 | 2.752 ± 1.499 | 1.835 ± 1.355 | 0.0 ± 0.0 | 2.752 ± 1.084 | 0.0 ± 0.0
Pro | 2.752 ± 1.647 | 1.835 ± 1.299 | 4.587 ± 2.589 | 2.752 ± 1.392 | 0.917 ± 0.678 | 0.917 ± 0.678 | 2.752 ± 1.535 | 1.835 ± 1.207 | 3.67 ± 2.711 | 5.505 ± 2.205 | 1.835 ± 1.069 | 5.505 ± 1.986 | 2.752 ± 1.582 | 3.67 ± 2.36 | 6.422 ± 2.25 | 7.339 ± 2.773 | 2.752 ± 1.137 | 4.587 ± 1.427 | 0.0 ± 0.0 | 0.917 ± 0.885 | 0.0 ± 0.0
Gln | 3.67 ± 2.32 | 0.0 ± 0.0 | 4.587 ± 1.957 | 3.67 ± 1.259 | 2.752 ± 2.033 | 2.752 ± 2.033 | 0.917 ± 1.189 | 2.752 ± 1.499 | 0.917 ± 1.139 | 2.752 ± 2.649 | 1.835 ± 1.207 | 0.0 ± 0.0 | 5.505 ± 3.906 | 4.587 ± 2.248 | 1.835 ± 1.158 | 4.587 ± 1.185 | 3.67 ± 1.549 | 2.752 ± 1.038 | 0.0 ± 0.0 | 1.835 ± 0.814 | 0.0 ± 0.0
Arg | 4.587 ± 1.837 | 1.835 ± 2.279 | 3.67 ± 1.597 | 1.835 ± 1.158 | 1.835 ± 1.234 | 3.67 ± 1.495 | 3.67 ± 1.071 | 2.752 ± 1.038 | 2.752 ± 1.885 | 3.67 ± 1.071 | 1.835 ± 1.77 | 4.587 ± 1.999 | 9.174 ± 2.938 | 1.835 ± 1.694 | 6.422 ± 3.885 | 4.587 ± 1.206 | 2.752 ± 1.335 | 6.422 ± 1.857 | 0.0 ± 0.0 | 3.67 ± 2.176 | 0.0 ± 0.0
Ser | 2.752 ± 2.033 | 0.917 ± 0.678 | 4.587 ± 1.656 | 0.917 ± 1.148 | 3.67 ± 1.287 | 0.917 ± 1.189 | 0.0 ± 0.0 | 1.835 ± 1.355 | 6.422 ± 2.168 | 5.505 ± 1.986 | 3.67 ± 3.73 | 4.587 ± 2.177 | 8.257 ± 2.517 | 2.752 ± 1.353 | 7.339 ± 2.655 | 14.679 ± 6.448 | 7.339 ± 2.638 | 5.505 ± 2.446 | 0.0 ± 0.0 | 2.752 ± 1.555 | 0.0 ± 0.0
Thr | 5.505 ± 1.267 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.917 ± 0.885 | 0.917 ± 0.678 | 6.422 ± 1.994 | 4.587 ± 2.205 | 0.917 ± 0.678 | 2.752 ± 1.56 | 3.67 ± 1.784 | 0.917 ± 0.678 | 1.835 ± 1.234 | 4.587 ± 2.633 | 2.752 ± 1.511 | 1.835 ± 1.264 | 5.505 ± 3.502 | 0.917 ± 0.885 | 7.339 ± 2.829 | 0.917 ± 1.028 | 1.835 ± 1.169 | 0.0 ± 0.0
Val | 2.752 ± 1.555 | 0.0 ± 0.0 | 2.752 ± 1.03 | 4.587 ± 2.179 | 1.835 ± 1.234 | 1.835 ± 1.264 | 1.835 ± 1.299 | 4.587 ± 1.704 | 4.587 ± 2.344 | 4.587 ± 3.276 | 2.752 ± 1.56 | 4.587 ± 2.414 | 4.587 ± 2.072 | 7.339 ± 3.201 | 4.587 ± 3.51 | 2.752 ± 1.137 | 3.67 ± 2.403 | 2.752 ± 1.885 | 0.0 ± 0.0 | 6.422 ± 2.536 | 0.0 ± 0.0
Trp | 2.752 ± 2.033 | 0.0 ± 0.0 | 0.917 ± 1.139 | 0.917 ± 1.028 | 0.0 ± 0.0 | 0.917 ± 0.678 | 0.917 ± 0.885 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.835 ± 1.234 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.917 ± 0.678 | 0.917 ± 1.189 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.917 ± 0.885 | 0.0 ± 0.0 | 0.917 ± 0.678 | 0.0 ± 0.0
Tyr | 2.752 ± 2.656 | 0.0 ± 0.0 | 0.917 ± 0.885 | 0.917 ± 0.885 | 1.835 ± 1.234 | 0.917 ± 0.678 | 0.917 ± 0.678 | 1.835 ± 1.355 | 0.917 ± 0.678 | 4.587 ± 1.564 | 1.835 ± 1.194 | 2.752 ± 1.084 | 0.917 ± 0.678 | 1.835 ± 1.234 | 3.67 ± 1.591 | 2.752 ± 1.353 | 0.917 ± 1.028 | 4.587 ± 1.387 | 0.0 ± 0.0 | 0.917 ± 1.189 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
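As a rough illustration of how the red/blue marking could be derived, the sketch below compares a few values taken from the table with the uniform expectation of 2.5‰. The function name `classify` and the simple threshold rule (which ignores the error estimates and the 441-dipeptide alternative) are assumptions of this example and may differ from what the database actually does.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides

def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency as over- or under-represented relative to `expected`."""
    if value_per_mille > expected:
        return "over"   # would be marked in red
    if value_per_mille < expected:
        return "under"  # would be marked in blue
    return "expected"

# A few frequencies (in per mille) copied from the table.
sample = {"AlaAla": 8.257, "AlaCys": 0.917, "SerSer": 14.679, "AlaTyr": 0.0}
labels = {name: classify(value) for name, value in sample.items()}
```

With these inputs, AlaAla and SerSer come out as over-represented, while AlaCys and AlaTyr come out as under-represented.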
Statistics based on 6 proteins (1091 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
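Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is a hedged illustration only; the helper names, the fixed seed, and the use of the population standard deviation are assumptions, and the database may compute its error differently.

```python
import random
import statistics

def frequency_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over `n_rounds` protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = [rng.choice(proteins) for _ in proteins]
        estimates.append(frequency_per_mille(resample, dipeptide))
    return statistics.pstdev(estimates)

toy = ["MAAL", "AALS", "MKV"]  # hypothetical sequences, not from this proteome
error_aa = bootstrap_error(toy, "AA")
```

A dipeptide that occurs in none of the proteins has the same frequency (zero) in every resample, so its error is zero as well, which is consistent with the 0.0 ± 0.0 entries in the table.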

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski