Amino acid dipeptide frequency for Pedilanthus leaf curl virus [Pakistan:Multan:2004]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with every amino acid equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
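The calculation described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the database's own pipeline: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be in one-letter code, pairs are counted as overlapping windows within each protein, and the toy sequences are invented for the example.

```python
from collections import Counter
from itertools import product

# Twenty standard residues in one-letter code; anything else is pooled as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before pairing.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def classify(freqs, alphabet=STANDARD):
    """Label every possible dipeptide against the uniform expectation."""
    expected = 1000.0 / len(alphabet) ** 2  # 2.5 per mille for 400 pairs
    labels = {}
    for a, b in product(alphabet, repeat=2):
        f = freqs.get(a + b, 0.0)
        if f > expected:
            labels[a + b] = "over"
        elif f < expected:
            labels[a + b] = "under"
        else:
            labels[a + b] = "expected"
    return labels

# Toy input, not the viral proteome.
toy = ["MSSAAL", "MAALSS"]
freqs = dipeptide_per_mille(toy)
labels = classify(freqs)
print(freqs["SS"], labels["SS"], labels["WW"])
```

With a proteome as small as this one, a dipeptide seen only a few times can already sit on one side of the 2.5‰ line, so the "over" and "under" labels are best read together with the error estimates.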

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
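As a worked example of this unit conversion, take the AlaAla entry from the table (7.847‰) and the uniform expectation of 1 in 400 for a standard dipeptide:

```latex
\[
7.847\,\text{\textperthousand} = 7.847 \times 10^{-3} = 0.7847\,\%,
\qquad
\frac{1}{400} = 0.25\,\% = 2.5\,\text{\textperthousand},
\qquad
\frac{7.847}{2.5} \approx 3.1
\]
```

So AlaAla occurs roughly three times as often as a uniform distribution over 400 dipeptides would predict.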

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.847 ± 3.057
AlaCys: 1.744 ± 1.042
AlaAsp: 0.872 ± 0.737
AlaGlu: 1.744 ± 1.026
AlaPhe: 0.872 ± 1.079
AlaGly: 1.744 ± 0.776
AlaHis: 2.616 ± 1.152
AlaIle: 1.744 ± 0.891
AlaLys: 2.616 ± 1.354
AlaLeu: 9.59 ± 2.31
AlaMet: 0.0 ± 0.0
AlaAsn: 2.616 ± 1.189
AlaPro: 4.359 ± 1.222
AlaGln: 3.487 ± 1.393
AlaArg: 6.103 ± 2.441
AlaSer: 6.103 ± 2.602
AlaThr: 3.487 ± 2.219
AlaVal: 2.616 ± 1.795
AlaTrp: 1.744 ± 0.776
AlaTyr: 0.872 ± 0.613
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 1.744 ± 2.018
CysAsp: 0.0 ± 0.0
CysGlu: 1.744 ± 1.11
CysPhe: 1.744 ± 1.448
CysGly: 1.744 ± 0.891
CysHis: 0.872 ± 0.882
CysIle: 0.0 ± 0.0
CysLys: 0.872 ± 0.737
CysLeu: 0.0 ± 0.0
CysMet: 1.744 ± 1.025
CysAsn: 0.872 ± 0.613
CysPro: 1.744 ± 2.018
CysGln: 0.872 ± 0.613
CysArg: 0.872 ± 0.613
CysSer: 4.359 ± 2.047
CysThr: 1.744 ± 1.273
CysVal: 1.744 ± 1.474
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.744 ± 1.227
AspCys: 0.0 ± 0.0
AspAsp: 1.744 ± 0.891
AspGlu: 2.616 ± 0.974
AspPhe: 0.872 ± 0.737
AspGly: 2.616 ± 1.84
AspHis: 0.872 ± 1.079
AspIle: 3.487 ± 1.338
AspLys: 0.872 ± 0.737
AspLeu: 6.103 ± 2.36
AspMet: 0.0 ± 0.0
AspAsn: 2.616 ± 1.327
AspPro: 0.872 ± 0.613
AspGln: 1.744 ± 1.227
AspArg: 2.616 ± 1.384
AspSer: 5.231 ± 1.491
AspThr: 4.359 ± 1.796
AspVal: 5.231 ± 1.948
AspTrp: 1.744 ± 0.891
AspTyr: 0.872 ± 0.613
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.359 ± 1.624
GluCys: 0.0 ± 0.0
GluAsp: 1.744 ± 1.086
GluGlu: 3.487 ± 1.755
GluPhe: 3.487 ± 1.837
GluGly: 6.103 ± 1.826
GluHis: 0.872 ± 1.079
GluIle: 0.872 ± 1.079
GluLys: 1.744 ± 1.227
GluLeu: 4.359 ± 1.697
GluMet: 0.0 ± 0.0
GluAsn: 4.359 ± 1.947
GluPro: 2.616 ± 1.279
GluGln: 3.487 ± 2.134
GluArg: 0.0 ± 0.0
GluSer: 4.359 ± 2.535
GluThr: 1.744 ± 1.449
GluVal: 2.616 ± 1.189
GluTrp: 1.744 ± 0.891
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.0 ± 0.0
PheCys: 0.872 ± 0.737
PheAsp: 4.359 ± 1.443
PheGlu: 1.744 ± 0.776
PhePhe: 1.744 ± 0.776
PheGly: 1.744 ± 0.776
PheHis: 2.616 ± 1.355
PheIle: 3.487 ± 0.985
PheLys: 2.616 ± 2.014
PheLeu: 6.103 ± 1.785
PheMet: 0.872 ± 0.613
PheAsn: 2.616 ± 2.097
PhePro: 1.744 ± 1.287
PheGln: 2.616 ± 1.249
PheArg: 3.487 ± 2.646
PheSer: 0.872 ± 0.613
PheThr: 4.359 ± 2.077
PheVal: 0.872 ± 0.613
PheTrp: 0.0 ± 0.0
PheTyr: 0.872 ± 0.737
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.487 ± 2.171
GlyCys: 1.744 ± 1.042
GlyAsp: 4.359 ± 2.082
GlyGlu: 4.359 ± 1.219
GlyPhe: 2.616 ± 2.137
GlyGly: 3.487 ± 1.199
GlyHis: 0.872 ± 0.613
GlyIle: 2.616 ± 0.937
GlyLys: 6.975 ± 2.635
GlyLeu: 1.744 ± 1.042
GlyMet: 0.872 ± 1.009
GlyAsn: 1.744 ± 2.305
GlyPro: 3.487 ± 1.725
GlyGln: 2.616 ± 1.073
GlyArg: 1.744 ± 1.026
GlySer: 3.487 ± 1.837
GlyThr: 4.359 ± 1.796
GlyVal: 1.744 ± 2.157
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.744 ± 1.11
HisCys: 2.616 ± 1.152
HisAsp: 0.872 ± 0.737
HisGlu: 2.616 ± 1.152
HisPhe: 2.616 ± 1.355
HisGly: 1.744 ± 1.287
HisHis: 4.359 ± 3.335
HisIle: 1.744 ± 1.273
HisLys: 1.744 ± 1.448
HisLeu: 2.616 ± 1.84
HisMet: 0.0 ± 0.0
HisAsn: 4.359 ± 2.021
HisPro: 0.872 ± 0.613
HisGln: 0.0 ± 0.0
HisArg: 3.487 ± 2.084
HisSer: 1.744 ± 1.765
HisThr: 1.744 ± 1.474
HisVal: 2.616 ± 2.014
HisTrp: 0.0 ± 0.0
HisTyr: 1.744 ± 1.227
HisXaa: 0.0 ± 0.0

Ile
IleAla: 0.872 ± 0.882
IleCys: 2.616 ± 1.268
IleAsp: 2.616 ± 1.355
IleGlu: 1.744 ± 1.026
IlePhe: 2.616 ± 1.84
IleGly: 2.616 ± 1.384
IleHis: 1.744 ± 1.361
IleIle: 2.616 ± 1.593
IleLys: 5.231 ± 1.283
IleLeu: 0.872 ± 0.613
IleMet: 0.0 ± 0.0
IleAsn: 2.616 ± 1.282
IlePro: 0.872 ± 0.613
IleGln: 6.975 ± 2.709
IleArg: 5.231 ± 2.345
IleSer: 4.359 ± 2.513
IleThr: 2.616 ± 3.236
IleVal: 1.744 ± 0.776
IleTrp: 3.487 ± 2.358
IleTyr: 2.616 ± 0.937
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.487 ± 1.491
LysCys: 0.872 ± 1.079
LysAsp: 1.744 ± 1.227
LysGlu: 4.359 ± 2.3
LysPhe: 2.616 ± 0.937
LysGly: 5.231 ± 2.525
LysHis: 0.872 ± 0.613
LysIle: 4.359 ± 1.46
LysLys: 2.616 ± 0.885
LysLeu: 0.872 ± 1.009
LysMet: 0.0 ± 0.0
LysAsn: 4.359 ± 1.911
LysPro: 2.616 ± 1.384
LysGln: 0.0 ± 0.0
LysArg: 2.616 ± 1.384
LysSer: 6.975 ± 2.435
LysThr: 2.616 ± 0.937
LysVal: 4.359 ± 2.005
LysTrp: 0.872 ± 0.737
LysTyr: 4.359 ± 1.058
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.359 ± 1.891
LeuCys: 2.616 ± 1.355
LeuAsp: 5.231 ± 2.116
LeuGlu: 6.103 ± 2.505
LeuPhe: 0.872 ± 0.882
LeuGly: 5.231 ± 1.876
LeuHis: 2.616 ± 1.301
LeuIle: 4.359 ± 2.149
LeuLys: 5.231 ± 1.283
LeuLeu: 1.744 ± 2.305
LeuMet: 1.744 ± 1.029
LeuAsn: 3.487 ± 1.199
LeuPro: 0.872 ± 0.882
LeuGln: 2.616 ± 1.152
LeuArg: 6.103 ± 1.736
LeuSer: 2.616 ± 1.84
LeuThr: 6.103 ± 2.136
LeuVal: 4.359 ± 1.996
LeuTrp: 0.872 ± 1.079
LeuTyr: 2.616 ± 0.937
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.744 ± 0.776
MetCys: 0.872 ± 0.737
MetAsp: 2.616 ± 0.937
MetGlu: 0.872 ± 1.153
MetPhe: 2.616 ± 1.575
MetGly: 2.616 ± 1.268
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.744 ± 1.11
MetMet: 0.0 ± 0.0
MetAsn: 1.744 ± 1.143
MetPro: 0.872 ± 0.613
MetGln: 0.0 ± 0.0
MetArg: 0.872 ± 0.882
MetSer: 0.872 ± 0.737
MetThr: 0.872 ± 1.079
MetVal: 0.872 ± 1.009
MetTrp: 0.872 ± 0.613
MetTyr: 1.744 ± 1.474
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.487 ± 1.725
AsnCys: 0.0 ± 0.0
AsnAsp: 0.872 ± 0.613
AsnGlu: 1.744 ± 0.776
AsnPhe: 0.872 ± 0.737
AsnGly: 1.744 ± 1.592
AsnHis: 3.487 ± 1.678
AsnIle: 4.359 ± 1.821
AsnLys: 0.872 ± 0.613
AsnLeu: 5.231 ± 2.027
AsnMet: 1.744 ± 1.367
AsnAsn: 2.616 ± 1.557
AsnPro: 3.487 ± 1.096
AsnGln: 5.231 ± 1.998
AsnArg: 4.359 ± 2.128
AsnSer: 2.616 ± 1.189
AsnThr: 1.744 ± 1.086
AsnVal: 4.359 ± 1.821
AsnTrp: 0.872 ± 0.613
AsnTyr: 3.487 ± 1.19
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.744 ± 1.474
ProCys: 2.616 ± 1.279
ProAsp: 2.616 ± 1.279
ProGlu: 2.616 ± 1.152
ProPhe: 2.616 ± 1.301
ProGly: 2.616 ± 1.334
ProHis: 4.359 ± 2.38
ProIle: 5.231 ± 1.877
ProLys: 3.487 ± 2.453
ProLeu: 3.487 ± 1.288
ProMet: 0.872 ± 0.737
ProAsn: 1.744 ± 1.227
ProPro: 4.359 ± 2.055
ProGln: 3.487 ± 2.524
ProArg: 3.487 ± 1.643
ProSer: 4.359 ± 2.213
ProThr: 5.231 ± 2.308
ProVal: 3.487 ± 1.552
ProTrp: 0.0 ± 0.0
ProTyr: 0.872 ± 0.737
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 7.847 ± 1.673
GlnCys: 0.0 ± 0.0
GlnAsp: 4.359 ± 2.174
GlnGlu: 1.744 ± 0.776
GlnPhe: 2.616 ± 1.301
GlnGly: 0.872 ± 0.613
GlnHis: 2.616 ± 1.549
GlnIle: 4.359 ± 2.287
GlnLys: 1.744 ± 2.018
GlnLeu: 1.744 ± 2.018
GlnMet: 0.872 ± 0.882
GlnAsn: 3.487 ± 1.288
GlnPro: 4.359 ± 2.833
GlnGln: 2.616 ± 0.885
GlnArg: 3.487 ± 2.171
GlnSer: 5.231 ± 1.283
GlnThr: 0.0 ± 0.0
GlnVal: 4.359 ± 0.965
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.872 ± 0.737
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.359 ± 1.7
ArgCys: 2.616 ± 1.939
ArgAsp: 3.487 ± 1.39
ArgGlu: 2.616 ± 1.354
ArgPhe: 3.487 ± 1.096
ArgGly: 3.487 ± 1.364
ArgHis: 2.616 ± 1.279
ArgIle: 2.616 ± 0.937
ArgLys: 3.487 ± 1.678
ArgLeu: 4.359 ± 2.535
ArgMet: 1.744 ± 1.474
ArgAsn: 1.744 ± 1.025
ArgPro: 6.103 ± 1.654
ArgGln: 0.872 ± 1.009
ArgArg: 6.103 ± 3.324
ArgSer: 6.103 ± 2.136
ArgThr: 3.487 ± 2.323
ArgVal: 6.103 ± 2.141
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.744 ± 1.11
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.359 ± 3.067
SerCys: 0.0 ± 0.0
SerAsp: 3.487 ± 1.253
SerGlu: 0.872 ± 0.613
SerPhe: 3.487 ± 1.253
SerGly: 0.0 ± 0.0
SerHis: 0.872 ± 0.882
SerIle: 3.487 ± 2.223
SerLys: 6.103 ± 2.41
SerLeu: 3.487 ± 1.44
SerMet: 2.616 ± 2.107
SerAsn: 5.231 ± 1.703
SerPro: 9.59 ± 2.047
SerGln: 5.231 ± 1.44
SerArg: 6.103 ± 1.808
SerSer: 13.078 ± 6.284
SerThr: 6.103 ± 1.135
SerVal: 5.231 ± 2.604
SerTrp: 0.0 ± 0.0
SerTyr: 3.487 ± 1.288
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.616 ± 1.327
ThrCys: 1.744 ± 1.086
ThrAsp: 0.872 ± 0.613
ThrGlu: 2.616 ± 1.557
ThrPhe: 1.744 ± 1.449
ThrGly: 4.359 ± 2.029
ThrHis: 5.231 ± 1.77
ThrIle: 0.872 ± 0.613
ThrLys: 4.359 ± 1.911
ThrLeu: 4.359 ± 1.093
ThrMet: 1.744 ± 1.026
ThrAsn: 1.744 ± 0.776
ThrPro: 5.231 ± 1.905
ThrGln: 3.487 ± 2.438
ThrArg: 2.616 ± 0.885
ThrSer: 2.616 ± 1.795
ThrThr: 0.872 ± 1.153
ThrVal: 4.359 ± 1.7
ThrTrp: 0.872 ± 1.153
ThrTyr: 2.616 ± 1.939
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 3.487 ± 0.985
ValGlu: 2.616 ± 1.939
ValPhe: 2.616 ± 2.097
ValGly: 2.616 ± 1.279
ValHis: 1.744 ± 1.11
ValIle: 6.975 ± 2.786
ValLys: 4.359 ± 2.001
ValLeu: 6.103 ± 2.749
ValMet: 2.616 ± 1.384
ValAsn: 1.744 ± 1.143
ValPro: 3.487 ± 0.995
ValGln: 6.103 ± 2.149
ValArg: 3.487 ± 2.219
ValSer: 4.359 ± 1.477
ValThr: 3.487 ± 2.077
ValVal: 1.744 ± 1.143
ValTrp: 0.872 ± 0.613
ValTyr: 5.231 ± 1.959
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 3.487 ± 1.725
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.872 ± 1.079
TrpPhe: 0.0 ± 0.0
TrpGly: 0.872 ± 0.613
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 1.744 ± 1.143
TrpAsn: 0.872 ± 1.079
TrpPro: 0.0 ± 0.0
TrpGln: 0.872 ± 0.613
TrpArg: 1.744 ± 1.531
TrpSer: 1.744 ± 1.531
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.744 ± 0.776
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 5.231 ± 2.767
TyrCys: 0.0 ± 0.0
TyrAsp: 0.872 ± 0.737
TyrGlu: 0.872 ± 0.737
TyrPhe: 3.487 ± 0.985
TyrGly: 0.872 ± 0.613
TyrHis: 0.0 ± 0.0
TyrIle: 0.872 ± 0.613
TyrLys: 0.872 ± 0.613
TyrLeu: 5.231 ± 1.567
TyrMet: 1.744 ± 1.016
TyrAsn: 2.616 ± 0.974
TyrPro: 1.744 ± 1.227
TyrGln: 0.872 ± 0.737
TyrArg: 2.616 ± 1.384
TyrSer: 2.616 ± 1.282
TyrThr: 0.0 ± 0.0
TyrVal: 5.231 ± 1.948
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (1148 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
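A protein-level bootstrap of this kind can be sketched as below. This is a hedged illustration rather than the database's actual procedure: the names `pair_counts` and `bootstrap_error` are hypothetical, the reported error is assumed here to be the standard deviation across replicates, and the toy sequences (one-letter code) are invented for the example.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of a dipeptide's per-mille frequency over bootstrap replicates."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the protein count fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        replicates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is taken as the standard deviation of the replicates.
    return statistics.pstdev(replicates)

# Toy input, not the viral proteome.
toy = ["MSSAAL", "MAALSS", "MKKRRS"]
print(bootstrap_error(toy, "KK"), bootstrap_error(toy, "WW"))
```

A dipeptide that never occurs has zero frequency in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table. With only 6 proteins to resample, the estimated errors are necessarily coarse.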

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski