Amino acid dipepetide frequency for Cucurbit aphid-borne yellows virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.709AlaAla: 5.709 ± 0.59
2.015AlaCys: 2.015 ± 0.91
2.686AlaAsp: 2.686 ± 1.03
5.709AlaGlu: 5.709 ± 0.866
2.015AlaPhe: 2.015 ± 0.459
6.38AlaGly: 6.38 ± 1.142
0.672AlaHis: 0.672 ± 0.492
1.343AlaIle: 1.343 ± 0.599
4.03AlaLys: 4.03 ± 0.793
4.03AlaLeu: 4.03 ± 1.129
1.679AlaMet: 1.679 ± 0.726
1.679AlaAsn: 1.679 ± 0.84
4.365AlaPro: 4.365 ± 1.184
4.701AlaGln: 4.701 ± 0.915
5.373AlaArg: 5.373 ± 1.617
7.052AlaSer: 7.052 ± 1.209
5.709AlaThr: 5.709 ± 0.59
4.03AlaVal: 4.03 ± 1.145
1.343AlaTrp: 1.343 ± 0.355
4.365AlaTyr: 4.365 ± 0.652
0.0AlaXaa: 0.0 ± 0.0
Cys
1.007CysAla: 1.007 ± 0.187
0.672CysCys: 0.672 ± 0.492
1.343CysAsp: 1.343 ± 0.484
0.336CysGlu: 0.336 ± 0.327
1.007CysPhe: 1.007 ± 0.738
1.007CysGly: 1.007 ± 0.536
0.336CysHis: 0.336 ± 0.246
0.336CysIle: 0.336 ± 0.246
2.686CysLys: 2.686 ± 0.407
1.343CysLeu: 1.343 ± 0.511
0.336CysMet: 0.336 ± 0.327
0.0CysAsn: 0.0 ± 0.0
1.007CysPro: 1.007 ± 0.455
0.672CysGln: 0.672 ± 0.588
0.336CysArg: 0.336 ± 0.246
2.686CysSer: 2.686 ± 1.348
0.0CysThr: 0.0 ± 0.0
0.672CysVal: 0.672 ± 0.294
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.365AspAla: 4.365 ± 0.237
1.343AspCys: 1.343 ± 0.559
3.022AspAsp: 3.022 ± 1.696
2.686AspGlu: 2.686 ± 0.89
2.015AspPhe: 2.015 ± 0.47
3.022AspGly: 3.022 ± 0.503
0.336AspHis: 0.336 ± 0.403
1.343AspIle: 1.343 ± 0.589
2.686AspLys: 2.686 ± 0.513
5.709AspLeu: 5.709 ± 1.035
0.672AspMet: 0.672 ± 0.275
2.015AspAsn: 2.015 ± 0.991
1.679AspPro: 1.679 ± 0.378
1.007AspGln: 1.007 ± 0.487
2.686AspArg: 2.686 ± 0.955
3.022AspSer: 3.022 ± 0.913
0.672AspThr: 0.672 ± 0.356
2.015AspVal: 2.015 ± 0.98
1.007AspTrp: 1.007 ± 0.407
1.007AspTyr: 1.007 ± 0.418
0.0AspXaa: 0.0 ± 0.0
Glu
3.694GluAla: 3.694 ± 1.02
0.672GluCys: 0.672 ± 0.275
3.694GluAsp: 3.694 ± 1.317
7.388GluGlu: 7.388 ± 1.806
4.701GluPhe: 4.701 ± 0.473
5.037GluGly: 5.037 ± 0.617
0.336GluHis: 0.336 ± 0.246
3.022GluIle: 3.022 ± 1.19
4.03GluLys: 4.03 ± 0.925
4.701GluLeu: 4.701 ± 0.466
1.343GluMet: 1.343 ± 0.67
1.679GluAsn: 1.679 ± 0.513
2.015GluPro: 2.015 ± 0.439
1.343GluGln: 1.343 ± 0.746
3.694GluArg: 3.694 ± 0.858
4.701GluSer: 4.701 ± 0.623
4.365GluThr: 4.365 ± 0.663
2.351GluVal: 2.351 ± 0.809
1.679GluTrp: 1.679 ± 0.647
2.015GluTyr: 2.015 ± 0.65
0.0GluXaa: 0.0 ± 0.0
Phe
1.343PheAla: 1.343 ± 0.331
0.672PheCys: 0.672 ± 0.492
1.343PheAsp: 1.343 ± 0.614
1.679PheGlu: 1.679 ± 0.462
1.679PhePhe: 1.679 ± 0.736
3.022PheGly: 3.022 ± 0.835
2.015PheHis: 2.015 ± 0.431
1.679PheIle: 1.679 ± 0.448
2.015PheLys: 2.015 ± 0.463
2.351PheLeu: 2.351 ± 1.335
0.336PheMet: 0.336 ± 0.246
3.022PheAsn: 3.022 ± 0.474
0.672PhePro: 0.672 ± 0.374
1.679PheGln: 1.679 ± 0.861
2.015PheArg: 2.015 ± 0.66
5.037PheSer: 5.037 ± 0.817
1.679PheThr: 1.679 ± 0.84
2.686PheVal: 2.686 ± 1.153
0.672PheTrp: 0.672 ± 0.455
1.343PheTyr: 1.343 ± 0.67
0.0PheXaa: 0.0 ± 0.0
Gly
3.022GlyAla: 3.022 ± 0.629
0.336GlyCys: 0.336 ± 0.327
4.03GlyAsp: 4.03 ± 0.82
5.037GlyGlu: 5.037 ± 1.23
1.679GlyPhe: 1.679 ± 1.23
4.701GlyGly: 4.701 ± 0.566
2.015GlyHis: 2.015 ± 0.431
3.022GlyIle: 3.022 ± 1.297
3.022GlyLys: 3.022 ± 0.56
5.037GlyLeu: 5.037 ± 0.887
1.343GlyMet: 1.343 ± 0.942
5.037GlyAsn: 5.037 ± 1.547
3.694GlyPro: 3.694 ± 0.85
1.343GlyGln: 1.343 ± 0.67
4.701GlyArg: 4.701 ± 1.398
8.731GlySer: 8.731 ± 4.145
3.694GlyThr: 3.694 ± 1.016
3.022GlyVal: 3.022 ± 0.911
2.351GlyTrp: 2.351 ± 0.71
3.694GlyTyr: 3.694 ± 0.495
0.0GlyXaa: 0.0 ± 0.0
His
1.679HisAla: 1.679 ± 0.726
1.679HisCys: 1.679 ± 0.465
1.679HisAsp: 1.679 ± 0.448
1.343HisGlu: 1.343 ± 0.706
0.336HisPhe: 0.336 ± 0.246
0.336HisGly: 0.336 ± 0.246
0.336HisHis: 0.336 ± 0.246
1.007HisIle: 1.007 ± 0.377
0.672HisLys: 0.672 ± 0.492
0.336HisLeu: 0.336 ± 0.403
0.336HisMet: 0.336 ± 0.246
0.0HisAsn: 0.0 ± 0.0
1.679HisPro: 1.679 ± 0.462
2.015HisGln: 2.015 ± 0.91
2.351HisArg: 2.351 ± 1.061
2.015HisSer: 2.015 ± 0.898
1.007HisThr: 1.007 ± 0.455
3.022HisVal: 3.022 ± 0.911
0.0HisTrp: 0.0 ± 0.0
0.336HisTyr: 0.336 ± 0.337
0.0HisXaa: 0.0 ± 0.0
Ile
2.686IleAla: 2.686 ± 0.875
0.336IleCys: 0.336 ± 0.246
1.007IleAsp: 1.007 ± 0.617
3.694IleGlu: 3.694 ± 0.916
1.679IlePhe: 1.679 ± 0.736
1.679IleGly: 1.679 ± 0.621
0.672IleHis: 0.672 ± 0.374
1.007IleIle: 1.007 ± 0.617
1.343IleLys: 1.343 ± 0.678
4.03IleLeu: 4.03 ± 1.022
3.022IleMet: 3.022 ± 0.684
3.358IleAsn: 3.358 ± 1.465
4.03IlePro: 4.03 ± 0.759
0.672IleGln: 0.672 ± 0.455
1.007IleArg: 1.007 ± 0.597
6.044IleSer: 6.044 ± 0.918
4.365IleThr: 4.365 ± 1.665
0.672IleVal: 0.672 ± 0.294
0.672IleTrp: 0.672 ± 0.294
1.343IleTyr: 1.343 ± 0.331
0.0IleXaa: 0.0 ± 0.0
Lys
4.701LysAla: 4.701 ± 1.176
0.672LysCys: 0.672 ± 0.588
2.686LysAsp: 2.686 ± 0.466
1.007LysGlu: 1.007 ± 0.377
1.679LysPhe: 1.679 ± 0.698
3.022LysGly: 3.022 ± 1.456
0.672LysHis: 0.672 ± 0.481
4.03LysIle: 4.03 ± 0.991
2.351LysLys: 2.351 ± 0.598
3.358LysLeu: 3.358 ± 0.586
2.351LysMet: 2.351 ± 0.849
1.343LysAsn: 1.343 ± 0.41
2.351LysPro: 2.351 ± 0.371
3.022LysGln: 3.022 ± 0.519
3.358LysArg: 3.358 ± 1.395
4.701LysSer: 4.701 ± 0.734
1.679LysThr: 1.679 ± 0.362
3.022LysVal: 3.022 ± 0.538
1.679LysTrp: 1.679 ± 0.757
1.343LysTyr: 1.343 ± 0.444
0.0LysXaa: 0.0 ± 0.0
Leu
8.059LeuAla: 8.059 ± 2.066
2.351LeuCys: 2.351 ± 0.907
3.022LeuAsp: 3.022 ± 0.606
8.731LeuGlu: 8.731 ± 1.375
2.351LeuPhe: 2.351 ± 0.882
3.022LeuGly: 3.022 ± 0.965
1.343LeuHis: 1.343 ± 0.511
5.373LeuIle: 5.373 ± 1.485
2.686LeuLys: 2.686 ± 1.045
6.716LeuLeu: 6.716 ± 2.154
1.007LeuMet: 1.007 ± 0.464
2.015LeuAsn: 2.015 ± 0.431
3.358LeuPro: 3.358 ± 1.127
4.365LeuGln: 4.365 ± 0.652
4.03LeuArg: 4.03 ± 0.75
8.731LeuSer: 8.731 ± 0.951
5.373LeuThr: 5.373 ± 0.86
3.358LeuVal: 3.358 ± 1.534
3.022LeuTrp: 3.022 ± 0.749
3.022LeuTyr: 3.022 ± 0.589
0.0LeuXaa: 0.0 ± 0.0
Met
3.358MetAla: 3.358 ± 1.105
0.0MetCys: 0.0 ± 0.0
1.343MetAsp: 1.343 ± 0.355
2.015MetGlu: 2.015 ± 0.496
0.672MetPhe: 0.672 ± 0.294
1.679MetGly: 1.679 ± 0.462
0.672MetHis: 0.672 ± 0.294
0.336MetIle: 0.336 ± 0.246
0.336MetLys: 0.336 ± 0.295
2.351MetLeu: 2.351 ± 0.66
0.336MetMet: 0.336 ± 0.403
1.343MetAsn: 1.343 ± 0.877
0.336MetPro: 0.336 ± 0.295
2.015MetGln: 2.015 ± 0.882
0.0MetArg: 0.0 ± 0.0
1.343MetSer: 1.343 ± 0.559
0.0MetThr: 0.0 ± 0.0
2.015MetVal: 2.015 ± 0.459
0.0MetTrp: 0.0 ± 0.0
0.336MetTyr: 0.336 ± 0.327
0.0MetXaa: 0.0 ± 0.0
Asn
2.686AsnAla: 2.686 ± 0.635
0.0AsnCys: 0.0 ± 0.0
0.672AsnAsp: 0.672 ± 0.455
2.015AsnGlu: 2.015 ± 0.507
2.351AsnPhe: 2.351 ± 0.931
7.052AsnGly: 7.052 ± 0.74
0.336AsnHis: 0.336 ± 0.337
1.343AsnIle: 1.343 ± 1.005
3.358AsnLys: 3.358 ± 0.754
5.037AsnLeu: 5.037 ± 0.567
0.336AsnMet: 0.336 ± 0.237
1.007AsnAsn: 1.007 ± 0.407
3.022AsnPro: 3.022 ± 1.424
2.351AsnGln: 2.351 ± 0.915
3.694AsnArg: 3.694 ± 0.875
2.686AsnSer: 2.686 ± 0.506
2.686AsnThr: 2.686 ± 0.704
1.343AsnVal: 1.343 ± 0.66
1.007AsnTrp: 1.007 ± 0.455
2.351AsnTyr: 2.351 ± 0.957
0.0AsnXaa: 0.0 ± 0.0
Pro
3.694ProAla: 3.694 ± 1.025
0.336ProCys: 0.336 ± 0.327
1.679ProAsp: 1.679 ± 1.236
1.343ProGlu: 1.343 ± 0.614
0.672ProPhe: 0.672 ± 0.374
3.694ProGly: 3.694 ± 0.404
1.679ProHis: 1.679 ± 0.618
3.022ProIle: 3.022 ± 0.454
2.351ProLys: 2.351 ± 0.89
3.358ProLeu: 3.358 ± 1.246
0.336ProMet: 0.336 ± 0.246
1.679ProAsn: 1.679 ± 0.566
8.731ProPro: 8.731 ± 2.724
6.044ProGln: 6.044 ± 0.748
4.365ProArg: 4.365 ± 1.981
8.731ProSer: 8.731 ± 1.325
3.358ProThr: 3.358 ± 0.831
1.343ProVal: 1.343 ± 0.67
0.0ProTrp: 0.0 ± 0.0
1.007ProTyr: 1.007 ± 0.455
0.0ProXaa: 0.0 ± 0.0
Gln
4.03GlnAla: 4.03 ± 1.158
0.336GlnCys: 0.336 ± 0.246
1.007GlnAsp: 1.007 ± 0.187
1.343GlnGlu: 1.343 ± 0.559
2.686GlnPhe: 2.686 ± 0.595
2.015GlnGly: 2.015 ± 0.774
1.007GlnHis: 1.007 ± 0.597
1.343GlnIle: 1.343 ± 0.488
3.358GlnLys: 3.358 ± 0.887
2.351GlnLeu: 2.351 ± 0.565
1.007GlnMet: 1.007 ± 0.187
5.373GlnAsn: 5.373 ± 0.883
2.351GlnPro: 2.351 ± 1.067
0.672GlnGln: 0.672 ± 0.673
5.373GlnArg: 5.373 ± 1.751
3.022GlnSer: 3.022 ± 0.569
2.015GlnThr: 2.015 ± 0.893
4.03GlnVal: 4.03 ± 1.085
0.336GlnTrp: 0.336 ± 0.246
1.343GlnTyr: 1.343 ± 0.253
0.0GlnXaa: 0.0 ± 0.0
Arg
5.709ArgAla: 5.709 ± 1.308
1.679ArgCys: 1.679 ± 0.613
3.358ArgAsp: 3.358 ± 0.474
1.679ArgGlu: 1.679 ± 0.462
1.679ArgPhe: 1.679 ± 0.641
7.052ArgGly: 7.052 ± 1.503
1.679ArgHis: 1.679 ± 0.736
2.686ArgIle: 2.686 ± 1.027
1.007ArgLys: 1.007 ± 0.485
6.716ArgLeu: 6.716 ± 2.262
0.672ArgMet: 0.672 ± 0.472
5.037ArgAsn: 5.037 ± 1.616
3.022ArgPro: 3.022 ± 1.729
2.686ArgGln: 2.686 ± 0.407
11.753ArgArg: 11.753 ± 6.366
4.03ArgSer: 4.03 ± 1.857
3.358ArgThr: 3.358 ± 0.888
3.694ArgVal: 3.694 ± 0.87
1.007ArgTrp: 1.007 ± 0.487
1.007ArgTyr: 1.007 ± 0.738
0.0ArgXaa: 0.0 ± 0.0
Ser
4.365SerAla: 4.365 ± 1.518
0.336SerCys: 0.336 ± 0.246
2.015SerAsp: 2.015 ± 0.696
6.716SerGlu: 6.716 ± 0.665
4.701SerPhe: 4.701 ± 0.678
7.052SerGly: 7.052 ± 1.032
2.351SerHis: 2.351 ± 0.907
5.373SerIle: 5.373 ± 0.831
5.037SerLys: 5.037 ± 0.929
8.731SerLeu: 8.731 ± 1.24
1.343SerMet: 1.343 ± 0.509
3.694SerAsn: 3.694 ± 1.042
6.044SerPro: 6.044 ± 2.001
2.351SerGln: 2.351 ± 0.959
6.044SerArg: 6.044 ± 2.026
15.782SerSer: 15.782 ± 3.431
7.723SerThr: 7.723 ± 2.03
6.044SerVal: 6.044 ± 0.841
2.686SerTrp: 2.686 ± 1.178
4.365SerTyr: 4.365 ± 0.606
0.0SerXaa: 0.0 ± 0.0
Thr
6.38ThrAla: 6.38 ± 1.536
0.672ThrCys: 0.672 ± 0.294
2.686ThrAsp: 2.686 ± 1.176
2.015ThrGlu: 2.015 ± 0.896
1.679ThrPhe: 1.679 ± 1.052
3.358ThrGly: 3.358 ± 0.93
1.007ThrHis: 1.007 ± 0.455
3.022ThrIle: 3.022 ± 1.424
1.679ThrLys: 1.679 ± 0.566
4.701ThrLeu: 4.701 ± 1.614
1.343ThrMet: 1.343 ± 0.355
2.015ThrAsn: 2.015 ± 0.61
4.365ThrPro: 4.365 ± 1.129
1.007ThrGln: 1.007 ± 0.536
4.03ThrArg: 4.03 ± 1.103
6.716ThrSer: 6.716 ± 0.926
2.351ThrThr: 2.351 ± 0.897
3.694ThrVal: 3.694 ± 1.132
1.007ThrTrp: 1.007 ± 0.455
1.343ThrTyr: 1.343 ± 0.41
0.0ThrXaa: 0.0 ± 0.0
Val
5.037ValAla: 5.037 ± 1.202
0.672ValCys: 0.672 ± 0.294
2.686ValAsp: 2.686 ± 0.937
2.351ValGlu: 2.351 ± 0.936
1.343ValPhe: 1.343 ± 0.706
3.358ValGly: 3.358 ± 0.898
2.015ValHis: 2.015 ± 0.431
1.679ValIle: 1.679 ± 0.813
2.015ValLys: 2.015 ± 0.47
5.037ValLeu: 5.037 ± 1.262
1.007ValMet: 1.007 ± 0.552
1.679ValAsn: 1.679 ± 0.621
3.694ValPro: 3.694 ± 1.473
5.373ValGln: 5.373 ± 0.747
2.351ValArg: 2.351 ± 1.119
5.037ValSer: 5.037 ± 0.833
0.672ValThr: 0.672 ± 0.294
4.03ValVal: 4.03 ± 1.535
0.672ValTrp: 0.672 ± 0.455
2.015ValTyr: 2.015 ± 0.431
0.0ValXaa: 0.0 ± 0.0
Trp
1.679TrpAla: 1.679 ± 0.462
0.336TrpCys: 0.336 ± 0.337
0.672TrpAsp: 0.672 ± 0.275
2.015TrpGlu: 2.015 ± 0.475
0.336TrpPhe: 0.336 ± 0.327
1.343TrpGly: 1.343 ± 0.865
0.672TrpHis: 0.672 ± 0.588
1.343TrpIle: 1.343 ± 0.589
0.336TrpLys: 0.336 ± 0.327
2.686TrpLeu: 2.686 ± 0.934
0.0TrpMet: 0.0 ± 0.0
0.672TrpAsn: 0.672 ± 0.481
0.672TrpPro: 0.672 ± 0.374
0.0TrpGln: 0.0 ± 0.0
1.679TrpArg: 1.679 ± 0.613
1.679TrpSer: 1.679 ± 1.163
1.679TrpThr: 1.679 ± 0.516
0.672TrpVal: 0.672 ± 0.294
0.0TrpTrp: 0.0 ± 0.0
1.343TrpTyr: 1.343 ± 0.589
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.679TyrAla: 1.679 ± 0.639
0.672TyrCys: 0.672 ± 0.275
1.679TyrAsp: 1.679 ± 0.369
3.358TyrGlu: 3.358 ± 0.51
1.679TyrPhe: 1.679 ± 0.665
1.679TyrGly: 1.679 ± 0.324
2.015TyrHis: 2.015 ± 0.463
1.007TyrIle: 1.007 ± 0.455
4.03TyrLys: 4.03 ± 0.991
3.358TyrLeu: 3.358 ± 0.53
1.343TyrMet: 1.343 ± 0.589
2.686TyrAsn: 2.686 ± 0.749
0.336TyrPro: 0.336 ± 0.327
1.679TyrGln: 1.679 ± 0.367
0.672TyrArg: 0.672 ± 0.492
1.343TyrSer: 1.343 ± 0.77
2.686TyrThr: 2.686 ± 0.663
1.007TyrVal: 1.007 ± 0.377
0.672TyrTrp: 0.672 ± 0.294
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2979 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski