Amino acid dipeptide frequency for Sweet potato chlorotic stunt virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
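As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping pairs of adjacent residues in a set of protein sequences. It assumes one-letter amino acid codes, that pairs never span two proteins, and that frequencies are normalised by the total number of pairs; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the database's exact counting and normalisation are not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping residue pairs.

    Assumption: pairs are counted within each protein only, and the
    result is normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
# The pairs are MK, KK, KL, LA, AK, KK, so KK accounts for 2 of 6 pairs.
```

Multiplying each fraction by 1000 would give values on the same per mille scale as the table.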

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
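The expected value and the over/under-representation call can be sketched as follows. This is only an illustration: it assumes the comparison is made against the uniform expectation of 1/400 (or 1/441 with Xaa), and the function names are hypothetical. The exact threshold used for the red and blue marking is not stated on this page.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Two values from the table: SerLeu (10.664) and AlaTrp (0.374).
ser_leu = classify(10.664)   # above 2.5 per mille
ala_trp = classify(0.374)    # below 2.5 per mille
```

With 20 amino acids the expectation is 2.5‰; with Xaa included it drops to roughly 2.27‰.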

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
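As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Per mille -> fraction: multiply by 10^-3."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Per mille -> percent: divide by 10."""
    return value_per_mille / 10.0


ala_ala = 2.993                               # AlaAla, in per mille
fraction = per_mille_to_fraction(ala_ala)     # roughly 0.002993
percent = per_mille_to_percent(ala_ala)       # roughly 0.2993 %
```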

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.993 ± 1.056
AlaCys: 0.748 ± 0.282
AlaAsp: 2.432 ± 0.84
AlaGlu: 1.871 ± 0.524
AlaPhe: 2.058 ± 0.537
AlaGly: 2.245 ± 0.49
AlaHis: 0.935 ± 0.305
AlaIle: 2.806 ± 0.929
AlaLys: 4.116 ± 0.93
AlaLeu: 4.116 ± 0.433
AlaMet: 1.684 ± 0.639
AlaAsn: 3.742 ± 0.523
AlaPro: 0.935 ± 0.394
AlaGln: 0.561 ± 0.366
AlaArg: 2.619 ± 0.617
AlaSer: 2.619 ± 0.676
AlaThr: 1.684 ± 0.637
AlaVal: 4.116 ± 0.619
AlaTrp: 0.374 ± 0.301
AlaTyr: 0.748 ± 0.324
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.935 ± 0.29
CysCys: 0.374 ± 0.239
CysAsp: 1.871 ± 0.578
CysGlu: 1.684 ± 0.392
CysPhe: 0.561 ± 0.297
CysGly: 0.935 ± 0.428
CysHis: 0.0 ± 0.0
CysIle: 0.374 ± 0.243
CysLys: 2.058 ± 0.551
CysLeu: 2.058 ± 0.386
CysMet: 0.187 ± 0.194
CysAsn: 1.871 ± 0.456
CysPro: 0.561 ± 0.339
CysGln: 0.0 ± 0.0
CysArg: 0.561 ± 0.373
CysSer: 1.31 ± 0.481
CysThr: 1.497 ± 0.348
CysVal: 1.31 ± 0.495
CysTrp: 0.374 ± 0.231
CysTyr: 0.935 ± 0.424
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.245 ± 0.423
AspCys: 1.31 ± 0.42
AspAsp: 4.677 ± 1.226
AspGlu: 4.116 ± 0.699
AspPhe: 6.361 ± 1.015
AspGly: 2.619 ± 0.469
AspHis: 1.123 ± 0.355
AspIle: 3.929 ± 0.605
AspLys: 5.613 ± 1.584
AspLeu: 7.109 ± 1.236
AspMet: 2.245 ± 0.673
AspAsn: 4.116 ± 0.745
AspPro: 0.374 ± 0.231
AspGln: 1.123 ± 0.399
AspArg: 4.303 ± 0.585
AspSer: 5.613 ± 0.838
AspThr: 1.871 ± 0.563
AspVal: 6.922 ± 0.882
AspTrp: 0.374 ± 0.374
AspTyr: 2.245 ± 0.474
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.935 ± 0.298
GluCys: 1.31 ± 0.468
GluAsp: 4.864 ± 0.708
GluGlu: 3.181 ± 0.898
GluPhe: 2.806 ± 0.514
GluGly: 1.871 ± 0.509
GluHis: 0.187 ± 0.115
GluIle: 3.181 ± 0.467
GluLys: 5.426 ± 0.901
GluLeu: 5.987 ± 0.841
GluMet: 0.748 ± 0.282
GluAsn: 2.619 ± 0.642
GluPro: 0.561 ± 0.203
GluGln: 2.058 ± 0.538
GluArg: 2.993 ± 0.464
GluSer: 4.677 ± 1.549
GluThr: 1.871 ± 0.458
GluVal: 3.555 ± 0.701
GluTrp: 0.374 ± 0.442
GluTyr: 3.555 ± 0.652
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.432 ± 0.883
PheCys: 1.684 ± 0.483
PheAsp: 4.677 ± 1.03
PheGlu: 4.303 ± 0.507
PhePhe: 3.555 ± 1.181
PheGly: 3.555 ± 0.472
PheHis: 1.497 ± 0.524
PheIle: 2.432 ± 0.536
PheLys: 3.929 ± 0.906
PheLeu: 5.613 ± 1.553
PheMet: 1.871 ± 0.386
PheAsn: 3.742 ± 1.048
PhePro: 1.497 ± 0.474
PheGln: 1.31 ± 0.422
PheArg: 3.181 ± 0.912
PheSer: 5.987 ± 1.344
PheThr: 2.806 ± 0.653
PheVal: 5.051 ± 0.796
PheTrp: 0.374 ± 0.17
PheTyr: 1.684 ± 0.531
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.245 ± 0.487
GlyCys: 1.31 ± 0.238
GlyAsp: 4.303 ± 0.665
GlyGlu: 2.432 ± 0.455
GlyPhe: 3.368 ± 0.51
GlyGly: 2.993 ± 1.066
GlyHis: 0.374 ± 0.231
GlyIle: 1.684 ± 0.515
GlyLys: 3.555 ± 1.035
GlyLeu: 4.677 ± 0.587
GlyMet: 1.123 ± 0.487
GlyAsn: 1.871 ± 0.354
GlyPro: 0.187 ± 0.115
GlyGln: 0.935 ± 0.45
GlyArg: 1.871 ± 0.427
GlySer: 4.116 ± 0.767
GlyThr: 1.684 ± 0.545
GlyVal: 4.303 ± 0.534
GlyTrp: 1.123 ± 0.692
GlyTyr: 1.497 ± 0.589
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.123 ± 0.234
HisCys: 0.561 ± 0.248
HisAsp: 1.123 ± 0.615
HisGlu: 0.187 ± 0.115
HisPhe: 1.31 ± 0.529
HisGly: 0.561 ± 0.346
HisHis: 0.374 ± 0.589
HisIle: 1.497 ± 0.367
HisLys: 1.123 ± 0.693
HisLeu: 1.497 ± 0.456
HisMet: 1.497 ± 0.895
HisAsn: 1.31 ± 0.609
HisPro: 1.31 ± 0.486
HisGln: 0.187 ± 0.209
HisArg: 1.497 ± 0.424
HisSer: 0.748 ± 0.446
HisThr: 0.935 ± 0.501
HisVal: 0.935 ± 0.521
HisTrp: 0.187 ± 0.295
HisTyr: 0.748 ± 0.282
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.058 ± 0.328
IleCys: 1.684 ± 0.436
IleAsp: 4.864 ± 0.716
IleGlu: 1.31 ± 0.468
IlePhe: 3.555 ± 0.992
IleGly: 2.058 ± 0.849
IleHis: 1.31 ± 0.382
IleIle: 3.555 ± 0.697
IleLys: 3.555 ± 0.717
IleLeu: 4.49 ± 1.323
IleMet: 1.31 ± 0.451
IleAsn: 3.368 ± 0.309
IlePro: 4.116 ± 0.572
IleGln: 0.748 ± 0.328
IleArg: 1.871 ± 1.012
IleSer: 6.548 ± 0.871
IleThr: 2.058 ± 0.328
IleVal: 3.929 ± 0.87
IleTrp: 0.0 ± 0.0
IleTyr: 3.368 ± 0.475
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.742 ± 0.448
LysCys: 1.497 ± 0.628
LysAsp: 4.677 ± 1.239
LysGlu: 2.993 ± 0.402
LysPhe: 5.239 ± 0.909
LysGly: 3.181 ± 0.754
LysHis: 2.058 ± 0.429
LysIle: 5.8 ± 1.491
LysLys: 5.426 ± 0.977
LysLeu: 7.297 ± 0.929
LysMet: 0.935 ± 0.394
LysAsn: 3.742 ± 0.986
LysPro: 3.929 ± 0.904
LysGln: 1.684 ± 0.407
LysArg: 4.116 ± 1.107
LysSer: 5.239 ± 0.536
LysThr: 4.864 ± 0.817
LysVal: 6.361 ± 0.686
LysTrp: 0.187 ± 0.206
LysTyr: 3.368 ± 0.774
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.116 ± 0.99
LeuCys: 2.245 ± 0.543
LeuAsp: 5.613 ± 1.263
LeuGlu: 5.051 ± 0.923
LeuPhe: 6.922 ± 1.041
LeuGly: 5.426 ± 1.233
LeuHis: 1.684 ± 0.502
LeuIle: 4.864 ± 0.917
LeuLys: 8.98 ± 1.011
LeuLeu: 6.922 ± 2.12
LeuMet: 3.181 ± 0.585
LeuAsn: 5.8 ± 1.383
LeuPro: 3.742 ± 0.54
LeuGln: 1.31 ± 0.377
LeuArg: 5.8 ± 1.178
LeuSer: 8.98 ± 0.886
LeuThr: 6.174 ± 1.063
LeuVal: 7.109 ± 0.931
LeuTrp: 1.123 ± 0.562
LeuTyr: 2.993 ± 0.41
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.871 ± 0.472
MetCys: 0.187 ± 0.115
MetAsp: 1.871 ± 0.365
MetGlu: 1.123 ± 0.575
MetPhe: 1.684 ± 0.302
MetGly: 1.31 ± 0.482
MetHis: 0.374 ± 0.186
MetIle: 1.684 ± 0.655
MetLys: 2.432 ± 0.468
MetLeu: 1.497 ± 0.342
MetMet: 0.935 ± 0.528
MetAsn: 1.684 ± 0.625
MetPro: 0.374 ± 0.194
MetGln: 1.497 ± 0.445
MetArg: 2.058 ± 0.349
MetSer: 3.368 ± 0.517
MetThr: 2.432 ± 0.769
MetVal: 0.748 ± 0.699
MetTrp: 0.374 ± 0.154
MetTyr: 0.748 ± 0.291
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.432 ± 0.889
AsnCys: 0.187 ± 0.115
AsnAsp: 2.993 ± 1.107
AsnGlu: 4.864 ± 0.581
AsnPhe: 4.116 ± 0.475
AsnGly: 1.31 ± 0.397
AsnHis: 1.684 ± 0.502
AsnIle: 3.742 ± 0.895
AsnLys: 3.742 ± 0.634
AsnLeu: 6.548 ± 1.464
AsnMet: 1.123 ± 0.269
AsnAsn: 2.993 ± 0.597
AsnPro: 1.684 ± 0.464
AsnGln: 2.058 ± 0.746
AsnArg: 3.929 ± 0.999
AsnSer: 5.051 ± 1.649
AsnThr: 2.806 ± 0.468
AsnVal: 5.987 ± 1.063
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.619 ± 0.47
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.497 ± 0.328
ProCys: 0.187 ± 0.115
ProAsp: 2.432 ± 0.641
ProGlu: 2.245 ± 0.786
ProPhe: 0.935 ± 0.424
ProGly: 1.123 ± 0.195
ProHis: 0.748 ± 0.439
ProIle: 1.497 ± 0.379
ProLys: 2.432 ± 0.627
ProLeu: 3.368 ± 0.787
ProMet: 0.748 ± 0.529
ProAsn: 1.871 ± 0.544
ProPro: 1.31 ± 0.597
ProGln: 1.123 ± 0.329
ProArg: 1.871 ± 0.786
ProSer: 1.684 ± 0.558
ProThr: 1.497 ± 0.337
ProVal: 1.684 ± 0.82
ProTrp: 0.0 ± 0.0
ProTyr: 2.245 ± 0.643
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.31 ± 0.527
GlnCys: 0.187 ± 0.115
GlnAsp: 1.31 ± 0.351
GlnGlu: 1.31 ± 0.456
GlnPhe: 1.31 ± 0.434
GlnGly: 1.497 ± 0.447
GlnHis: 0.748 ± 0.355
GlnIle: 2.058 ± 0.985
GlnLys: 1.497 ± 0.31
GlnLeu: 3.555 ± 0.902
GlnMet: 0.935 ± 0.381
GlnAsn: 0.187 ± 0.269
GlnPro: 0.748 ± 0.212
GlnGln: 0.187 ± 0.269
GlnArg: 1.123 ± 0.376
GlnSer: 1.497 ± 0.66
GlnThr: 1.123 ± 0.404
GlnVal: 2.245 ± 0.482
GlnTrp: 0.374 ± 0.154
GlnTyr: 0.561 ± 0.237
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.619 ± 0.975
ArgCys: 1.31 ± 0.488
ArgAsp: 3.181 ± 0.813
ArgGlu: 2.993 ± 0.757
ArgPhe: 3.368 ± 0.9
ArgGly: 1.684 ± 0.39
ArgHis: 1.123 ± 0.349
ArgIle: 3.555 ± 0.791
ArgLys: 2.806 ± 0.662
ArgLeu: 6.735 ± 1.182
ArgMet: 2.058 ± 0.466
ArgAsn: 2.993 ± 0.721
ArgPro: 0.935 ± 0.397
ArgGln: 1.684 ± 0.468
ArgArg: 2.806 ± 0.597
ArgSer: 4.116 ± 0.941
ArgThr: 3.181 ± 0.595
ArgVal: 3.929 ± 1.177
ArgTrp: 0.561 ± 0.203
ArgTyr: 0.748 ± 0.272
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.116 ± 0.975
SerCys: 1.497 ± 0.494
SerAsp: 4.864 ± 1.356
SerGlu: 4.677 ± 0.697
SerPhe: 4.49 ± 0.62
SerGly: 4.677 ± 0.587
SerHis: 1.684 ± 0.719
SerIle: 5.239 ± 1.746
SerLys: 8.419 ± 0.746
SerLeu: 10.664 ± 1.512
SerMet: 2.245 ± 0.559
SerAsn: 4.677 ± 0.96
SerPro: 2.432 ± 0.587
SerGln: 2.058 ± 0.484
SerArg: 2.993 ± 0.896
SerSer: 5.8 ± 1.577
SerThr: 4.677 ± 1.064
SerVal: 5.987 ± 0.598
SerTrp: 0.561 ± 0.45
SerTyr: 3.368 ± 0.981
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.871 ± 0.67
ThrCys: 0.748 ± 0.201
ThrAsp: 2.432 ± 0.887
ThrGlu: 1.684 ± 0.534
ThrPhe: 4.116 ± 0.352
ThrGly: 3.181 ± 0.917
ThrHis: 0.187 ± 0.206
ThrIle: 3.555 ± 0.74
ThrLys: 3.929 ± 1.169
ThrLeu: 5.426 ± 0.828
ThrMet: 1.871 ± 0.544
ThrAsn: 2.993 ± 0.856
ThrPro: 1.497 ± 0.635
ThrGln: 2.619 ± 0.712
ThrArg: 2.058 ± 0.512
ThrSer: 4.303 ± 0.695
ThrThr: 4.116 ± 0.845
ThrVal: 2.993 ± 0.764
ThrTrp: 0.561 ± 0.346
ThrTyr: 2.245 ± 0.646
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.245 ± 0.865
ValCys: 1.31 ± 0.315
ValAsp: 7.109 ± 1.061
ValGlu: 4.116 ± 0.814
ValPhe: 2.993 ± 0.676
ValGly: 4.116 ± 0.752
ValHis: 1.497 ± 0.556
ValIle: 2.432 ± 0.478
ValLys: 4.49 ± 0.593
ValLeu: 5.987 ± 0.726
ValMet: 2.432 ± 0.725
ValAsn: 8.232 ± 0.908
ValPro: 2.619 ± 0.838
ValGln: 1.497 ± 0.338
ValArg: 3.555 ± 0.58
ValSer: 8.045 ± 0.928
ValThr: 4.677 ± 0.575
ValVal: 7.858 ± 1.7
ValTrp: 0.374 ± 0.28
ValTyr: 4.303 ± 0.625
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.935 ± 0.809
TrpCys: 0.0 ± 0.0
TrpAsp: 0.187 ± 0.269
TrpGlu: 0.561 ± 0.297
TrpPhe: 0.187 ± 0.183
TrpGly: 0.0 ± 0.0
TrpHis: 0.374 ± 0.194
TrpIle: 0.561 ± 0.236
TrpLys: 0.187 ± 0.183
TrpLeu: 1.31 ± 0.603
TrpMet: 0.187 ± 0.309
TrpAsn: 0.0 ± 0.0
TrpPro: 0.187 ± 0.209
TrpGln: 0.187 ± 0.115
TrpArg: 0.748 ± 0.224
TrpSer: 0.748 ± 0.415
TrpThr: 0.187 ± 0.183
TrpVal: 0.561 ± 0.405
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.561 ± 0.323
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.684 ± 0.675
TyrCys: 1.123 ± 0.39
TyrAsp: 2.806 ± 0.972
TyrGlu: 2.058 ± 0.66
TyrPhe: 2.245 ± 0.471
TyrGly: 1.31 ± 0.42
TyrHis: 0.748 ± 0.394
TyrIle: 1.497 ± 0.31
TyrLys: 2.619 ± 0.455
TyrLeu: 3.181 ± 0.684
TyrMet: 0.748 ± 0.366
TyrAsn: 2.058 ± 0.414
TyrPro: 1.497 ± 0.399
TyrGln: 0.935 ± 0.369
TyrArg: 2.432 ± 0.82
TyrSer: 4.677 ± 0.646
TyrThr: 2.245 ± 0.823
TyrVal: 4.303 ± 0.478
TyrTrp: 0.374 ± 0.359
TyrTyr: 2.058 ± 0.553
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (5346 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
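A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the database's actual procedure: it assumes whole proteins are resampled with replacement 100 times, that the statistic is recomputed on each resample, and that the reported error is the standard deviation of those estimates. The function names, the one-letter sequence encoding, and the toy sequences are all hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    count = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the statistic over protein-level resamples."""
    if not sequences:
        raise ValueError("need at least one protein sequence")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences in one-letter code).
toy = ["MKKLA", "AKKKV", "MLSLA"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")  # WW never occurs in the toy data
```

A dipeptide that never occurs has the same estimate (zero) in every resample, so its error is zero, which matches the `0.0 ± 0.0` entries in the table.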

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski