Amino acid dipepetide frequency for Soil-borne cereal mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.623AlaAla: 5.623 ± 0.637
0.649AlaCys: 0.649 ± 0.285
3.893AlaAsp: 3.893 ± 0.997
4.325AlaGlu: 4.325 ± 0.865
2.379AlaPhe: 2.379 ± 0.527
2.379AlaGly: 2.379 ± 0.349
1.514AlaHis: 1.514 ± 0.312
2.811AlaIle: 2.811 ± 0.394
5.19AlaLys: 5.19 ± 0.801
8.002AlaLeu: 8.002 ± 1.16
1.514AlaMet: 1.514 ± 0.425
3.028AlaAsn: 3.028 ± 0.811
2.595AlaPro: 2.595 ± 0.543
2.379AlaGln: 2.379 ± 0.662
2.379AlaArg: 2.379 ± 0.4
4.758AlaSer: 4.758 ± 0.587
5.623AlaThr: 5.623 ± 0.493
5.839AlaVal: 5.839 ± 0.891
1.081AlaTrp: 1.081 ± 0.354
0.865AlaTyr: 0.865 ± 0.155
0.0AlaXaa: 0.0 ± 0.0
Cys
1.298CysAla: 1.298 ± 0.269
0.865CysCys: 0.865 ± 0.322
2.163CysAsp: 2.163 ± 0.452
1.73CysGlu: 1.73 ± 0.644
1.081CysPhe: 1.081 ± 0.429
1.73CysGly: 1.73 ± 0.847
0.433CysHis: 0.433 ± 0.161
0.649CysIle: 0.649 ± 0.267
1.73CysLys: 1.73 ± 0.805
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.433CysAsn: 0.433 ± 0.161
0.433CysPro: 0.433 ± 0.161
1.081CysGln: 1.081 ± 0.415
0.433CysArg: 0.433 ± 0.306
0.865CysSer: 0.865 ± 0.322
0.433CysThr: 0.433 ± 0.161
1.514CysVal: 1.514 ± 0.562
0.0CysTrp: 0.0 ± 0.0
0.649CysTyr: 0.649 ± 0.686
0.0CysXaa: 0.0 ± 0.0
Asp
4.758AspAla: 4.758 ± 1.055
1.081AspCys: 1.081 ± 0.282
4.758AspAsp: 4.758 ± 0.974
5.623AspGlu: 5.623 ± 0.177
3.676AspPhe: 3.676 ± 0.651
4.325AspGly: 4.325 ± 0.902
2.163AspHis: 2.163 ± 0.804
3.893AspIle: 3.893 ± 0.535
4.109AspLys: 4.109 ± 0.747
7.785AspLeu: 7.785 ± 0.681
3.028AspMet: 3.028 ± 0.361
1.514AspAsn: 1.514 ± 0.569
0.865AspPro: 0.865 ± 0.602
1.081AspGln: 1.081 ± 0.935
2.811AspArg: 2.811 ± 0.623
4.758AspSer: 4.758 ± 0.728
3.244AspThr: 3.244 ± 0.683
6.055AspVal: 6.055 ± 0.937
0.0AspTrp: 0.0 ± 0.0
1.73AspTyr: 1.73 ± 0.383
0.0AspXaa: 0.0 ± 0.0
Glu
4.325GluAla: 4.325 ± 1.069
0.865GluCys: 0.865 ± 0.384
6.055GluAsp: 6.055 ± 0.901
5.623GluGlu: 5.623 ± 0.481
2.811GluPhe: 2.811 ± 0.433
4.109GluGly: 4.109 ± 1.986
0.0GluHis: 0.0 ± 0.0
4.758GluIle: 4.758 ± 0.286
7.137GluLys: 7.137 ± 1.783
7.137GluLeu: 7.137 ± 1.175
0.433GluMet: 0.433 ± 0.564
4.325GluAsn: 4.325 ± 1.206
0.433GluPro: 0.433 ± 0.371
1.73GluGln: 1.73 ± 0.644
4.974GluArg: 4.974 ± 0.574
5.839GluSer: 5.839 ± 0.832
3.244GluThr: 3.244 ± 0.325
5.623GluVal: 5.623 ± 0.647
0.649GluTrp: 0.649 ± 0.42
1.514GluTyr: 1.514 ± 0.666
0.0GluXaa: 0.0 ± 0.0
Phe
2.379PheAla: 2.379 ± 0.403
1.514PheCys: 1.514 ± 0.279
2.811PheAsp: 2.811 ± 0.565
3.893PheGlu: 3.893 ± 0.48
1.73PhePhe: 1.73 ± 0.448
3.244PheGly: 3.244 ± 0.635
1.298PheHis: 1.298 ± 0.269
2.163PheIle: 2.163 ± 1.103
2.163PheLys: 2.163 ± 0.469
3.893PheLeu: 3.893 ± 0.618
1.081PheMet: 1.081 ± 0.433
1.514PheAsn: 1.514 ± 0.324
0.865PhePro: 0.865 ± 0.155
0.865PheGln: 0.865 ± 0.365
1.514PheArg: 1.514 ± 0.257
4.325PheSer: 4.325 ± 0.5
1.73PheThr: 1.73 ± 0.315
3.46PheVal: 3.46 ± 0.281
0.216PheTrp: 0.216 ± 0.244
0.649PheTyr: 0.649 ± 0.267
0.0PheXaa: 0.0 ± 0.0
Gly
2.163GlyAla: 2.163 ± 0.485
0.649GlyCys: 0.649 ± 0.142
3.893GlyAsp: 3.893 ± 0.265
4.758GlyGlu: 4.758 ± 2.307
2.163GlyPhe: 2.163 ± 0.804
4.758GlyGly: 4.758 ± 1.228
0.865GlyHis: 0.865 ± 0.341
1.298GlyIle: 1.298 ± 0.354
3.893GlyLys: 3.893 ± 0.917
0.865GlyLeu: 0.865 ± 0.405
1.73GlyMet: 1.73 ± 0.368
4.109GlyAsn: 4.109 ± 0.554
2.163GlyPro: 2.163 ± 0.308
1.081GlyGln: 1.081 ± 0.551
4.542GlyArg: 4.542 ± 0.983
3.893GlySer: 3.893 ± 0.602
5.19GlyThr: 5.19 ± 1.582
4.109GlyVal: 4.109 ± 2.008
0.649GlyTrp: 0.649 ± 0.562
2.379GlyTyr: 2.379 ± 1.161
0.0GlyXaa: 0.0 ± 0.0
His
2.595HisAla: 2.595 ± 0.566
0.865HisCys: 0.865 ± 0.322
0.433HisAsp: 0.433 ± 0.161
1.081HisGlu: 1.081 ± 0.415
1.514HisPhe: 1.514 ± 0.569
0.865HisGly: 0.865 ± 0.322
0.0HisHis: 0.0 ± 0.0
1.081HisIle: 1.081 ± 0.415
1.514HisLys: 1.514 ± 0.285
1.081HisLeu: 1.081 ± 0.433
0.0HisMet: 0.0 ± 0.0
1.298HisAsn: 1.298 ± 0.534
1.081HisPro: 1.081 ± 0.736
0.0HisGln: 0.0 ± 0.0
0.865HisArg: 0.865 ± 0.322
1.73HisSer: 1.73 ± 0.622
2.163HisThr: 2.163 ± 1.35
1.946HisVal: 1.946 ± 0.44
0.0HisTrp: 0.0 ± 0.0
0.865HisTyr: 0.865 ± 0.539
0.0HisXaa: 0.0 ± 0.0
Ile
1.73IleAla: 1.73 ± 0.263
0.433IleCys: 0.433 ± 0.201
4.542IleAsp: 4.542 ± 1.218
4.325IleGlu: 4.325 ± 0.323
1.73IlePhe: 1.73 ± 0.432
1.946IleGly: 1.946 ± 0.333
1.946IleHis: 1.946 ± 0.919
2.811IleIle: 2.811 ± 0.394
2.163IleLys: 2.163 ± 0.255
3.244IleLeu: 3.244 ± 0.453
0.433IleMet: 0.433 ± 0.161
1.514IleAsn: 1.514 ± 0.404
3.028IlePro: 3.028 ± 1.019
0.865IleGln: 0.865 ± 0.305
2.595IleArg: 2.595 ± 0.837
4.325IleSer: 4.325 ± 0.601
4.325IleThr: 4.325 ± 0.827
4.542IleVal: 4.542 ± 0.78
0.0IleTrp: 0.0 ± 0.0
1.73IleTyr: 1.73 ± 0.284
0.0IleXaa: 0.0 ± 0.0
Lys
5.839LysAla: 5.839 ± 0.344
1.298LysCys: 1.298 ± 0.57
2.811LysAsp: 2.811 ± 0.191
4.325LysGlu: 4.325 ± 0.879
3.244LysPhe: 3.244 ± 0.635
2.811LysGly: 2.811 ± 1.096
1.081LysHis: 1.081 ± 0.354
4.109LysIle: 4.109 ± 0.47
4.325LysLys: 4.325 ± 1.108
8.867LysLeu: 8.867 ± 0.771
1.081LysMet: 1.081 ± 0.882
3.028LysAsn: 3.028 ± 0.398
1.73LysPro: 1.73 ± 0.679
2.163LysGln: 2.163 ± 1.086
6.488LysArg: 6.488 ± 0.663
4.109LysSer: 4.109 ± 0.636
4.325LysThr: 4.325 ± 0.951
4.758LysVal: 4.758 ± 0.622
1.081LysTrp: 1.081 ± 0.551
4.325LysTyr: 4.325 ± 1.358
0.216LysXaa: 0.216 ± 0.15
Leu
4.974LeuAla: 4.974 ± 0.532
1.298LeuCys: 1.298 ± 0.534
3.893LeuAsp: 3.893 ± 0.777
4.542LeuGlu: 4.542 ± 0.832
3.028LeuPhe: 3.028 ± 1.142
5.623LeuGly: 5.623 ± 2.72
1.514LeuHis: 1.514 ± 0.363
4.325LeuIle: 4.325 ± 0.568
6.704LeuLys: 6.704 ± 0.784
9.948LeuLeu: 9.948 ± 1.538
3.676LeuMet: 3.676 ± 0.625
4.542LeuAsn: 4.542 ± 1.077
3.676LeuPro: 3.676 ± 0.764
5.407LeuGln: 5.407 ± 0.981
6.488LeuArg: 6.488 ± 1.238
6.055LeuSer: 6.055 ± 1.321
6.92LeuThr: 6.92 ± 0.706
2.163LeuVal: 2.163 ± 0.255
2.163LeuTrp: 2.163 ± 0.376
2.595LeuTyr: 2.595 ± 0.489
0.0LeuXaa: 0.0 ± 0.0
Met
2.811MetAla: 2.811 ± 0.913
0.649MetCys: 0.649 ± 0.281
1.73MetAsp: 1.73 ± 0.414
2.811MetGlu: 2.811 ± 0.423
1.298MetPhe: 1.298 ± 0.244
1.081MetGly: 1.081 ± 0.351
1.081MetHis: 1.081 ± 0.36
0.865MetIle: 0.865 ± 0.404
2.811MetLys: 2.811 ± 0.439
2.379MetLeu: 2.379 ± 1.67
0.649MetMet: 0.649 ± 0.281
1.73MetAsn: 1.73 ± 0.55
1.081MetPro: 1.081 ± 0.426
0.865MetGln: 0.865 ± 0.732
0.865MetArg: 0.865 ± 0.322
1.298MetSer: 1.298 ± 0.566
1.298MetThr: 1.298 ± 0.313
1.514MetVal: 1.514 ± 0.569
0.433MetTrp: 0.433 ± 0.161
0.649MetTyr: 0.649 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
3.676AsnAla: 3.676 ± 0.802
1.081AsnCys: 1.081 ± 0.426
3.893AsnAsp: 3.893 ± 1.074
2.811AsnGlu: 2.811 ± 0.43
2.595AsnPhe: 2.595 ± 0.381
3.244AsnGly: 3.244 ± 0.464
0.216AsnHis: 0.216 ± 0.282
0.649AsnIle: 0.649 ± 0.547
3.244AsnLys: 3.244 ± 0.624
3.893AsnLeu: 3.893 ± 0.8
0.865AsnMet: 0.865 ± 0.47
0.649AsnAsn: 0.649 ± 0.281
0.865AsnPro: 0.865 ± 0.322
1.298AsnGln: 1.298 ± 0.313
3.028AsnArg: 3.028 ± 0.663
2.379AsnSer: 2.379 ± 0.564
3.676AsnThr: 3.676 ± 0.931
3.893AsnVal: 3.893 ± 1.203
0.865AsnTrp: 0.865 ± 0.322
1.298AsnTyr: 1.298 ± 0.321
0.0AsnXaa: 0.0 ± 0.0
Pro
0.865ProAla: 0.865 ± 0.63
0.216ProCys: 0.216 ± 0.15
1.298ProAsp: 1.298 ± 0.537
1.946ProGlu: 1.946 ± 0.659
1.298ProPhe: 1.298 ± 0.761
1.298ProGly: 1.298 ± 0.483
0.649ProHis: 0.649 ± 0.358
1.298ProIle: 1.298 ± 0.483
3.244ProLys: 3.244 ± 0.587
1.73ProLeu: 1.73 ± 0.816
2.163ProMet: 2.163 ± 0.535
0.649ProAsn: 0.649 ± 0.281
0.649ProPro: 0.649 ± 0.267
0.865ProGln: 0.865 ± 0.479
1.946ProArg: 1.946 ± 0.556
1.514ProSer: 1.514 ± 0.498
1.946ProThr: 1.946 ± 0.382
3.244ProVal: 3.244 ± 1.071
0.0ProTrp: 0.0 ± 0.0
1.081ProTyr: 1.081 ± 0.36
0.0ProXaa: 0.0 ± 0.0
Gln
1.298GlnAla: 1.298 ± 0.761
0.649GlnCys: 0.649 ± 0.267
2.379GlnAsp: 2.379 ± 1.038
2.163GlnGlu: 2.163 ± 0.732
1.081GlnPhe: 1.081 ± 1.127
3.028GlnGly: 3.028 ± 0.295
0.0GlnHis: 0.0 ± 0.0
2.811GlnIle: 2.811 ± 0.724
4.542GlnLys: 4.542 ± 1.195
3.46GlnLeu: 3.46 ± 0.705
1.298GlnMet: 1.298 ± 0.585
0.865GlnAsn: 0.865 ± 0.155
0.865GlnPro: 0.865 ± 0.322
1.514GlnGln: 1.514 ± 0.52
2.163GlnArg: 2.163 ± 0.744
1.946GlnSer: 1.946 ± 0.899
2.595GlnThr: 2.595 ± 1.732
0.0GlnVal: 0.0 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
0.433GlnTyr: 0.433 ± 0.161
0.0GlnXaa: 0.0 ± 0.0
Arg
4.758ArgAla: 4.758 ± 0.625
1.081ArgCys: 1.081 ± 0.182
4.542ArgAsp: 4.542 ± 0.745
3.46ArgGlu: 3.46 ± 0.853
1.73ArgPhe: 1.73 ± 0.685
3.244ArgGly: 3.244 ± 0.364
3.028ArgHis: 3.028 ± 0.561
3.46ArgIle: 3.46 ± 0.532
3.46ArgLys: 3.46 ± 0.821
5.623ArgLeu: 5.623 ± 1.322
1.73ArgMet: 1.73 ± 0.324
3.676ArgAsn: 3.676 ± 1.183
0.649ArgPro: 0.649 ± 0.142
1.514ArgGln: 1.514 ± 0.324
4.109ArgArg: 4.109 ± 1.399
4.542ArgSer: 4.542 ± 0.822
3.46ArgThr: 3.46 ± 1.271
4.325ArgVal: 4.325 ± 1.03
1.081ArgTrp: 1.081 ± 0.309
1.081ArgTyr: 1.081 ± 0.897
0.0ArgXaa: 0.0 ± 0.0
Ser
3.893SerAla: 3.893 ± 0.405
0.649SerCys: 0.649 ± 0.476
3.893SerAsp: 3.893 ± 0.617
4.542SerGlu: 4.542 ± 0.548
3.676SerPhe: 3.676 ± 0.664
3.893SerGly: 3.893 ± 0.677
0.649SerHis: 0.649 ± 0.451
4.758SerIle: 4.758 ± 0.453
4.542SerLys: 4.542 ± 0.8
6.055SerLeu: 6.055 ± 0.976
2.379SerMet: 2.379 ± 0.634
2.595SerAsn: 2.595 ± 0.394
1.081SerPro: 1.081 ± 0.357
3.676SerGln: 3.676 ± 0.694
3.676SerArg: 3.676 ± 0.443
4.542SerSer: 4.542 ± 0.736
4.325SerThr: 4.325 ± 0.765
5.407SerVal: 5.407 ± 2.512
0.865SerTrp: 0.865 ± 0.322
3.46SerTyr: 3.46 ± 0.821
0.216SerXaa: 0.216 ± 0.244
Thr
4.758ThrAla: 4.758 ± 0.884
0.649ThrCys: 0.649 ± 0.281
5.19ThrAsp: 5.19 ± 0.692
6.704ThrGlu: 6.704 ± 1.077
3.244ThrPhe: 3.244 ± 0.497
1.514ThrGly: 1.514 ± 0.435
1.73ThrHis: 1.73 ± 0.769
1.73ThrIle: 1.73 ± 0.311
4.974ThrLys: 4.974 ± 0.598
5.839ThrLeu: 5.839 ± 0.7
1.298ThrMet: 1.298 ± 0.244
3.244ThrAsn: 3.244 ± 0.782
1.514ThrPro: 1.514 ± 0.487
2.811ThrGln: 2.811 ± 0.838
3.676ThrArg: 3.676 ± 2.168
4.109ThrSer: 4.109 ± 0.599
6.92ThrThr: 6.92 ± 0.825
4.758ThrVal: 4.758 ± 0.69
0.865ThrTrp: 0.865 ± 0.322
2.595ThrTyr: 2.595 ± 0.691
0.0ThrXaa: 0.0 ± 0.0
Val
4.758ValAla: 4.758 ± 0.822
0.865ValCys: 0.865 ± 0.33
5.19ValAsp: 5.19 ± 0.76
4.974ValGlu: 4.974 ± 0.628
1.946ValPhe: 1.946 ± 0.427
3.244ValGly: 3.244 ± 0.455
1.946ValHis: 1.946 ± 0.727
3.46ValIle: 3.46 ± 0.708
4.325ValLys: 4.325 ± 0.515
4.325ValLeu: 4.325 ± 0.711
3.46ValMet: 3.46 ± 1.015
3.46ValAsn: 3.46 ± 0.685
3.028ValPro: 3.028 ± 1.022
2.163ValGln: 2.163 ± 0.529
5.623ValArg: 5.623 ± 0.719
5.623ValSer: 5.623 ± 1.271
3.028ValThr: 3.028 ± 1.427
6.488ValVal: 6.488 ± 0.836
1.514ValTrp: 1.514 ± 0.569
3.46ValTyr: 3.46 ± 0.785
0.0ValXaa: 0.0 ± 0.0
Trp
1.73TrpAla: 1.73 ± 0.644
0.865TrpCys: 0.865 ± 0.322
0.433TrpAsp: 0.433 ± 0.161
1.081TrpGlu: 1.081 ± 0.549
0.216TrpPhe: 0.216 ± 0.244
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.081TrpIle: 1.081 ± 0.182
0.433TrpLys: 0.433 ± 0.161
0.865TrpLeu: 0.865 ± 0.974
0.649TrpMet: 0.649 ± 0.254
0.649TrpAsn: 0.649 ± 0.267
0.0TrpPro: 0.0 ± 0.0
0.649TrpGln: 0.649 ± 0.429
0.216TrpArg: 0.216 ± 0.298
0.433TrpSer: 0.433 ± 0.201
1.081TrpThr: 1.081 ± 0.814
0.433TrpVal: 0.433 ± 0.161
0.216TrpTrp: 0.216 ± 0.15
0.865TrpTyr: 0.865 ± 0.322
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.595TyrAla: 2.595 ± 0.566
1.298TyrCys: 1.298 ± 0.419
3.46TyrAsp: 3.46 ± 1.101
0.865TyrGlu: 0.865 ± 0.403
0.865TyrPhe: 0.865 ± 0.365
2.811TyrGly: 2.811 ± 0.41
0.865TyrHis: 0.865 ± 0.365
0.216TyrIle: 0.216 ± 0.15
1.73TyrLys: 1.73 ± 0.324
4.542TyrLeu: 4.542 ± 1.145
0.216TyrMet: 0.216 ± 0.15
1.298TyrAsn: 1.298 ± 0.782
1.298TyrPro: 1.298 ± 0.244
1.081TyrGln: 1.081 ± 0.415
1.73TyrArg: 1.73 ± 0.556
1.73TyrSer: 1.73 ± 0.762
2.595TyrThr: 2.595 ± 0.791
2.811TyrVal: 2.811 ± 0.497
0.216TyrTrp: 0.216 ± 0.282
1.73TyrTyr: 1.73 ± 0.644
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.433XaaArg: 0.433 ± 0.201
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4625 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski