Amino acid dipepetide frequency for Beet soil-borne mosaic virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.22AlaAla: 6.22 ± 1.893
1.675AlaCys: 1.675 ± 0.502
5.981AlaAsp: 5.981 ± 0.93
2.153AlaGlu: 2.153 ± 0.656
4.545AlaPhe: 4.545 ± 1.29
4.067AlaGly: 4.067 ± 1.589
1.675AlaHis: 1.675 ± 0.776
4.545AlaIle: 4.545 ± 1.366
1.914AlaLys: 1.914 ± 0.703
5.981AlaLeu: 5.981 ± 1.588
1.435AlaMet: 1.435 ± 0.876
3.828AlaAsn: 3.828 ± 0.796
3.589AlaPro: 3.589 ± 0.553
2.871AlaGln: 2.871 ± 0.788
4.306AlaArg: 4.306 ± 0.83
5.502AlaSer: 5.502 ± 1.205
4.545AlaThr: 4.545 ± 1.004
4.306AlaVal: 4.306 ± 1.252
0.957AlaTrp: 0.957 ± 0.584
1.675AlaTyr: 1.675 ± 0.74
0.0AlaXaa: 0.0 ± 0.0
Cys
0.718CysAla: 0.718 ± 0.45
0.478CysCys: 0.478 ± 0.388
0.957CysAsp: 0.957 ± 0.638
0.957CysGlu: 0.957 ± 0.609
0.957CysPhe: 0.957 ± 0.498
2.153CysGly: 2.153 ± 1.015
0.239CysHis: 0.239 ± 0.146
0.478CysIle: 0.478 ± 0.331
0.239CysLys: 0.239 ± 0.265
2.153CysLeu: 2.153 ± 0.833
0.718CysMet: 0.718 ± 0.36
1.675CysAsn: 1.675 ± 0.976
0.0CysPro: 0.0 ± 0.0
0.957CysGln: 0.957 ± 0.483
1.675CysArg: 1.675 ± 1.333
1.675CysSer: 1.675 ± 0.981
0.239CysThr: 0.239 ± 0.307
1.435CysVal: 1.435 ± 0.671
0.478CysTrp: 0.478 ± 0.589
1.435CysTyr: 1.435 ± 0.461
0.0CysXaa: 0.0 ± 0.0
Asp
4.306AspAla: 4.306 ± 0.66
0.718AspCys: 0.718 ± 0.45
5.742AspAsp: 5.742 ± 2.715
4.785AspGlu: 4.785 ± 1.141
3.349AspPhe: 3.349 ± 0.714
5.502AspGly: 5.502 ± 1.961
0.478AspHis: 0.478 ± 0.527
3.828AspIle: 3.828 ± 0.984
3.11AspLys: 3.11 ± 0.828
5.502AspLeu: 5.502 ± 0.817
1.435AspMet: 1.435 ± 0.631
3.11AspAsn: 3.11 ± 0.62
2.153AspPro: 2.153 ± 0.629
0.718AspGln: 0.718 ± 0.278
2.632AspArg: 2.632 ± 1.059
4.785AspSer: 4.785 ± 0.907
2.871AspThr: 2.871 ± 1.062
6.938AspVal: 6.938 ± 1.225
2.392AspTrp: 2.392 ± 0.981
2.153AspTyr: 2.153 ± 0.824
0.0AspXaa: 0.0 ± 0.0
Glu
3.11GluAla: 3.11 ± 1.26
1.675GluCys: 1.675 ± 0.852
3.349GluAsp: 3.349 ± 0.638
2.392GluGlu: 2.392 ± 0.716
3.349GluPhe: 3.349 ± 1.008
2.392GluGly: 2.392 ± 0.489
1.435GluHis: 1.435 ± 0.497
0.718GluIle: 0.718 ± 0.761
3.828GluLys: 3.828 ± 1.103
5.263GluLeu: 5.263 ± 1.412
0.478GluMet: 0.478 ± 0.305
0.957GluAsn: 0.957 ± 0.713
0.957GluPro: 0.957 ± 0.483
1.435GluGln: 1.435 ± 0.547
3.349GluArg: 3.349 ± 0.684
3.11GluSer: 3.11 ± 1.056
2.392GluThr: 2.392 ± 0.346
5.742GluVal: 5.742 ± 0.696
1.435GluTrp: 1.435 ± 1.192
1.196GluTyr: 1.196 ± 0.73
0.0GluXaa: 0.0 ± 0.0
Phe
3.11PheAla: 3.11 ± 0.58
1.914PheCys: 1.914 ± 0.99
2.392PheAsp: 2.392 ± 0.448
2.871PheGlu: 2.871 ± 0.738
1.435PhePhe: 1.435 ± 0.539
3.349PheGly: 3.349 ± 0.603
1.675PheHis: 1.675 ± 0.685
1.196PheIle: 1.196 ± 0.702
2.871PheLys: 2.871 ± 1.001
2.871PheLeu: 2.871 ± 1.434
1.675PheMet: 1.675 ± 0.819
1.914PheAsn: 1.914 ± 1.167
2.153PhePro: 2.153 ± 0.537
0.718PheGln: 0.718 ± 0.366
2.871PheArg: 2.871 ± 1.196
5.742PheSer: 5.742 ± 1.772
2.153PheThr: 2.153 ± 0.528
5.742PheVal: 5.742 ± 1.94
0.478PheTrp: 0.478 ± 0.507
0.957PheTyr: 0.957 ± 0.38
0.0PheXaa: 0.0 ± 0.0
Gly
5.263GlyAla: 5.263 ± 1.32
1.914GlyCys: 1.914 ± 1.163
4.067GlyAsp: 4.067 ± 1.108
2.871GlyGlu: 2.871 ± 0.785
2.392GlyPhe: 2.392 ± 0.852
6.459GlyGly: 6.459 ± 2.698
1.196GlyHis: 1.196 ± 0.587
4.067GlyIle: 4.067 ± 0.934
4.067GlyLys: 4.067 ± 1.16
3.349GlyLeu: 3.349 ± 1.233
2.153GlyMet: 2.153 ± 0.818
2.153GlyAsn: 2.153 ± 0.737
2.392GlyPro: 2.392 ± 1.053
1.675GlyGln: 1.675 ± 0.617
2.153GlyArg: 2.153 ± 0.836
5.502GlySer: 5.502 ± 1.905
3.828GlyThr: 3.828 ± 0.818
8.612GlyVal: 8.612 ± 2.386
0.957GlyTrp: 0.957 ± 0.324
3.11GlyTyr: 3.11 ± 0.67
0.0GlyXaa: 0.0 ± 0.0
His
1.196HisAla: 1.196 ± 0.311
0.239HisCys: 0.239 ± 0.307
0.957HisAsp: 0.957 ± 0.38
0.718HisGlu: 0.718 ± 0.376
1.196HisPhe: 1.196 ± 0.601
0.718HisGly: 0.718 ± 0.366
0.239HisHis: 0.239 ± 0.146
1.196HisIle: 1.196 ± 0.415
1.435HisLys: 1.435 ± 0.521
1.914HisLeu: 1.914 ± 0.96
1.435HisMet: 1.435 ± 0.609
1.435HisAsn: 1.435 ± 1.045
0.957HisPro: 0.957 ± 0.467
0.239HisGln: 0.239 ± 0.146
0.957HisArg: 0.957 ± 0.599
1.435HisSer: 1.435 ± 0.635
0.957HisThr: 0.957 ± 0.289
1.675HisVal: 1.675 ± 0.951
0.239HisTrp: 0.239 ± 0.146
0.957HisTyr: 0.957 ± 0.584
0.0HisXaa: 0.0 ± 0.0
Ile
1.435IleAla: 1.435 ± 0.732
1.914IleCys: 1.914 ± 1.668
1.675IleAsp: 1.675 ± 1.252
3.828IleGlu: 3.828 ± 1.079
1.675IlePhe: 1.675 ± 0.852
3.349IleGly: 3.349 ± 0.943
0.478IleHis: 0.478 ± 0.47
1.914IleIle: 1.914 ± 0.705
4.067IleLys: 4.067 ± 1.07
3.349IleLeu: 3.349 ± 0.959
1.675IleMet: 1.675 ± 0.632
3.589IleAsn: 3.589 ± 0.744
1.914IlePro: 1.914 ± 0.642
0.718IleGln: 0.718 ± 0.376
2.632IleArg: 2.632 ± 0.69
4.545IleSer: 4.545 ± 1.633
2.871IleThr: 2.871 ± 0.755
4.067IleVal: 4.067 ± 1.421
0.239IleTrp: 0.239 ± 0.146
1.196IleTyr: 1.196 ± 0.513
0.0IleXaa: 0.0 ± 0.0
Lys
4.785LysAla: 4.785 ± 0.959
0.957LysCys: 0.957 ± 1.339
3.828LysAsp: 3.828 ± 1.145
2.871LysGlu: 2.871 ± 0.766
2.153LysPhe: 2.153 ± 0.597
2.871LysGly: 2.871 ± 1.228
1.435LysHis: 1.435 ± 0.579
2.392LysIle: 2.392 ± 1.056
1.914LysLys: 1.914 ± 1.168
5.742LysLeu: 5.742 ± 1.476
0.718LysMet: 0.718 ± 0.366
2.871LysAsn: 2.871 ± 1.121
0.957LysPro: 0.957 ± 0.467
1.435LysGln: 1.435 ± 0.732
2.632LysArg: 2.632 ± 0.653
4.067LysSer: 4.067 ± 0.738
2.871LysThr: 2.871 ± 0.713
4.306LysVal: 4.306 ± 0.818
0.478LysTrp: 0.478 ± 0.305
2.153LysTyr: 2.153 ± 0.692
0.0LysXaa: 0.0 ± 0.0
Leu
6.22LeuAla: 6.22 ± 1.447
1.196LeuCys: 1.196 ± 0.786
6.699LeuAsp: 6.699 ± 1.542
4.067LeuGlu: 4.067 ± 1.157
4.785LeuPhe: 4.785 ± 1.566
4.306LeuGly: 4.306 ± 0.697
1.675LeuHis: 1.675 ± 0.832
2.871LeuIle: 2.871 ± 0.953
5.742LeuLys: 5.742 ± 2.558
9.091LeuLeu: 9.091 ± 2.248
3.349LeuMet: 3.349 ± 1.222
6.938LeuAsn: 6.938 ± 1.342
5.742LeuPro: 5.742 ± 1.166
1.435LeuGln: 1.435 ± 0.409
5.263LeuArg: 5.263 ± 1.519
6.22LeuSer: 6.22 ± 1.546
3.828LeuThr: 3.828 ± 0.683
7.656LeuVal: 7.656 ± 1.461
2.153LeuTrp: 2.153 ± 1.287
2.153LeuTyr: 2.153 ± 1.156
0.0LeuXaa: 0.0 ± 0.0
Met
1.675MetAla: 1.675 ± 0.544
0.718MetCys: 0.718 ± 0.477
2.153MetAsp: 2.153 ± 0.722
0.957MetGlu: 0.957 ± 0.511
0.718MetPhe: 0.718 ± 0.413
1.196MetGly: 1.196 ± 0.546
0.957MetHis: 0.957 ± 0.324
0.718MetIle: 0.718 ± 0.438
0.718MetLys: 0.718 ± 0.438
1.675MetLeu: 1.675 ± 0.483
1.196MetMet: 1.196 ± 0.702
0.718MetAsn: 0.718 ± 0.36
1.435MetPro: 1.435 ± 0.4
0.957MetGln: 0.957 ± 0.399
1.196MetArg: 1.196 ± 0.486
3.11MetSer: 3.11 ± 0.981
1.914MetThr: 1.914 ± 1.44
4.067MetVal: 4.067 ± 0.928
0.0MetTrp: 0.0 ± 0.0
0.239MetTyr: 0.239 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
3.11AsnAla: 3.11 ± 1.035
0.957AsnCys: 0.957 ± 0.418
2.632AsnAsp: 2.632 ± 0.876
1.196AsnGlu: 1.196 ± 0.429
3.349AsnPhe: 3.349 ± 0.98
3.349AsnGly: 3.349 ± 0.735
1.914AsnHis: 1.914 ± 0.599
2.871AsnIle: 2.871 ± 0.622
2.392AsnLys: 2.392 ± 1.147
4.545AsnLeu: 4.545 ± 0.933
0.478AsnMet: 0.478 ± 0.338
3.11AsnAsn: 3.11 ± 1.591
1.196AsnPro: 1.196 ± 0.73
2.153AsnGln: 2.153 ± 1.266
1.914AsnArg: 1.914 ± 1.156
4.545AsnSer: 4.545 ± 0.786
2.153AsnThr: 2.153 ± 0.641
4.785AsnVal: 4.785 ± 1.508
1.196AsnTrp: 1.196 ± 0.664
1.675AsnTyr: 1.675 ± 0.606
0.0AsnXaa: 0.0 ± 0.0
Pro
3.349ProAla: 3.349 ± 1.516
0.0ProCys: 0.0 ± 0.0
1.914ProAsp: 1.914 ± 0.419
2.632ProGlu: 2.632 ± 0.708
1.435ProPhe: 1.435 ± 0.451
2.632ProGly: 2.632 ± 0.503
0.239ProHis: 0.239 ± 0.146
3.349ProIle: 3.349 ± 1.267
0.957ProLys: 0.957 ± 0.426
3.11ProLeu: 3.11 ± 1.035
0.957ProMet: 0.957 ± 0.363
2.153ProAsn: 2.153 ± 0.627
1.914ProPro: 1.914 ± 1.463
0.239ProGln: 0.239 ± 0.146
1.435ProArg: 1.435 ± 0.472
2.632ProSer: 2.632 ± 0.736
2.392ProThr: 2.392 ± 1.225
4.545ProVal: 4.545 ± 0.657
0.957ProTrp: 0.957 ± 0.913
1.196ProTyr: 1.196 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
3.589GlnAla: 3.589 ± 1.678
0.239GlnCys: 0.239 ± 0.146
0.957GlnAsp: 0.957 ± 0.555
1.675GlnGlu: 1.675 ± 0.591
1.914GlnPhe: 1.914 ± 0.594
2.632GlnGly: 2.632 ± 0.712
0.239GlnHis: 0.239 ± 0.307
0.957GlnIle: 0.957 ± 0.584
1.435GlnLys: 1.435 ± 0.687
3.11GlnLeu: 3.11 ± 0.472
0.957GlnMet: 0.957 ± 0.584
1.196GlnAsn: 1.196 ± 0.311
0.239GlnPro: 0.239 ± 0.265
1.196GlnGln: 1.196 ± 0.861
0.718GlnArg: 0.718 ± 0.438
0.957GlnSer: 0.957 ± 0.467
1.435GlnThr: 1.435 ± 0.638
1.675GlnVal: 1.675 ± 0.752
0.478GlnTrp: 0.478 ± 0.292
0.239GlnTyr: 0.239 ± 0.146
0.0GlnXaa: 0.0 ± 0.0
Arg
3.349ArgAla: 3.349 ± 0.728
0.478ArgCys: 0.478 ± 0.388
2.871ArgAsp: 2.871 ± 0.738
3.828ArgGlu: 3.828 ± 0.943
1.914ArgPhe: 1.914 ± 0.613
4.306ArgGly: 4.306 ± 1.267
0.718ArgHis: 0.718 ± 0.278
1.675ArgIle: 1.675 ± 0.533
1.914ArgLys: 1.914 ± 0.72
5.502ArgLeu: 5.502 ± 1.29
1.914ArgMet: 1.914 ± 0.453
3.589ArgAsn: 3.589 ± 0.638
1.196ArgPro: 1.196 ± 0.52
1.435ArgGln: 1.435 ± 0.556
2.871ArgArg: 2.871 ± 1.391
3.11ArgSer: 3.11 ± 0.742
3.349ArgThr: 3.349 ± 1.706
5.502ArgVal: 5.502 ± 1.559
1.914ArgTrp: 1.914 ± 0.491
2.871ArgTyr: 2.871 ± 0.704
0.0ArgXaa: 0.0 ± 0.0
Ser
6.459SerAla: 6.459 ± 0.851
1.675SerCys: 1.675 ± 1.334
6.459SerAsp: 6.459 ± 1.647
4.545SerGlu: 4.545 ± 0.907
3.11SerPhe: 3.11 ± 1.617
5.263SerGly: 5.263 ± 0.988
0.239SerHis: 0.239 ± 0.265
4.306SerIle: 4.306 ± 0.997
4.067SerLys: 4.067 ± 1.221
6.699SerLeu: 6.699 ± 1.679
1.196SerMet: 1.196 ± 0.417
3.828SerAsn: 3.828 ± 0.859
3.349SerPro: 3.349 ± 1.022
2.153SerGln: 2.153 ± 0.833
6.459SerArg: 6.459 ± 1.522
6.699SerSer: 6.699 ± 1.14
3.828SerThr: 3.828 ± 1.06
6.938SerVal: 6.938 ± 2.012
0.478SerTrp: 0.478 ± 0.331
1.435SerTyr: 1.435 ± 0.477
0.0SerXaa: 0.0 ± 0.0
Thr
1.914ThrAla: 1.914 ± 0.607
0.478ThrCys: 0.478 ± 0.568
3.828ThrAsp: 3.828 ± 1.19
0.478ThrGlu: 0.478 ± 0.229
2.632ThrPhe: 2.632 ± 0.905
4.306ThrGly: 4.306 ± 0.661
1.914ThrHis: 1.914 ± 0.758
4.067ThrIle: 4.067 ± 0.773
2.632ThrLys: 2.632 ± 0.797
5.742ThrLeu: 5.742 ± 1.505
1.675ThrMet: 1.675 ± 0.708
1.675ThrAsn: 1.675 ± 1.336
2.153ThrPro: 2.153 ± 1.11
1.196ThrGln: 1.196 ± 0.513
3.828ThrArg: 3.828 ± 1.059
4.067ThrSer: 4.067 ± 0.831
3.11ThrThr: 3.11 ± 1.207
3.349ThrVal: 3.349 ± 0.917
1.435ThrTrp: 1.435 ± 0.477
1.675ThrTyr: 1.675 ± 0.716
0.0ThrXaa: 0.0 ± 0.0
Val
8.852ValAla: 8.852 ± 1.401
1.435ValCys: 1.435 ± 0.725
5.742ValAsp: 5.742 ± 1.487
4.067ValGlu: 4.067 ± 0.937
4.785ValPhe: 4.785 ± 1.657
7.895ValGly: 7.895 ± 2.579
1.675ValHis: 1.675 ± 0.326
3.349ValIle: 3.349 ± 0.763
5.263ValLys: 5.263 ± 1.975
10.526ValLeu: 10.526 ± 2.395
1.914ValMet: 1.914 ± 1.463
2.871ValAsn: 2.871 ± 1.008
4.306ValPro: 4.306 ± 1.057
1.675ValGln: 1.675 ± 0.499
4.067ValArg: 4.067 ± 0.734
7.895ValSer: 7.895 ± 1.404
5.024ValThr: 5.024 ± 1.422
8.373ValVal: 8.373 ± 2.461
0.478ValTrp: 0.478 ± 0.331
3.349ValTyr: 3.349 ± 0.636
0.0ValXaa: 0.0 ± 0.0
Trp
0.718TrpAla: 0.718 ± 0.422
0.239TrpCys: 0.239 ± 0.461
1.435TrpAsp: 1.435 ± 0.7
0.718TrpGlu: 0.718 ± 0.438
0.718TrpPhe: 0.718 ± 0.679
0.0TrpGly: 0.0 ± 0.0
0.239TrpHis: 0.239 ± 0.307
0.718TrpIle: 0.718 ± 0.36
1.435TrpLys: 1.435 ± 0.409
2.871TrpLeu: 2.871 ± 0.77
0.0TrpMet: 0.0 ± 0.0
0.478TrpAsn: 0.478 ± 0.516
0.0TrpPro: 0.0 ± 0.0
0.718TrpGln: 0.718 ± 0.438
1.435TrpArg: 1.435 ± 0.491
2.153TrpSer: 2.153 ± 0.665
1.435TrpThr: 1.435 ± 0.938
1.435TrpVal: 1.435 ± 0.991
0.0TrpTrp: 0.0 ± 0.0
0.478TrpTyr: 0.478 ± 0.331
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.392TyrAla: 2.392 ± 0.956
0.718TyrCys: 0.718 ± 0.366
2.871TyrAsp: 2.871 ± 0.489
0.957TyrGlu: 0.957 ± 0.395
1.914TyrPhe: 1.914 ± 0.917
1.435TyrGly: 1.435 ± 1.056
1.435TyrHis: 1.435 ± 0.471
1.914TyrIle: 1.914 ± 0.56
1.435TyrLys: 1.435 ± 0.732
2.392TyrLeu: 2.392 ± 0.785
0.718TyrMet: 0.718 ± 0.422
1.435TyrAsn: 1.435 ± 0.464
1.435TyrPro: 1.435 ± 1.203
1.914TyrGln: 1.914 ± 0.917
1.914TyrArg: 1.914 ± 0.699
1.435TyrSer: 1.435 ± 0.518
0.957TyrThr: 0.957 ± 0.596
2.392TyrVal: 2.392 ± 0.884
0.478TyrTrp: 0.478 ± 0.357
0.957TyrTyr: 0.957 ± 0.467
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (4181 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski