Amino acid dipeptide frequency for Malpais Spring vesiculovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
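As a rough illustration of the idea, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It assumes one-letter amino acid codes and normalisation by the total number of pairs; the function name `dipeptide_permille` and the toy sequences are hypothetical, and the database may normalise differently.

```python
from collections import Counter

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptide windows.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real viral proteins).
freqs = dipeptide_permille(["MKLLA", "LLK"])
```

Here "MKLLA" contributes four pairs and "LLK" two, so each pair's share is computed out of six windows.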

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
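The arithmetic behind the expected value can be sketched as follows. The comparison rule is an assumption (a plain above/below test against the uniform expectation); the page does not state the exact threshold used for the colouring, and `classify` is a hypothetical helper. The two example values are taken from the table and are expressed in per mille.

```python
N_STANDARD = 20
# 20 * 20 = 400 ordered pairs, so the uniform expectation is
# 1/400 = 0.25%, which equals 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD ** 2

def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Two values from the table, in per mille.
examples = {"LeuIle": 10.458, "CysCys": 0.565}
labels = {name: classify(value) for name, value in examples.items()}
```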

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order (rows = first residue, columns = second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.109 ± 1.868
AlaCys: 1.131 ± 0.675
AlaAsp: 2.261 ± 0.753
AlaGlu: 1.413 ± 0.756
AlaPhe: 0.848 ± 0.506
AlaGly: 3.674 ± 2.121
AlaHis: 1.413 ± 0.616
AlaIle: 5.37 ± 1.3
AlaLys: 2.544 ± 1.307
AlaLeu: 4.24 ± 1.375
AlaMet: 0.848 ± 0.317
AlaAsn: 2.544 ± 0.848
AlaPro: 1.979 ± 1.51
AlaGln: 1.696 ± 0.348
AlaArg: 1.696 ± 0.743
AlaSer: 4.24 ± 1.206
AlaThr: 2.261 ± 0.96
AlaVal: 3.392 ± 0.896
AlaTrp: 0.848 ± 0.896
AlaTyr: 1.131 ± 0.332
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.848 ± 0.402
CysCys: 0.565 ± 0.343
CysAsp: 0.565 ± 0.88
CysGlu: 1.131 ± 0.775
CysPhe: 0.848 ± 0.402
CysGly: 1.131 ± 0.688
CysHis: 0.565 ± 0.572
CysIle: 0.848 ± 0.316
CysLys: 3.109 ± 0.716
CysLeu: 1.413 ± 0.595
CysMet: 0.283 ± 0.373
CysAsn: 1.131 ± 0.372
CysPro: 1.131 ± 0.435
CysGln: 1.413 ± 0.805
CysArg: 0.848 ± 0.436
CysSer: 1.413 ± 0.377
CysThr: 0.565 ± 0.337
CysVal: 1.131 ± 0.372
CysTrp: 0.848 ± 0.506
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.261 ± 0.388
AspCys: 0.848 ± 0.402
AspAsp: 4.522 ± 1.542
AspGlu: 2.544 ± 0.508
AspPhe: 1.979 ± 0.216
AspGly: 2.544 ± 0.338
AspHis: 1.131 ± 0.435
AspIle: 3.957 ± 1.104
AspLys: 4.805 ± 0.611
AspLeu: 7.631 ± 1.132
AspMet: 1.979 ± 0.704
AspAsn: 2.261 ± 0.769
AspPro: 3.957 ± 0.785
AspGln: 1.696 ± 1.032
AspArg: 1.413 ± 0.731
AspSer: 3.957 ± 0.575
AspThr: 3.392 ± 0.258
AspVal: 3.109 ± 1.305
AspTrp: 1.413 ± 0.777
AspTyr: 4.805 ± 0.881
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.979 ± 0.832
GluCys: 0.565 ± 0.344
GluAsp: 3.392 ± 0.895
GluGlu: 6.501 ± 2.213
GluPhe: 3.674 ± 1.013
GluGly: 4.522 ± 2.854
GluHis: 0.283 ± 0.169
GluIle: 5.653 ± 1.675
GluLys: 3.674 ± 1.181
GluLeu: 5.088 ± 0.918
GluMet: 1.696 ± 1.492
GluAsn: 1.413 ± 0.843
GluPro: 2.261 ± 0.228
GluGln: 1.696 ± 0.552
GluArg: 1.413 ± 0.639
GluSer: 3.674 ± 0.263
GluThr: 4.805 ± 0.805
GluVal: 3.674 ± 1.469
GluTrp: 1.413 ± 0.805
GluTyr: 2.826 ± 0.88
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.413 ± 0.756
PheCys: 0.283 ± 0.169
PheAsp: 1.979 ± 0.641
PheGlu: 2.826 ± 1.322
PhePhe: 1.696 ± 0.348
PheGly: 2.544 ± 0.633
PheHis: 1.413 ± 0.68
PheIle: 1.413 ± 0.843
PheLys: 1.979 ± 0.909
PheLeu: 4.805 ± 1.799
PheMet: 0.848 ± 0.338
PheAsn: 1.413 ± 0.616
PhePro: 3.392 ± 1.084
PheGln: 1.131 ± 0.475
PheArg: 2.544 ± 1.518
PheSer: 4.522 ± 0.914
PheThr: 1.696 ± 0.752
PheVal: 1.696 ± 0.47
PheTrp: 0.848 ± 0.503
PheTyr: 1.131 ± 0.586
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.696 ± 0.671
GlyCys: 0.565 ± 0.337
GlyAsp: 3.392 ± 0.558
GlyGlu: 3.957 ± 2.765
GlyPhe: 2.544 ± 0.548
GlyGly: 2.261 ± 0.586
GlyHis: 1.131 ± 0.455
GlyIle: 4.24 ± 1.955
GlyLys: 4.522 ± 1.206
GlyLeu: 9.61 ± 1.151
GlyMet: 1.413 ± 0.382
GlyAsn: 2.261 ± 0.291
GlyPro: 1.979 ± 0.612
GlyGln: 3.109 ± 0.854
GlyArg: 2.826 ± 1.233
GlySer: 4.522 ± 1.082
GlyThr: 3.957 ± 0.432
GlyVal: 4.24 ± 2.042
GlyTrp: 1.131 ± 0.372
GlyTyr: 1.131 ± 0.688
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.131 ± 0.688
HisCys: 0.565 ± 0.344
HisAsp: 1.131 ± 0.687
HisGlu: 1.131 ± 0.372
HisPhe: 1.413 ± 0.616
HisGly: 1.413 ± 0.467
HisHis: 0.565 ± 0.344
HisIle: 1.979 ± 0.216
HisLys: 0.848 ± 0.402
HisLeu: 1.696 ± 0.871
HisMet: 0.0 ± 0.0
HisAsn: 1.413 ± 0.767
HisPro: 2.826 ± 0.546
HisGln: 0.848 ± 0.402
HisArg: 0.848 ± 0.506
HisSer: 2.826 ± 1.202
HisThr: 1.413 ± 0.484
HisVal: 1.413 ± 0.595
HisTrp: 0.848 ± 0.506
HisTyr: 0.848 ± 0.343
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.674 ± 0.548
IleCys: 2.261 ± 0.874
IleAsp: 5.936 ± 0.738
IleGlu: 3.957 ± 0.705
IlePhe: 1.696 ± 0.564
IleGly: 4.522 ± 0.997
IleHis: 3.109 ± 1.132
IleIle: 3.392 ± 1.137
IleLys: 9.045 ± 1.081
IleLeu: 3.957 ± 0.611
IleMet: 0.848 ± 0.623
IleAsn: 3.957 ± 0.499
IlePro: 4.24 ± 1.476
IleGln: 2.261 ± 0.985
IleArg: 7.349 ± 2.233
IleSer: 4.522 ± 0.456
IleThr: 3.957 ± 0.705
IleVal: 3.674 ± 0.904
IleTrp: 0.565 ± 0.344
IleTyr: 2.826 ± 0.873
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.544 ± 1.13
LysCys: 1.413 ± 0.377
LysAsp: 4.24 ± 1.993
LysGlu: 4.805 ± 1.528
LysPhe: 2.261 ± 1.217
LysGly: 4.805 ± 0.881
LysHis: 1.413 ± 0.731
LysIle: 7.349 ± 1.373
LysLys: 6.501 ± 2.99
LysLeu: 5.37 ± 1.823
LysMet: 1.979 ± 0.67
LysAsn: 2.544 ± 1.051
LysPro: 2.544 ± 1.102
LysGln: 1.696 ± 0.743
LysArg: 2.826 ± 1.281
LysSer: 6.501 ± 1.866
LysThr: 4.805 ± 1.328
LysVal: 3.674 ± 1.338
LysTrp: 2.261 ± 0.791
LysTyr: 3.109 ± 0.778
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.957 ± 0.894
LeuCys: 1.131 ± 0.688
LeuAsp: 4.522 ± 1.116
LeuGlu: 5.088 ± 1.437
LeuPhe: 2.826 ± 1.045
LeuGly: 4.805 ± 1.637
LeuHis: 1.696 ± 0.348
LeuIle: 10.458 ± 1.826
LeuLys: 7.349 ± 0.775
LeuLeu: 10.175 ± 1.678
LeuMet: 3.109 ± 0.854
LeuAsn: 5.653 ± 2.126
LeuPro: 5.37 ± 1.147
LeuGln: 1.696 ± 0.705
LeuArg: 5.37 ± 1.156
LeuSer: 7.349 ± 1.646
LeuThr: 3.957 ± 1.157
LeuVal: 4.24 ± 0.885
LeuTrp: 1.413 ± 0.639
LeuTyr: 2.544 ± 0.682
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.544 ± 0.682
MetCys: 0.565 ± 0.88
MetAsp: 1.413 ± 0.767
MetGlu: 1.696 ± 0.557
MetPhe: 1.131 ± 0.455
MetGly: 1.696 ± 0.783
MetHis: 0.565 ± 0.337
MetIle: 0.848 ± 0.506
MetLys: 1.979 ± 0.612
MetLeu: 1.979 ± 0.521
MetMet: 1.131 ± 0.455
MetAsn: 1.413 ± 0.41
MetPro: 0.848 ± 0.742
MetGln: 0.848 ± 0.402
MetArg: 0.565 ± 0.337
MetSer: 3.109 ± 0.444
MetThr: 1.413 ± 0.756
MetVal: 0.848 ± 0.343
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.826 ± 1.511
AsnCys: 0.848 ± 1.096
AsnAsp: 3.109 ± 1.109
AsnGlu: 1.696 ± 0.758
AsnPhe: 1.696 ± 0.758
AsnGly: 3.392 ± 1.614
AsnHis: 0.848 ± 0.506
AsnIle: 2.826 ± 1.308
AsnLys: 2.826 ± 0.925
AsnLeu: 4.805 ± 2.051
AsnMet: 0.565 ± 0.337
AsnAsn: 1.979 ± 0.772
AsnPro: 4.24 ± 0.647
AsnGln: 1.413 ± 0.843
AsnArg: 3.109 ± 0.761
AsnSer: 4.522 ± 0.282
AsnThr: 2.544 ± 0.927
AsnVal: 1.413 ± 0.843
AsnTrp: 1.131 ± 0.372
AsnTyr: 1.413 ± 0.653
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.979 ± 0.909
ProCys: 0.848 ± 0.436
ProAsp: 5.088 ± 1.393
ProGlu: 1.979 ± 0.915
ProPhe: 1.696 ± 0.783
ProGly: 1.979 ± 1.38
ProHis: 1.979 ± 1.113
ProIle: 3.392 ± 0.868
ProLys: 3.674 ± 0.476
ProLeu: 4.522 ± 1.606
ProMet: 1.413 ± 0.925
ProAsn: 2.544 ± 1.17
ProPro: 3.109 ± 0.633
ProGln: 0.565 ± 0.344
ProArg: 1.979 ± 1.113
ProSer: 5.653 ± 1.794
ProThr: 4.522 ± 0.639
ProVal: 3.109 ± 0.716
ProTrp: 0.565 ± 0.337
ProTyr: 1.413 ± 1.166
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.696 ± 0.804
GlnCys: 0.848 ± 0.402
GlnAsp: 0.848 ± 0.316
GlnGlu: 1.131 ± 0.743
GlnPhe: 1.696 ± 0.969
GlnGly: 2.544 ± 0.724
GlnHis: 0.283 ± 0.169
GlnIle: 1.696 ± 0.743
GlnLys: 1.413 ± 0.295
GlnLeu: 1.979 ± 0.535
GlnMet: 1.413 ± 0.377
GlnAsn: 1.979 ± 0.67
GlnPro: 0.848 ± 0.316
GlnGln: 0.565 ± 0.344
GlnArg: 1.413 ± 0.295
GlnSer: 1.979 ± 0.253
GlnThr: 1.696 ± 1.177
GlnVal: 1.696 ± 0.534
GlnTrp: 0.848 ± 0.316
GlnTyr: 1.413 ± 0.295
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.392 ± 0.667
ArgCys: 0.848 ± 0.316
ArgAsp: 3.674 ± 0.629
ArgGlu: 3.674 ± 0.833
ArgPhe: 3.674 ± 1.879
ArgGly: 3.674 ± 0.62
ArgHis: 1.131 ± 0.455
ArgIle: 2.261 ± 0.985
ArgLys: 2.261 ± 0.586
ArgLeu: 2.826 ± 1.002
ArgMet: 1.696 ± 0.348
ArgAsn: 1.979 ± 1.181
ArgPro: 1.979 ± 0.853
ArgGln: 0.848 ± 0.402
ArgArg: 1.696 ± 0.39
ArgSer: 2.826 ± 0.968
ArgThr: 2.826 ± 1.281
ArgVal: 3.957 ± 1.103
ArgTrp: 1.696 ± 0.414
ArgTyr: 1.413 ± 0.68
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.674 ± 0.644
SerCys: 1.413 ± 0.552
SerAsp: 5.936 ± 1.404
SerGlu: 6.218 ± 0.831
SerPhe: 3.674 ± 0.62
SerGly: 4.805 ± 0.24
SerHis: 3.392 ± 0.456
SerIle: 7.349 ± 0.768
SerLys: 5.936 ± 1.635
SerLeu: 6.783 ± 1.596
SerMet: 0.283 ± 0.169
SerAsn: 3.392 ± 0.483
SerPro: 5.088 ± 1.245
SerGln: 1.413 ± 1.111
SerArg: 3.957 ± 0.724
SerSer: 7.914 ± 1.927
SerThr: 4.522 ± 0.356
SerVal: 5.088 ± 0.83
SerTrp: 0.565 ± 0.337
SerTyr: 2.544 ± 0.945
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.544 ± 0.718
ThrCys: 1.696 ± 0.557
ThrAsp: 2.261 ± 1.013
ThrGlu: 2.826 ± 1.045
ThrPhe: 1.696 ± 0.743
ThrGly: 3.109 ± 1.224
ThrHis: 1.413 ± 0.484
ThrIle: 5.936 ± 0.955
ThrLys: 3.109 ± 0.974
ThrLeu: 5.936 ± 1.033
ThrMet: 2.261 ± 0.992
ThrAsn: 3.957 ± 0.855
ThrPro: 1.413 ± 0.891
ThrGln: 1.413 ± 0.295
ThrArg: 2.544 ± 0.403
ThrSer: 5.088 ± 1.28
ThrThr: 4.805 ± 0.95
ThrVal: 1.979 ± 0.592
ThrTrp: 1.696 ± 0.348
ThrTyr: 1.413 ± 0.843
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.109 ± 0.974
ValCys: 2.261 ± 0.486
ValAsp: 3.392 ± 1.065
ValGlu: 4.522 ± 0.933
ValPhe: 0.848 ± 0.316
ValGly: 2.826 ± 0.863
ValHis: 1.413 ± 0.382
ValIle: 4.522 ± 1.131
ValLys: 3.392 ± 0.944
ValLeu: 3.957 ± 1.0
ValMet: 1.979 ± 0.772
ValAsn: 1.696 ± 0.47
ValPro: 2.826 ± 0.909
ValGln: 1.696 ± 0.885
ValArg: 2.544 ± 0.624
ValSer: 5.088 ± 1.39
ValThr: 2.544 ± 0.927
ValVal: 2.544 ± 0.766
ValTrp: 0.565 ± 0.344
ValTyr: 2.261 ± 0.616
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.283 ± 0.44
TrpCys: 0.0 ± 0.0
TrpAsp: 1.413 ± 0.295
TrpGlu: 2.261 ± 0.842
TrpPhe: 1.413 ± 1.111
TrpGly: 1.979 ± 0.883
TrpHis: 0.283 ± 0.169
TrpIle: 1.413 ± 0.639
TrpLys: 1.131 ± 0.372
TrpLeu: 1.696 ± 0.743
TrpMet: 0.0 ± 0.0
TrpAsn: 0.848 ± 0.316
TrpPro: 0.565 ± 0.337
TrpGln: 0.565 ± 0.371
TrpArg: 0.565 ± 0.337
TrpSer: 1.696 ± 1.012
TrpThr: 0.848 ± 0.485
TrpVal: 1.413 ± 0.946
TrpTrp: 0.283 ± 0.414
TrpTyr: 0.283 ± 0.44
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.979 ± 1.842
TyrCys: 1.131 ± 0.688
TyrAsp: 1.131 ± 0.564
TyrGlu: 0.848 ± 0.402
TyrPhe: 2.261 ± 0.759
TyrGly: 2.261 ± 0.6
TyrHis: 1.131 ± 0.372
TyrIle: 0.848 ± 0.402
TyrLys: 2.544 ± 1.207
TyrLeu: 4.24 ± 1.056
TyrMet: 0.565 ± 0.371
TyrAsn: 2.826 ± 0.786
TyrPro: 1.413 ± 0.377
TyrGln: 1.413 ± 0.616
TyrArg: 2.826 ± 1.304
TyrSer: 2.544 ± 0.403
TyrThr: 0.848 ± 0.316
TyrVal: 1.696 ± 1.479
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.283 ± 0.169
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3539 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
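One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not the database's documented implementation; the function names, the fixed seed, and the use of the sample standard deviation are all illustrative choices.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences (not real viral proteins).
err = bootstrap_error(["MKLLA", "AAAA", "LLLL"], "LL")
```

With only 5 proteins in this proteome, each resample can differ substantially from the original set, which would be consistent with the relatively large errors seen for many dipeptides in the table.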

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski