Amino acid dipepetide frequency for Jurona vesiculovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.243AlaAla: 2.243 ± 0.901
1.683AlaCys: 1.683 ± 0.956
1.683AlaAsp: 1.683 ± 0.363
1.963AlaGlu: 1.963 ± 1.478
0.841AlaPhe: 0.841 ± 0.457
1.963AlaGly: 1.963 ± 1.098
1.402AlaHis: 1.402 ± 0.306
2.804AlaIle: 2.804 ± 0.745
2.243AlaLys: 2.243 ± 0.839
7.572AlaLeu: 7.572 ± 0.483
1.402AlaMet: 1.402 ± 0.405
2.524AlaAsn: 2.524 ± 0.612
1.402AlaPro: 1.402 ± 1.175
1.963AlaGln: 1.963 ± 0.544
1.683AlaArg: 1.683 ± 0.565
4.206AlaSer: 4.206 ± 1.456
1.683AlaThr: 1.683 ± 0.363
3.365AlaVal: 3.365 ± 1.845
1.122AlaTrp: 1.122 ± 0.719
1.122AlaTyr: 1.122 ± 0.63
0.0AlaXaa: 0.0 ± 0.0
Cys
1.122CysAla: 1.122 ± 0.556
0.841CysCys: 0.841 ± 0.312
0.841CysAsp: 0.841 ± 1.17
1.122CysGlu: 1.122 ± 0.811
0.841CysPhe: 0.841 ± 0.39
0.561CysGly: 0.561 ± 0.319
0.841CysHis: 0.841 ± 0.412
0.561CysIle: 0.561 ± 0.319
2.243CysLys: 2.243 ± 0.95
1.122CysLeu: 1.122 ± 0.463
0.0CysMet: 0.0 ± 0.0
0.561CysAsn: 0.561 ± 0.315
1.122CysPro: 1.122 ± 0.63
1.122CysGln: 1.122 ± 0.382
0.561CysArg: 0.561 ± 0.32
1.122CysSer: 1.122 ± 0.467
0.28CysThr: 0.28 ± 0.159
1.402CysVal: 1.402 ± 0.495
0.841CysTrp: 0.841 ± 0.478
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.122AspAla: 1.122 ± 0.326
1.122AspCys: 1.122 ± 0.736
5.889AspAsp: 5.889 ± 1.854
3.926AspGlu: 3.926 ± 1.54
2.243AspPhe: 2.243 ± 0.6
3.646AspGly: 3.646 ± 0.594
0.28AspHis: 0.28 ± 0.39
3.085AspIle: 3.085 ± 0.82
3.085AspLys: 3.085 ± 0.521
6.73AspLeu: 6.73 ± 0.956
2.243AspMet: 2.243 ± 0.528
2.524AspAsn: 2.524 ± 0.5
3.365AspPro: 3.365 ± 0.46
1.402AspGln: 1.402 ± 0.399
1.683AspArg: 1.683 ± 0.956
3.926AspSer: 3.926 ± 1.113
3.085AspThr: 3.085 ± 0.875
2.804AspVal: 2.804 ± 1.361
0.841AspTrp: 0.841 ± 0.486
4.206AspTyr: 4.206 ± 0.975
0.0AspXaa: 0.0 ± 0.0
Glu
2.243GluAla: 2.243 ± 1.357
0.28GluCys: 0.28 ± 0.159
5.328GluAsp: 5.328 ± 1.748
6.45GluGlu: 6.45 ± 2.487
2.524GluPhe: 2.524 ± 0.852
1.683GluGly: 1.683 ± 0.711
2.243GluHis: 2.243 ± 0.648
3.646GluIle: 3.646 ± 1.03
3.926GluLys: 3.926 ± 1.851
6.169GluLeu: 6.169 ± 1.493
3.085GluMet: 3.085 ± 1.225
2.524GluAsn: 2.524 ± 1.453
1.683GluPro: 1.683 ± 0.363
1.963GluGln: 1.963 ± 0.56
1.683GluArg: 1.683 ± 0.714
5.328GluSer: 5.328 ± 1.54
3.365GluThr: 3.365 ± 1.647
3.085GluVal: 3.085 ± 0.844
1.683GluTrp: 1.683 ± 1.11
3.365GluTyr: 3.365 ± 1.042
0.0GluXaa: 0.0 ± 0.0
Phe
2.243PheAla: 2.243 ± 0.901
0.561PheCys: 0.561 ± 0.315
2.804PheAsp: 2.804 ± 0.548
0.841PheGlu: 0.841 ± 0.39
1.402PhePhe: 1.402 ± 0.632
2.524PheGly: 2.524 ± 1.065
1.402PheHis: 1.402 ± 0.804
1.402PheIle: 1.402 ± 0.399
6.169PheLys: 6.169 ± 1.411
5.328PheLeu: 5.328 ± 1.504
1.683PheMet: 1.683 ± 1.024
1.683PheAsn: 1.683 ± 0.846
3.085PhePro: 3.085 ± 1.513
0.561PheGln: 0.561 ± 0.319
1.683PheArg: 1.683 ± 0.629
4.206PheSer: 4.206 ± 0.921
0.0PheThr: 0.0 ± 0.0
1.963PheVal: 1.963 ± 0.648
0.561PheTrp: 0.561 ± 0.32
0.841PheTyr: 0.841 ± 0.633
0.0PheXaa: 0.0 ± 0.0
Gly
2.524GlyAla: 2.524 ± 2.072
0.28GlyCys: 0.28 ± 0.159
3.365GlyAsp: 3.365 ± 0.727
2.524GlyGlu: 2.524 ± 0.718
2.804GlyPhe: 2.804 ± 0.564
3.365GlyGly: 3.365 ± 1.065
1.122GlyHis: 1.122 ± 0.326
6.169GlyIle: 6.169 ± 2.78
3.646GlyLys: 3.646 ± 0.523
7.572GlyLeu: 7.572 ± 0.703
2.243GlyMet: 2.243 ± 0.531
2.804GlyAsn: 2.804 ± 0.733
2.524GlyPro: 2.524 ± 0.801
3.646GlyGln: 3.646 ± 0.968
3.926GlyArg: 3.926 ± 1.296
4.487GlySer: 4.487 ± 1.346
3.646GlyThr: 3.646 ± 0.932
3.646GlyVal: 3.646 ± 2.265
1.963GlyTrp: 1.963 ± 0.678
0.561GlyTyr: 0.561 ± 0.315
0.0GlyXaa: 0.0 ± 0.0
His
0.561HisAla: 0.561 ± 0.315
0.28HisCys: 0.28 ± 0.159
0.561HisAsp: 0.561 ± 0.319
0.561HisGlu: 0.561 ± 0.319
1.683HisPhe: 1.683 ± 0.683
1.683HisGly: 1.683 ± 0.363
0.841HisHis: 0.841 ± 0.312
1.402HisIle: 1.402 ± 0.306
0.841HisLys: 0.841 ± 0.478
1.963HisLeu: 1.963 ± 0.47
0.561HisMet: 0.561 ± 0.315
0.841HisAsn: 0.841 ± 1.17
1.683HisPro: 1.683 ± 0.821
1.402HisGln: 1.402 ± 0.572
1.402HisArg: 1.402 ± 0.495
3.646HisSer: 3.646 ± 0.825
0.841HisThr: 0.841 ± 0.39
1.963HisVal: 1.963 ± 0.533
1.402HisTrp: 1.402 ± 0.495
0.561HisTyr: 0.561 ± 0.32
0.0HisXaa: 0.0 ± 0.0
Ile
2.243IleAla: 2.243 ± 0.683
1.683IleCys: 1.683 ± 0.945
5.328IleAsp: 5.328 ± 1.091
4.487IleGlu: 4.487 ± 0.832
3.646IlePhe: 3.646 ± 0.661
2.524IleGly: 2.524 ± 0.927
1.683IleHis: 1.683 ± 0.629
2.804IleIle: 2.804 ± 1.972
5.609IleLys: 5.609 ± 0.966
5.609IleLeu: 5.609 ± 0.958
1.122IleMet: 1.122 ± 0.382
3.646IleAsn: 3.646 ± 0.702
5.609IlePro: 5.609 ± 0.863
3.646IleGln: 3.646 ± 1.418
4.487IleArg: 4.487 ± 2.22
5.048IleSer: 5.048 ± 0.926
3.085IleThr: 3.085 ± 1.098
2.804IleVal: 2.804 ± 1.166
1.402IleTrp: 1.402 ± 0.71
2.243IleTyr: 2.243 ± 0.973
0.0IleXaa: 0.0 ± 0.0
Lys
2.804LysAla: 2.804 ± 0.414
0.561LysCys: 0.561 ± 0.736
3.365LysAsp: 3.365 ± 0.61
5.328LysGlu: 5.328 ± 1.098
1.402LysPhe: 1.402 ± 0.716
4.487LysGly: 4.487 ± 0.759
0.841LysHis: 0.841 ± 0.478
5.889LysIle: 5.889 ± 2.128
5.048LysLys: 5.048 ± 0.8
5.328LysLeu: 5.328 ± 1.429
1.683LysMet: 1.683 ± 0.696
5.328LysAsn: 5.328 ± 0.925
1.683LysPro: 1.683 ± 0.321
1.402LysGln: 1.402 ± 0.405
3.926LysArg: 3.926 ± 0.681
6.45LysSer: 6.45 ± 1.009
3.926LysThr: 3.926 ± 0.625
4.206LysVal: 4.206 ± 1.339
1.402LysTrp: 1.402 ± 0.306
3.926LysTyr: 3.926 ± 1.183
0.0LysXaa: 0.0 ± 0.0
Leu
4.206LeuAla: 4.206 ± 0.665
2.243LeuCys: 2.243 ± 1.26
6.169LeuAsp: 6.169 ± 1.596
7.011LeuGlu: 7.011 ± 1.94
3.085LeuPhe: 3.085 ± 0.286
6.45LeuGly: 6.45 ± 1.588
2.243LeuHis: 2.243 ± 0.754
8.974LeuIle: 8.974 ± 2.482
8.413LeuLys: 8.413 ± 1.715
7.572LeuLeu: 7.572 ± 1.764
3.365LeuMet: 3.365 ± 0.839
4.487LeuAsn: 4.487 ± 1.443
4.767LeuPro: 4.767 ± 0.938
2.804LeuGln: 2.804 ± 0.623
7.291LeuArg: 7.291 ± 0.564
10.376LeuSer: 10.376 ± 2.195
4.487LeuThr: 4.487 ± 1.037
3.365LeuVal: 3.365 ± 0.621
0.841LeuTrp: 0.841 ± 0.478
3.646LeuTyr: 3.646 ± 0.291
0.0LeuXaa: 0.0 ± 0.0
Met
1.963MetAla: 1.963 ± 0.869
0.0MetCys: 0.0 ± 0.0
1.683MetAsp: 1.683 ± 0.582
2.243MetGlu: 2.243 ± 1.364
0.841MetPhe: 0.841 ± 0.633
1.683MetGly: 1.683 ± 0.629
0.841MetHis: 0.841 ± 0.478
2.243MetIle: 2.243 ± 0.395
3.085MetLys: 3.085 ± 1.387
2.243MetLeu: 2.243 ± 0.48
0.841MetMet: 0.841 ± 0.633
1.683MetAsn: 1.683 ± 0.623
0.28MetPro: 0.28 ± 0.372
1.122MetGln: 1.122 ± 0.382
1.402MetArg: 1.402 ± 0.572
3.365MetSer: 3.365 ± 0.948
0.841MetThr: 0.841 ± 0.39
0.561MetVal: 0.561 ± 0.78
0.28MetTrp: 0.28 ± 0.39
0.28MetTyr: 0.28 ± 0.554
0.0MetXaa: 0.0 ± 0.0
Asn
2.524AsnAla: 2.524 ± 0.634
0.28AsnCys: 0.28 ± 0.159
1.402AsnAsp: 1.402 ± 0.306
2.243AsnGlu: 2.243 ± 1.74
2.804AsnPhe: 2.804 ± 0.548
3.365AsnGly: 3.365 ± 1.077
1.122AsnHis: 1.122 ± 0.637
2.524AsnIle: 2.524 ± 0.634
1.963AsnLys: 1.963 ± 0.639
7.011AsnLeu: 7.011 ± 1.187
1.122AsnMet: 1.122 ± 1.446
1.963AsnAsn: 1.963 ± 0.507
3.646AsnPro: 3.646 ± 0.839
4.206AsnGln: 4.206 ± 1.375
1.402AsnArg: 1.402 ± 0.562
3.365AsnSer: 3.365 ± 0.621
3.085AsnThr: 3.085 ± 1.12
1.963AsnVal: 1.963 ± 0.695
1.683AsnTrp: 1.683 ± 0.714
1.122AsnTyr: 1.122 ± 0.526
0.0AsnXaa: 0.0 ± 0.0
Pro
2.524ProAla: 2.524 ± 0.818
0.0ProCys: 0.0 ± 0.0
2.804ProAsp: 2.804 ± 0.613
1.963ProGlu: 1.963 ± 1.955
1.683ProPhe: 1.683 ± 1.107
2.243ProGly: 2.243 ± 1.019
1.683ProHis: 1.683 ± 0.623
3.926ProIle: 3.926 ± 0.624
1.963ProLys: 1.963 ± 0.678
4.487ProLeu: 4.487 ± 1.377
0.841ProMet: 0.841 ± 1.234
1.402ProAsn: 1.402 ± 0.716
1.963ProPro: 1.963 ± 1.14
0.841ProGln: 0.841 ± 1.17
1.402ProArg: 1.402 ± 0.796
5.889ProSer: 5.889 ± 1.609
4.767ProThr: 4.767 ± 0.755
1.963ProVal: 1.963 ± 0.592
0.841ProTrp: 0.841 ± 0.39
1.122ProTyr: 1.122 ± 1.043
0.0ProXaa: 0.0 ± 0.0
Gln
1.963GlnAla: 1.963 ± 0.906
1.122GlnCys: 1.122 ± 0.326
0.841GlnAsp: 0.841 ± 0.312
1.683GlnGlu: 1.683 ± 1.001
2.243GlnPhe: 2.243 ± 0.683
2.524GlnGly: 2.524 ± 0.588
0.841GlnHis: 0.841 ± 0.478
1.963GlnIle: 1.963 ± 0.56
1.683GlnLys: 1.683 ± 0.618
2.524GlnLeu: 2.524 ± 0.623
0.561GlnMet: 0.561 ± 0.319
1.683GlnAsn: 1.683 ± 0.821
0.561GlnPro: 0.561 ± 0.319
0.28GlnGln: 0.28 ± 0.159
1.683GlnArg: 1.683 ± 0.998
2.243GlnSer: 2.243 ± 0.395
2.524GlnThr: 2.524 ± 0.779
1.963GlnVal: 1.963 ± 0.827
1.683GlnTrp: 1.683 ± 0.503
1.402GlnTyr: 1.402 ± 0.425
0.0GlnXaa: 0.0 ± 0.0
Arg
3.365ArgAla: 3.365 ± 1.236
0.841ArgCys: 0.841 ± 0.312
2.243ArgAsp: 2.243 ± 0.743
3.365ArgGlu: 3.365 ± 1.157
2.243ArgPhe: 2.243 ± 0.981
5.609ArgGly: 5.609 ± 1.307
1.683ArgHis: 1.683 ± 0.945
1.683ArgIle: 1.683 ± 0.459
2.243ArgLys: 2.243 ± 0.6
3.365ArgLeu: 3.365 ± 0.958
2.243ArgMet: 2.243 ± 0.459
2.243ArgAsn: 2.243 ± 1.274
1.683ArgPro: 1.683 ± 0.911
0.841ArgGln: 0.841 ± 0.343
1.683ArgArg: 1.683 ± 0.585
6.169ArgSer: 6.169 ± 1.211
3.926ArgThr: 3.926 ± 0.528
2.524ArgVal: 2.524 ± 0.711
0.841ArgTrp: 0.841 ± 0.312
1.122ArgTyr: 1.122 ± 0.63
0.0ArgXaa: 0.0 ± 0.0
Ser
5.328SerAla: 5.328 ± 1.184
1.402SerCys: 1.402 ± 1.275
4.767SerAsp: 4.767 ± 0.944
7.852SerGlu: 7.852 ± 2.165
4.487SerPhe: 4.487 ± 1.05
6.73SerGly: 6.73 ± 1.024
1.402SerHis: 1.402 ± 0.796
8.413SerIle: 8.413 ± 1.801
5.048SerLys: 5.048 ± 0.68
9.534SerLeu: 9.534 ± 2.08
1.122SerMet: 1.122 ± 0.316
3.926SerAsn: 3.926 ± 1.139
3.085SerPro: 3.085 ± 0.349
1.122SerGln: 1.122 ± 0.467
4.767SerArg: 4.767 ± 0.473
10.937SerSer: 10.937 ± 1.548
3.926SerThr: 3.926 ± 0.85
6.73SerVal: 6.73 ± 1.22
1.122SerTrp: 1.122 ± 0.428
3.365SerTyr: 3.365 ± 1.405
0.0SerXaa: 0.0 ± 0.0
Thr
2.243ThrAla: 2.243 ± 0.737
1.122ThrCys: 1.122 ± 0.382
2.524ThrAsp: 2.524 ± 1.433
1.963ThrGlu: 1.963 ± 0.839
1.963ThrPhe: 1.963 ± 0.448
3.085ThrGly: 3.085 ± 0.462
1.683ThrHis: 1.683 ± 0.623
3.926ThrIle: 3.926 ± 0.983
2.524ThrLys: 2.524 ± 1.236
7.011ThrLeu: 7.011 ± 1.785
1.402ThrMet: 1.402 ± 0.495
3.646ThrAsn: 3.646 ± 0.694
1.122ThrPro: 1.122 ± 0.736
1.402ThrGln: 1.402 ± 0.425
3.085ThrArg: 3.085 ± 0.785
5.609ThrSer: 5.609 ± 1.067
2.243ThrThr: 2.243 ± 0.528
1.683ThrVal: 1.683 ± 0.446
1.122ThrTrp: 1.122 ± 0.63
0.841ThrTyr: 0.841 ± 0.478
0.0ThrXaa: 0.0 ± 0.0
Val
1.402ValAla: 1.402 ± 0.632
2.243ValCys: 2.243 ± 0.48
2.804ValAsp: 2.804 ± 0.904
4.487ValGlu: 4.487 ± 0.307
1.683ValPhe: 1.683 ± 0.818
3.365ValGly: 3.365 ± 1.645
0.561ValHis: 0.561 ± 0.32
3.085ValIle: 3.085 ± 0.785
3.926ValLys: 3.926 ± 1.138
5.609ValLeu: 5.609 ± 1.89
1.122ValMet: 1.122 ± 0.526
1.683ValAsn: 1.683 ± 0.78
3.085ValPro: 3.085 ± 0.837
1.122ValGln: 1.122 ± 0.641
3.085ValArg: 3.085 ± 0.455
5.609ValSer: 5.609 ± 1.028
3.085ValThr: 3.085 ± 0.812
2.243ValVal: 2.243 ± 0.61
0.0ValTrp: 0.0 ± 0.0
1.122ValTyr: 1.122 ± 0.641
0.0ValXaa: 0.0 ± 0.0
Trp
0.28TrpAla: 0.28 ± 0.159
0.0TrpCys: 0.0 ± 0.0
2.243TrpAsp: 2.243 ± 0.894
0.841TrpGlu: 0.841 ± 0.312
0.841TrpPhe: 0.841 ± 0.691
2.243TrpGly: 2.243 ± 0.856
0.28TrpHis: 0.28 ± 0.159
2.524TrpIle: 2.524 ± 0.842
2.243TrpLys: 2.243 ± 0.531
1.683TrpLeu: 1.683 ± 0.787
0.0TrpMet: 0.0 ± 0.0
1.402TrpAsn: 1.402 ± 0.495
0.28TrpPro: 0.28 ± 0.159
0.0TrpGln: 0.0 ± 0.0
0.561TrpArg: 0.561 ± 0.315
1.122TrpSer: 1.122 ± 0.637
0.841TrpThr: 0.841 ± 0.39
1.683TrpVal: 1.683 ± 0.647
0.28TrpTrp: 0.28 ± 0.372
0.561TrpTyr: 0.561 ± 0.315
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.963TyrAla: 1.963 ± 0.726
0.561TyrCys: 0.561 ± 0.315
0.561TyrAsp: 0.561 ± 0.319
1.402TyrGlu: 1.402 ± 0.306
1.683TyrPhe: 1.683 ± 0.686
3.085TyrGly: 3.085 ± 1.12
1.683TyrHis: 1.683 ± 0.924
1.683TyrIle: 1.683 ± 0.623
3.085TyrLys: 3.085 ± 0.938
3.365TyrLeu: 3.365 ± 0.791
0.561TyrMet: 0.561 ± 0.472
2.243TyrAsn: 2.243 ± 0.973
1.683TyrPro: 1.683 ± 0.582
1.122TyrGln: 1.122 ± 0.641
2.524TyrArg: 2.524 ± 1.223
2.243TyrSer: 2.243 ± 0.743
0.561TyrThr: 0.561 ± 0.668
1.402TyrVal: 1.402 ± 0.596
0.0TyrTrp: 0.0 ± 0.0
0.28TyrTyr: 0.28 ± 0.159
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3567 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski