Amino acid dipepetide frequency for Yug Bogdanovac vesiculovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.322AlaAla: 3.322 ± 0.789
1.384AlaCys: 1.384 ± 0.612
3.045AlaAsp: 3.045 ± 1.445
3.876AlaGlu: 3.876 ± 0.603
1.938AlaPhe: 1.938 ± 0.542
3.045AlaGly: 3.045 ± 0.772
1.661AlaHis: 1.661 ± 0.633
2.769AlaIle: 2.769 ± 1.2
1.938AlaLys: 1.938 ± 0.667
8.583AlaLeu: 8.583 ± 1.212
1.384AlaMet: 1.384 ± 0.777
2.769AlaAsn: 2.769 ± 0.428
2.492AlaPro: 2.492 ± 1.02
2.215AlaGln: 2.215 ± 0.761
2.492AlaArg: 2.492 ± 0.96
5.537AlaSer: 5.537 ± 1.566
1.661AlaThr: 1.661 ± 0.989
3.599AlaVal: 3.599 ± 1.114
1.384AlaTrp: 1.384 ± 1.221
2.769AlaTyr: 2.769 ± 0.373
0.0AlaXaa: 0.0 ± 0.0
Cys
0.831CysAla: 0.831 ± 0.341
0.831CysCys: 0.831 ± 0.317
1.938CysAsp: 1.938 ± 1.426
0.554CysGlu: 0.554 ± 0.508
1.384CysPhe: 1.384 ± 0.298
1.107CysGly: 1.107 ± 0.395
0.277CysHis: 0.277 ± 0.352
1.661CysIle: 1.661 ± 0.901
0.831CysLys: 0.831 ± 0.695
0.831CysLeu: 0.831 ± 0.451
0.0CysMet: 0.0 ± 0.0
0.554CysAsn: 0.554 ± 0.299
1.107CysPro: 1.107 ± 0.649
0.831CysGln: 0.831 ± 0.317
1.938CysArg: 1.938 ± 0.771
0.554CysSer: 0.554 ± 0.299
0.277CysThr: 0.277 ± 0.15
0.831CysVal: 0.831 ± 0.424
0.554CysTrp: 0.554 ± 0.3
0.554CysTyr: 0.554 ± 0.373
0.0CysXaa: 0.0 ± 0.0
Asp
2.492AspAla: 2.492 ± 0.505
0.554AspCys: 0.554 ± 0.329
4.43AspAsp: 4.43 ± 2.19
3.322AspGlu: 3.322 ± 1.581
2.215AspPhe: 2.215 ± 0.363
3.322AspGly: 3.322 ± 0.205
1.938AspHis: 1.938 ± 0.319
3.045AspIle: 3.045 ± 1.235
2.769AspLys: 2.769 ± 0.806
6.368AspLeu: 6.368 ± 1.045
2.492AspMet: 2.492 ± 1.014
2.215AspAsn: 2.215 ± 0.729
5.26AspPro: 5.26 ± 1.317
2.215AspGln: 2.215 ± 0.904
1.661AspArg: 1.661 ± 0.515
4.707AspSer: 4.707 ± 0.939
3.045AspThr: 3.045 ± 1.035
4.707AspVal: 4.707 ± 1.628
1.938AspTrp: 1.938 ± 0.775
3.599AspTyr: 3.599 ± 1.036
0.0AspXaa: 0.0 ± 0.0
Glu
3.876GluAla: 3.876 ± 2.794
0.554GluCys: 0.554 ± 0.299
3.599GluAsp: 3.599 ± 1.021
3.599GluGlu: 3.599 ± 1.82
3.876GluPhe: 3.876 ± 1.062
3.045GluGly: 3.045 ± 1.073
1.384GluHis: 1.384 ± 0.597
3.322GluIle: 3.322 ± 0.716
2.215GluLys: 2.215 ± 0.898
4.983GluLeu: 4.983 ± 1.152
1.107GluMet: 1.107 ± 0.585
2.215GluAsn: 2.215 ± 0.695
2.215GluPro: 2.215 ± 0.841
2.769GluGln: 2.769 ± 0.49
3.322GluArg: 3.322 ± 0.814
5.26GluSer: 5.26 ± 1.6
3.322GluThr: 3.322 ± 0.539
3.599GluVal: 3.599 ± 0.797
1.384GluTrp: 1.384 ± 0.476
2.492GluTyr: 2.492 ± 1.431
0.0GluXaa: 0.0 ± 0.0
Phe
2.492PheAla: 2.492 ± 0.507
0.831PheCys: 0.831 ± 0.317
3.045PheAsp: 3.045 ± 0.45
0.554PheGlu: 0.554 ± 0.3
2.492PhePhe: 2.492 ± 0.639
3.322PheGly: 3.322 ± 0.39
2.769PheHis: 2.769 ± 0.82
1.938PheIle: 1.938 ± 0.621
3.045PheLys: 3.045 ± 0.664
3.322PheLeu: 3.322 ± 1.802
0.554PheMet: 0.554 ± 0.3
2.492PheAsn: 2.492 ± 1.127
3.322PhePro: 3.322 ± 1.486
0.831PheGln: 0.831 ± 0.451
3.599PheArg: 3.599 ± 0.961
3.045PheSer: 3.045 ± 0.859
2.215PheThr: 2.215 ± 1.533
3.045PheVal: 3.045 ± 0.805
0.554PheTrp: 0.554 ± 0.329
1.661PheTyr: 1.661 ± 0.999
0.0PheXaa: 0.0 ± 0.0
Gly
4.153GlyAla: 4.153 ± 1.574
0.277GlyCys: 0.277 ± 0.57
3.876GlyAsp: 3.876 ± 0.699
4.153GlyGlu: 4.153 ± 1.206
2.492GlyPhe: 2.492 ± 0.738
3.322GlyGly: 3.322 ± 0.589
1.384GlyHis: 1.384 ± 0.518
3.045GlyIle: 3.045 ± 0.82
3.876GlyLys: 3.876 ± 0.959
9.136GlyLeu: 9.136 ± 0.475
1.384GlyMet: 1.384 ± 0.751
1.661GlyAsn: 1.661 ± 0.901
2.492GlyPro: 2.492 ± 1.528
2.215GlyGln: 2.215 ± 0.988
3.876GlyArg: 3.876 ± 0.515
4.153GlySer: 4.153 ± 0.833
4.707GlyThr: 4.707 ± 0.991
6.091GlyVal: 6.091 ± 1.882
1.107GlyTrp: 1.107 ± 0.598
0.831GlyTyr: 0.831 ± 0.92
0.0GlyXaa: 0.0 ± 0.0
His
1.938HisAla: 1.938 ± 0.908
0.554HisCys: 0.554 ± 0.299
0.554HisAsp: 0.554 ± 0.3
1.107HisGlu: 1.107 ± 0.601
1.938HisPhe: 1.938 ± 0.741
1.661HisGly: 1.661 ± 0.769
1.661HisHis: 1.661 ± 1.116
1.661HisIle: 1.661 ± 0.682
1.384HisLys: 1.384 ± 0.554
1.661HisLeu: 1.661 ± 0.361
0.277HisMet: 0.277 ± 0.352
0.554HisAsn: 0.554 ± 0.3
2.492HisPro: 2.492 ± 0.897
0.554HisGln: 0.554 ± 0.3
1.938HisArg: 1.938 ± 0.775
3.322HisSer: 3.322 ± 0.972
1.938HisThr: 1.938 ± 0.513
2.215HisVal: 2.215 ± 0.507
1.384HisTrp: 1.384 ± 0.574
0.277HisTyr: 0.277 ± 0.15
0.0HisXaa: 0.0 ± 0.0
Ile
3.045IleAla: 3.045 ± 0.465
1.107IleCys: 1.107 ± 0.395
4.707IleAsp: 4.707 ± 1.293
4.707IleGlu: 4.707 ± 1.811
1.384IlePhe: 1.384 ± 0.499
3.876IleGly: 3.876 ± 0.56
0.831IleHis: 0.831 ± 0.387
3.045IleIle: 3.045 ± 1.385
4.983IleLys: 4.983 ± 0.806
2.769IleLeu: 2.769 ± 0.291
1.384IleMet: 1.384 ± 0.364
2.769IleAsn: 2.769 ± 0.55
3.599IlePro: 3.599 ± 0.702
3.876IleGln: 3.876 ± 0.895
6.091IleArg: 6.091 ± 2.103
3.876IleSer: 3.876 ± 1.111
2.215IleThr: 2.215 ± 0.908
3.876IleVal: 3.876 ± 0.997
0.831IleTrp: 0.831 ± 0.317
2.215IleTyr: 2.215 ± 1.021
0.0IleXaa: 0.0 ± 0.0
Lys
1.384LysAla: 1.384 ± 0.78
1.107LysCys: 1.107 ± 0.601
2.769LysAsp: 2.769 ± 0.49
3.876LysGlu: 3.876 ± 1.031
3.045LysPhe: 3.045 ± 0.859
5.814LysGly: 5.814 ± 1.089
2.492LysHis: 2.492 ± 0.96
3.599LysIle: 3.599 ± 0.671
4.983LysLys: 4.983 ± 1.414
5.537LysLeu: 5.537 ± 1.908
1.107LysMet: 1.107 ± 0.395
1.938LysAsn: 1.938 ± 0.366
1.384LysPro: 1.384 ± 0.612
0.554LysGln: 0.554 ± 0.52
2.769LysArg: 2.769 ± 1.398
4.707LysSer: 4.707 ± 0.942
2.215LysThr: 2.215 ± 0.695
2.769LysVal: 2.769 ± 0.291
1.384LysTrp: 1.384 ± 0.298
1.384LysTyr: 1.384 ± 0.733
0.0LysXaa: 0.0 ± 0.0
Leu
4.983LeuAla: 4.983 ± 1.431
2.215LeuCys: 2.215 ± 0.773
6.368LeuAsp: 6.368 ± 1.862
4.707LeuGlu: 4.707 ± 1.888
3.876LeuPhe: 3.876 ± 1.264
4.43LeuGly: 4.43 ± 1.382
3.045LeuHis: 3.045 ± 1.101
6.091LeuIle: 6.091 ± 1.588
6.368LeuLys: 6.368 ± 1.163
6.368LeuLeu: 6.368 ± 1.287
3.322LeuMet: 3.322 ± 0.864
4.153LeuAsn: 4.153 ± 0.64
2.769LeuPro: 2.769 ± 0.707
1.938LeuGln: 1.938 ± 0.466
8.583LeuArg: 8.583 ± 1.597
10.52LeuSer: 10.52 ± 1.539
4.153LeuThr: 4.153 ± 1.633
3.876LeuVal: 3.876 ± 0.62
1.107LeuTrp: 1.107 ± 0.395
3.322LeuTyr: 3.322 ± 0.999
0.0LeuXaa: 0.0 ± 0.0
Met
1.661MetAla: 1.661 ± 0.553
0.277MetCys: 0.277 ± 0.15
1.938MetAsp: 1.938 ± 0.933
1.938MetGlu: 1.938 ± 0.626
1.107MetPhe: 1.107 ± 0.695
1.107MetGly: 1.107 ± 0.412
0.831MetHis: 0.831 ± 0.451
1.938MetIle: 1.938 ± 0.666
0.554MetLys: 0.554 ± 0.508
2.215MetLeu: 2.215 ± 0.554
0.831MetMet: 0.831 ± 0.341
1.107MetAsn: 1.107 ± 0.598
1.384MetPro: 1.384 ± 0.828
0.277MetGln: 0.277 ± 0.15
1.661MetArg: 1.661 ± 0.361
3.876MetSer: 3.876 ± 1.259
0.831MetThr: 0.831 ± 0.341
1.384MetVal: 1.384 ± 0.554
0.0MetTrp: 0.0 ± 0.0
0.554MetTyr: 0.554 ± 0.329
0.0MetXaa: 0.0 ± 0.0
Asn
3.045AsnAla: 3.045 ± 0.821
0.0AsnCys: 0.0 ± 0.0
2.215AsnAsp: 2.215 ± 0.864
1.938AsnGlu: 1.938 ± 0.665
1.661AsnPhe: 1.661 ± 0.989
3.322AsnGly: 3.322 ± 0.621
1.384AsnHis: 1.384 ± 0.751
3.045AsnIle: 3.045 ± 0.755
1.661AsnLys: 1.661 ± 0.361
4.983AsnLeu: 4.983 ± 2.308
0.831AsnMet: 0.831 ± 0.582
1.661AsnAsn: 1.661 ± 0.553
3.045AsnPro: 3.045 ± 0.261
2.215AsnGln: 2.215 ± 0.571
1.661AsnArg: 1.661 ± 1.114
3.599AsnSer: 3.599 ± 0.79
3.322AsnThr: 3.322 ± 0.831
1.107AsnVal: 1.107 ± 1.075
0.831AsnTrp: 0.831 ± 0.317
1.661AsnTyr: 1.661 ± 0.665
0.0AsnXaa: 0.0 ± 0.0
Pro
4.983ProAla: 4.983 ± 1.073
0.277ProCys: 0.277 ± 0.381
2.769ProAsp: 2.769 ± 0.64
1.938ProGlu: 1.938 ± 1.724
3.045ProPhe: 3.045 ± 0.733
1.938ProGly: 1.938 ± 1.336
1.107ProHis: 1.107 ± 0.365
2.492ProIle: 2.492 ± 1.015
2.769ProLys: 2.769 ± 1.061
4.707ProLeu: 4.707 ± 1.066
1.107ProMet: 1.107 ± 0.642
2.769ProAsn: 2.769 ± 1.046
2.492ProPro: 2.492 ± 1.497
0.831ProGln: 0.831 ± 0.317
2.215ProArg: 2.215 ± 0.571
4.707ProSer: 4.707 ± 1.719
2.769ProThr: 2.769 ± 0.865
3.876ProVal: 3.876 ± 1.205
0.554ProTrp: 0.554 ± 0.3
0.831ProTyr: 0.831 ± 1.142
0.0ProXaa: 0.0 ± 0.0
Gln
1.938GlnAla: 1.938 ± 0.741
1.107GlnCys: 1.107 ± 0.304
1.661GlnAsp: 1.661 ± 0.665
2.492GlnGlu: 2.492 ± 0.397
1.661GlnPhe: 1.661 ± 0.426
1.384GlnGly: 1.384 ± 0.473
0.277GlnHis: 0.277 ± 0.15
1.938GlnIle: 1.938 ± 0.7
1.661GlnLys: 1.661 ± 0.641
2.492GlnLeu: 2.492 ± 0.69
1.938GlnMet: 1.938 ± 1.511
1.938GlnAsn: 1.938 ± 0.466
1.384GlnPro: 1.384 ± 0.928
0.277GlnGln: 0.277 ± 0.381
2.769GlnArg: 2.769 ± 0.626
1.938GlnSer: 1.938 ± 0.366
1.384GlnThr: 1.384 ± 0.507
1.384GlnVal: 1.384 ± 0.518
0.554GlnTrp: 0.554 ± 0.299
0.831GlnTyr: 0.831 ± 0.487
0.0GlnXaa: 0.0 ± 0.0
Arg
4.153ArgAla: 4.153 ± 1.023
1.661ArgCys: 1.661 ± 0.673
4.707ArgAsp: 4.707 ± 0.731
4.707ArgGlu: 4.707 ± 0.841
3.322ArgPhe: 3.322 ± 1.319
5.537ArgGly: 5.537 ± 1.556
0.831ArgHis: 0.831 ± 0.451
4.153ArgIle: 4.153 ± 0.934
2.215ArgLys: 2.215 ± 0.698
3.322ArgLeu: 3.322 ± 0.757
1.107ArgMet: 1.107 ± 0.601
3.045ArgAsn: 3.045 ± 1.008
2.215ArgPro: 2.215 ± 0.888
1.938ArgGln: 1.938 ± 0.507
4.43ArgArg: 4.43 ± 1.188
4.43ArgSer: 4.43 ± 1.212
3.322ArgThr: 3.322 ± 1.152
4.983ArgVal: 4.983 ± 0.883
0.831ArgTrp: 0.831 ± 0.317
1.384ArgTyr: 1.384 ± 0.364
0.0ArgXaa: 0.0 ± 0.0
Ser
6.368SerAla: 6.368 ± 1.627
1.107SerCys: 1.107 ± 0.683
5.26SerAsp: 5.26 ± 1.211
5.537SerGlu: 5.537 ± 2.214
2.215SerPhe: 2.215 ± 0.954
6.091SerGly: 6.091 ± 1.133
1.938SerHis: 1.938 ± 0.7
7.475SerIle: 7.475 ± 1.399
4.153SerLys: 4.153 ± 1.392
7.752SerLeu: 7.752 ± 1.94
1.661SerMet: 1.661 ± 0.675
3.876SerAsn: 3.876 ± 0.421
3.599SerPro: 3.599 ± 1.612
3.045SerGln: 3.045 ± 0.833
4.707SerArg: 4.707 ± 1.08
7.475SerSer: 7.475 ± 1.921
5.537SerThr: 5.537 ± 0.746
4.983SerVal: 4.983 ± 0.677
1.938SerTrp: 1.938 ± 0.366
3.045SerTyr: 3.045 ± 0.819
0.0SerXaa: 0.0 ± 0.0
Thr
1.661ThrAla: 1.661 ± 0.515
1.384ThrCys: 1.384 ± 0.597
3.045ThrAsp: 3.045 ± 0.878
2.769ThrGlu: 2.769 ± 0.512
1.384ThrPhe: 1.384 ± 0.562
3.599ThrGly: 3.599 ± 0.78
1.661ThrHis: 1.661 ± 0.506
1.384ThrIle: 1.384 ± 0.562
2.769ThrLys: 2.769 ± 0.918
6.091ThrLeu: 6.091 ± 2.127
2.492ThrMet: 2.492 ± 1.074
2.769ThrAsn: 2.769 ± 0.939
1.384ThrPro: 1.384 ± 0.364
1.661ThrGln: 1.661 ± 0.508
1.938ThrArg: 1.938 ± 0.49
6.368ThrSer: 6.368 ± 0.857
2.769ThrThr: 2.769 ± 0.767
3.045ThrVal: 3.045 ± 0.821
1.107ThrTrp: 1.107 ± 0.51
0.554ThrTyr: 0.554 ± 0.3
0.0ThrXaa: 0.0 ± 0.0
Val
3.876ValAla: 3.876 ± 1.737
1.384ValCys: 1.384 ± 0.751
3.045ValAsp: 3.045 ± 0.808
3.599ValGlu: 3.599 ± 2.309
3.322ValPhe: 3.322 ± 1.108
3.876ValGly: 3.876 ± 1.079
1.107ValHis: 1.107 ± 0.657
4.707ValIle: 4.707 ± 0.767
3.322ValLys: 3.322 ± 0.948
5.26ValLeu: 5.26 ± 1.239
1.107ValMet: 1.107 ± 0.453
2.492ValAsn: 2.492 ± 0.315
3.045ValPro: 3.045 ± 0.978
1.661ValGln: 1.661 ± 1.052
3.599ValArg: 3.599 ± 0.709
5.26ValSer: 5.26 ± 0.845
2.769ValThr: 2.769 ± 1.495
2.215ValVal: 2.215 ± 1.404
1.938ValTrp: 1.938 ± 0.601
1.938ValTyr: 1.938 ± 0.775
0.0ValXaa: 0.0 ± 0.0
Trp
0.277TrpAla: 0.277 ± 0.15
0.277TrpCys: 0.277 ± 0.57
1.107TrpAsp: 1.107 ± 0.304
1.661TrpGlu: 1.661 ± 0.635
1.107TrpPhe: 1.107 ± 0.598
2.215TrpGly: 2.215 ± 0.363
0.277TrpHis: 0.277 ± 0.15
2.492TrpIle: 2.492 ± 1.919
1.107TrpLys: 1.107 ± 0.395
1.384TrpLeu: 1.384 ± 0.653
0.0TrpMet: 0.0 ± 0.0
0.831TrpAsn: 0.831 ± 0.635
0.277TrpPro: 0.277 ± 0.15
0.554TrpGln: 0.554 ± 0.3
0.831TrpArg: 0.831 ± 0.451
2.492TrpSer: 2.492 ± 1.055
0.554TrpThr: 0.554 ± 0.701
1.661TrpVal: 1.661 ± 0.974
0.277TrpTrp: 0.277 ± 0.381
0.277TrpTyr: 0.277 ± 0.352
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.384TyrAla: 1.384 ± 0.789
0.554TyrCys: 0.554 ± 0.299
2.215TyrAsp: 2.215 ± 0.821
1.384TyrGlu: 1.384 ± 1.018
1.661TyrPhe: 1.661 ± 0.489
2.215TyrGly: 2.215 ± 0.912
2.215TyrHis: 2.215 ± 0.642
1.384TyrIle: 1.384 ± 0.603
2.215TyrLys: 2.215 ± 0.898
4.43TyrLeu: 4.43 ± 1.194
1.107TyrMet: 1.107 ± 0.412
1.384TyrAsn: 1.384 ± 0.984
2.215TyrPro: 2.215 ± 1.521
0.554TyrGln: 0.554 ± 0.329
2.215TyrArg: 2.215 ± 1.314
1.938TyrSer: 1.938 ± 0.775
0.831TyrThr: 0.831 ± 0.635
0.277TyrVal: 0.277 ± 0.381
0.0TyrTrp: 0.0 ± 0.0
0.554TyrTyr: 0.554 ± 0.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3613 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski