Amino acid dipeptide frequency for Porcine epidemic diarrhea virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
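As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues in a list of sequences and reports them in per mille. It is only a minimal example: the function name dipeptide_frequencies, the toy sequences, the one-letter residue codes, and the choice to normalise by the total number of pairs are assumptions made here, not the procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: pairs are counted with an overlapping window of
    length 2 and never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Normalising by the total number of pairs is an assumption.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter codes (not real viral proteins).
toy = ["MKVLA", "AVVL"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In this toy input there are 7 pairs in total, and VL occurs twice, once in each sequence.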

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
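The arithmetic behind the expected value can be sketched as follows. The function names are hypothetical, and the comparison against a uniform baseline is an assumption based on the paragraph above; the page does not state whether the error estimate or any other baseline is taken into account when colouring cells.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


print(expected_uniform_per_mille(20))            # 400 possible pairs
print(round(expected_uniform_per_mille(21), 3))  # 441 pairs when Xaa is included
print(classify(12.062))  # ValVal value from the table
print(classify(0.159))   # CysMet value from the table
```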

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
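A small sketch of the unit conversion, using the ValVal value from the table; the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value * 0.1


val_val = 12.062  # ValVal frequency in per mille, taken from the table
print(f"{per_mille_to_fraction(val_val):.6f}")
print(f"{per_mille_to_percent(val_val):.4f}")
```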

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.031 ± 0.791
AlaCys: 3.333 ± 0.702
AlaAsp: 3.015 ± 0.489
AlaGlu: 2.539 ± 0.546
AlaPhe: 5.396 ± 1.357
AlaGly: 4.444 ± 1.545
AlaHis: 1.111 ± 0.281
AlaIle: 3.968 ± 0.671
AlaLys: 2.698 ± 1.02
AlaLeu: 6.824 ± 0.521
AlaMet: 1.904 ± 0.815
AlaAsn: 3.809 ± 0.488
AlaPro: 2.698 ± 0.34
AlaGln: 1.746 ± 0.341
AlaArg: 2.063 ± 0.323
AlaSer: 6.824 ± 0.746
AlaThr: 4.444 ± 1.349
AlaVal: 6.031 ± 0.963
AlaTrp: 0.476 ± 0.39
AlaTyr: 1.27 ± 0.37
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.904 ± 0.677
CysCys: 2.222 ± 0.768
CysAsp: 1.111 ± 0.386
CysGlu: 0.476 ± 0.264
CysPhe: 2.698 ± 0.566
CysGly: 1.904 ± 0.293
CysHis: 0.317 ± 0.295
CysIle: 1.746 ± 0.692
CysLys: 1.904 ± 0.489
CysLeu: 2.063 ± 0.48
CysMet: 0.159 ± 0.088
CysAsn: 1.428 ± 0.288
CysPro: 0.952 ± 0.306
CysGln: 0.159 ± 0.209
CysArg: 0.952 ± 0.305
CysSer: 2.381 ± 0.648
CysThr: 2.222 ± 0.357
CysVal: 2.698 ± 0.686
CysTrp: 0.317 ± 0.176
CysTyr: 1.587 ± 0.641
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.126 ± 1.192
AspCys: 0.794 ± 0.228
AspAsp: 3.015 ± 0.841
AspGlu: 2.063 ± 0.323
AspPhe: 3.015 ± 1.197
AspGly: 5.555 ± 1.021
AspHis: 0.794 ± 0.308
AspIle: 2.222 ± 0.408
AspLys: 2.698 ± 0.947
AspLeu: 4.126 ± 0.676
AspMet: 0.952 ± 0.529
AspAsn: 1.746 ± 0.568
AspPro: 2.381 ± 0.283
AspGln: 0.794 ± 0.176
AspArg: 1.428 ± 0.365
AspSer: 3.333 ± 0.45
AspThr: 3.809 ± 0.601
AspVal: 6.507 ± 0.911
AspTrp: 0.317 ± 0.15
AspTyr: 2.063 ± 0.413
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.381 ± 0.285
GluCys: 0.794 ± 0.228
GluAsp: 2.063 ± 0.548
GluGlu: 1.904 ± 0.415
GluPhe: 2.222 ± 0.809
GluGly: 3.492 ± 0.276
GluHis: 0.952 ± 0.259
GluIle: 2.381 ± 0.993
GluLys: 1.904 ± 0.634
GluLeu: 3.65 ± 0.956
GluMet: 0.476 ± 0.298
GluAsn: 1.587 ± 0.462
GluPro: 2.381 ± 0.821
GluGln: 1.587 ± 0.442
GluArg: 1.904 ± 0.847
GluSer: 3.492 ± 0.882
GluThr: 1.27 ± 0.346
GluVal: 3.174 ± 0.816
GluTrp: 0.952 ± 0.405
GluTyr: 0.952 ± 0.343
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.174 ± 0.865
PheCys: 2.063 ± 0.533
PheAsp: 3.65 ± 0.924
PheGlu: 2.698 ± 0.71
PhePhe: 3.65 ± 0.775
PheGly: 4.285 ± 0.764
PheHis: 0.476 ± 0.264
PheIle: 2.698 ± 0.794
PheLys: 3.65 ± 0.755
PheLeu: 6.031 ± 1.311
PheMet: 1.27 ± 0.47
PheAsn: 4.285 ± 1.346
PhePro: 0.952 ± 0.305
PheGln: 1.428 ± 0.564
PheArg: 0.952 ± 0.305
PheSer: 5.237 ± 1.242
PheThr: 3.492 ± 0.895
PheVal: 6.189 ± 1.054
PheTrp: 1.27 ± 0.204
PheTyr: 3.174 ± 0.205
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.761 ± 0.525
GlyCys: 2.539 ± 0.661
GlyAsp: 4.444 ± 0.92
GlyGlu: 2.381 ± 0.981
GlyPhe: 4.761 ± 0.95
GlyGly: 5.237 ± 0.813
GlyHis: 1.27 ± 0.731
GlyIle: 3.333 ± 0.365
GlyLys: 3.65 ± 1.42
GlyLeu: 5.713 ± 1.137
GlyMet: 0.952 ± 0.305
GlyAsn: 4.126 ± 2.266
GlyPro: 2.063 ± 0.359
GlyGln: 1.746 ± 0.518
GlyArg: 3.015 ± 1.737
GlySer: 4.602 ± 1.628
GlyThr: 4.126 ± 0.682
GlyVal: 7.935 ± 0.857
GlyTrp: 0.476 ± 0.39
GlyTyr: 3.492 ± 0.948
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.428 ± 0.422
HisCys: 0.159 ± 0.088
HisAsp: 0.794 ± 0.44
HisGlu: 0.794 ± 0.266
HisPhe: 0.317 ± 0.232
HisGly: 1.428 ± 0.666
HisHis: 0.159 ± 0.088
HisIle: 0.794 ± 0.5
HisLys: 0.476 ± 0.13
HisLeu: 1.587 ± 0.366
HisMet: 0.476 ± 0.353
HisAsn: 1.428 ± 0.555
HisPro: 0.476 ± 0.13
HisGln: 0.476 ± 0.384
HisArg: 0.317 ± 0.295
HisSer: 1.587 ± 0.607
HisThr: 1.587 ± 0.406
HisVal: 1.428 ± 0.555
HisTrp: 0.317 ± 0.176
HisTyr: 0.952 ± 0.572
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.333 ± 0.778
IleCys: 1.111 ± 0.328
IleAsp: 3.015 ± 0.88
IleGlu: 1.904 ± 0.428
IlePhe: 2.857 ± 0.301
IleGly: 3.333 ± 0.829
IleHis: 0.0 ± 0.0
IleIle: 2.698 ± 0.903
IleLys: 3.015 ± 1.197
IleLeu: 4.285 ± 1.52
IleMet: 0.476 ± 0.617
IleAsn: 2.381 ± 0.421
IlePro: 3.015 ± 0.799
IleGln: 0.635 ± 0.335
IleArg: 2.063 ± 0.782
IleSer: 3.968 ± 1.744
IleThr: 3.968 ± 0.85
IleVal: 5.079 ± 1.095
IleTrp: 0.635 ± 0.352
IleTyr: 1.904 ± 0.717
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.809 ± 0.997
LysCys: 1.904 ± 0.609
LysAsp: 2.698 ± 1.111
LysGlu: 1.27 ± 0.629
LysPhe: 2.857 ± 0.773
LysGly: 2.698 ± 0.554
LysHis: 0.952 ± 0.476
LysIle: 1.428 ± 0.389
LysLys: 1.904 ± 0.909
LysLeu: 4.761 ± 1.733
LysMet: 0.635 ± 0.448
LysAsn: 3.015 ± 1.117
LysPro: 2.857 ± 0.843
LysGln: 1.27 ± 0.397
LysArg: 1.904 ± 0.391
LysSer: 4.126 ± 1.011
LysThr: 2.698 ± 0.507
LysVal: 6.348 ± 1.453
LysTrp: 0.317 ± 0.15
LysTyr: 2.857 ± 0.694
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.935 ± 0.436
LeuCys: 2.222 ± 0.815
LeuAsp: 3.968 ± 1.145
LeuGlu: 3.492 ± 0.58
LeuPhe: 5.237 ± 1.514
LeuGly: 7.142 ± 0.702
LeuHis: 2.381 ± 0.6
LeuIle: 3.174 ± 1.01
LeuLys: 4.444 ± 1.497
LeuLeu: 8.411 ± 1.328
LeuMet: 1.428 ± 0.829
LeuAsn: 5.237 ± 0.87
LeuPro: 3.65 ± 1.722
LeuGln: 4.602 ± 1.067
LeuArg: 3.333 ± 0.605
LeuSer: 7.618 ± 0.672
LeuThr: 3.968 ± 1.884
LeuVal: 7.459 ± 1.565
LeuTrp: 0.952 ± 1.844
LeuTyr: 4.285 ± 1.446
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.587 ± 0.816
MetCys: 0.159 ± 0.088
MetAsp: 0.317 ± 0.176
MetGlu: 0.317 ± 0.176
MetPhe: 1.428 ± 0.427
MetGly: 0.317 ± 0.398
MetHis: 0.635 ± 0.352
MetIle: 0.159 ± 0.088
MetLys: 0.317 ± 0.15
MetLeu: 2.539 ± 0.631
MetMet: 0.159 ± 0.088
MetAsn: 0.635 ± 0.352
MetPro: 0.635 ± 0.352
MetGln: 0.635 ± 0.276
MetArg: 0.635 ± 0.282
MetSer: 1.27 ± 0.326
MetThr: 1.27 ± 0.346
MetVal: 1.587 ± 0.456
MetTrp: 0.476 ± 0.264
MetTyr: 1.111 ± 0.362
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.492 ± 0.711
AsnCys: 2.222 ± 0.646
AsnAsp: 3.809 ± 1.072
AsnGlu: 1.587 ± 0.62
AsnPhe: 3.174 ± 0.82
AsnGly: 4.761 ± 0.713
AsnHis: 0.635 ± 0.163
AsnIle: 2.698 ± 0.774
AsnLys: 2.698 ± 0.974
AsnLeu: 3.492 ± 0.391
AsnMet: 0.794 ± 0.348
AsnAsn: 4.285 ± 2.102
AsnPro: 1.27 ± 0.758
AsnGln: 1.587 ± 1.137
AsnArg: 1.587 ± 1.162
AsnSer: 4.761 ± 1.462
AsnThr: 3.174 ± 0.542
AsnVal: 6.189 ± 0.925
AsnTrp: 0.952 ± 0.386
AsnTyr: 2.063 ± 0.604
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.904 ± 0.489
ProCys: 0.794 ± 0.266
ProAsp: 1.428 ± 0.226
ProGlu: 2.222 ± 0.276
ProPhe: 1.746 ± 0.441
ProGly: 1.746 ± 0.441
ProHis: 0.794 ± 0.176
ProIle: 1.904 ± 0.496
ProLys: 1.746 ± 0.837
ProLeu: 3.015 ± 0.843
ProMet: 0.476 ± 0.264
ProAsn: 1.746 ± 0.958
ProPro: 0.952 ± 0.196
ProGln: 0.952 ± 0.264
ProArg: 1.27 ± 0.204
ProSer: 3.65 ± 0.875
ProThr: 3.015 ± 1.043
ProVal: 5.396 ± 1.792
ProTrp: 0.476 ± 0.13
ProTyr: 0.635 ± 0.3
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.381 ± 0.711
GlnCys: 0.635 ± 0.352
GlnAsp: 0.952 ± 0.264
GlnGlu: 1.428 ± 0.763
GlnPhe: 0.794 ± 0.768
GlnGly: 1.428 ± 0.574
GlnHis: 0.635 ± 0.475
GlnIle: 2.063 ± 0.863
GlnLys: 0.952 ± 0.264
GlnLeu: 4.602 ± 1.835
GlnMet: 0.635 ± 0.184
GlnAsn: 1.27 ± 0.397
GlnPro: 1.587 ± 0.515
GlnGln: 1.904 ± 1.085
GlnArg: 1.746 ± 0.713
GlnSer: 2.381 ± 0.589
GlnThr: 1.27 ± 0.599
GlnVal: 2.381 ± 1.146
GlnTrp: 0.317 ± 0.176
GlnTyr: 1.428 ± 0.571
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.428 ± 0.422
ArgCys: 0.476 ± 0.627
ArgAsp: 1.111 ± 0.391
ArgGlu: 1.428 ± 0.499
ArgPhe: 2.222 ± 0.555
ArgGly: 3.492 ± 2.371
ArgHis: 0.952 ± 0.529
ArgIle: 1.587 ± 0.352
ArgLys: 2.539 ± 0.651
ArgLeu: 3.174 ± 1.159
ArgMet: 0.952 ± 0.196
ArgAsn: 1.746 ± 0.748
ArgPro: 0.476 ± 0.264
ArgGln: 1.428 ± 0.811
ArgArg: 1.587 ± 0.724
ArgSer: 2.539 ± 1.318
ArgThr: 2.381 ± 0.526
ArgVal: 3.174 ± 0.352
ArgTrp: 0.159 ± 0.253
ArgTyr: 1.111 ± 0.396
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.824 ± 1.061
SerCys: 1.587 ± 0.641
SerAsp: 4.444 ± 0.842
SerGlu: 3.809 ± 0.46
SerPhe: 5.872 ± 1.97
SerGly: 5.555 ± 1.571
SerHis: 1.27 ± 0.326
SerIle: 4.761 ± 1.124
SerLys: 3.65 ± 0.468
SerLeu: 6.507 ± 1.119
SerMet: 0.635 ± 0.3
SerAsn: 4.126 ± 1.1
SerPro: 0.952 ± 0.409
SerGln: 3.492 ± 1.899
SerArg: 3.174 ± 2.234
SerSer: 4.92 ± 1.363
SerThr: 4.761 ± 1.056
SerVal: 7.618 ± 0.738
SerTrp: 0.476 ± 0.617
SerTyr: 3.968 ± 1.071
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.492 ± 0.931
ThrCys: 1.428 ± 0.654
ThrAsp: 3.015 ± 0.791
ThrGlu: 2.222 ± 0.885
ThrPhe: 3.492 ± 1.072
ThrGly: 5.237 ± 0.969
ThrHis: 0.635 ± 0.49
ThrIle: 5.396 ± 1.1
ThrLys: 3.174 ± 0.812
ThrLeu: 6.824 ± 1.323
ThrMet: 0.794 ± 0.44
ThrAsn: 2.857 ± 1.9
ThrPro: 3.174 ± 0.513
ThrGln: 1.428 ± 0.331
ThrArg: 1.428 ± 0.574
ThrSer: 4.285 ± 1.016
ThrThr: 4.126 ± 1.18
ThrVal: 6.507 ± 0.522
ThrTrp: 0.635 ± 0.3
ThrTyr: 3.174 ± 0.795
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.348 ± 1.301
ValCys: 3.015 ± 0.561
ValAsp: 5.872 ± 0.998
ValGlu: 4.761 ± 1.315
ValPhe: 5.237 ± 0.836
ValGly: 5.396 ± 1.846
ValHis: 2.063 ± 0.548
ValIle: 3.492 ± 0.837
ValLys: 6.031 ± 1.662
ValLeu: 8.887 ± 1.558
ValMet: 1.428 ± 0.774
ValAsn: 6.031 ± 1.678
ValPro: 4.285 ± 1.016
ValGln: 4.126 ± 1.784
ValArg: 2.539 ± 0.705
ValSer: 8.411 ± 0.696
ValThr: 8.094 ± 0.654
ValVal: 12.062 ± 3.078
ValTrp: 0.476 ± 0.13
ValTyr: 3.65 ± 0.892
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.635 ± 0.796
TrpCys: 0.0 ± 0.0
TrpAsp: 0.794 ± 0.176
TrpGlu: 0.635 ± 0.352
TrpPhe: 1.111 ± 0.386
TrpGly: 0.0 ± 0.0
TrpHis: 0.476 ± 0.243
TrpIle: 0.635 ± 0.796
TrpLys: 0.159 ± 0.088
TrpLeu: 1.587 ± 0.358
TrpMet: 0.0 ± 0.0
TrpAsn: 0.317 ± 0.232
TrpPro: 0.635 ± 0.379
TrpGln: 0.159 ± 0.088
TrpArg: 1.111 ± 0.82
TrpSer: 0.476 ± 0.423
TrpThr: 0.635 ± 0.163
TrpVal: 1.111 ± 0.356
TrpTrp: 0.635 ± 0.379
TrpTyr: 0.159 ± 0.209
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.65 ± 1.013
TyrCys: 1.746 ± 0.618
TyrAsp: 2.222 ± 0.674
TyrGlu: 1.587 ± 0.275
TyrPhe: 2.539 ± 0.51
TyrGly: 3.174 ± 1.056
TyrHis: 0.476 ± 0.13
TyrIle: 2.539 ± 0.74
TyrLys: 2.698 ± 0.924
TyrLeu: 2.857 ± 0.814
TyrMet: 1.428 ± 0.405
TyrAsn: 3.174 ± 0.648
TyrPro: 0.476 ± 0.353
TyrGln: 0.635 ± 0.3
TyrArg: 0.952 ± 0.39
TyrSer: 2.539 ± 0.86
TyrThr: 3.015 ± 0.454
TyrVal: 3.492 ± 0.634
TyrTrp: 0.635 ± 0.282
TyrTyr: 2.698 ± 0.655
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (6302 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
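One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for every resample, and the standard deviation of those estimates is reported. This is only an illustration under stated assumptions; the function name bootstrap_sd, the fixed seed, the toy sequences in one-letter codes, and the use of the sample standard deviation are choices made here, and the exact procedure used by Proteome-pI is not described on this page.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_sd(sequences, pair, n_resamples=100, seed=0):
    """Bootstrap the per mille frequency of one dipeptide at the protein level.

    Hypothetical helper: each resample draws len(sequences) whole proteins
    with replacement, so residues within a protein stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return stdev(estimates)


# Toy input in one-letter codes (not real viral proteins).
toy = ["MKVLA", "AVVL", "GGGG"]
sd_mixed = bootstrap_sd(toy, "VL")
sd_identical = bootstrap_sd(["AVVL", "AVVL", "AVVL"], "VV")
print(f"VL: {sd_mixed:.3f}")
print(f"VV: {sd_identical:.3f}")
```

Resampling proteins rather than individual residues keeps each protein's composition intact, which is why a proteome with only a few proteins can show large error values for some dipeptides.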

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski