Amino acid dipepetide frequency for Porcine reproductive and respiratory syndrome virus (strain HB-1) (PRRSV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.486AlaAla: 8.486 ± 0.783
3.637AlaCys: 3.637 ± 0.842
3.233AlaAsp: 3.233 ± 0.941
3.637AlaGlu: 3.637 ± 0.352
4.31AlaPhe: 4.31 ± 0.498
6.735AlaGly: 6.735 ± 0.976
1.886AlaHis: 1.886 ± 0.358
4.445AlaIle: 4.445 ± 1.394
3.502AlaLys: 3.502 ± 0.5
6.735AlaLeu: 6.735 ± 0.925
0.808AlaMet: 0.808 ± 0.261
2.829AlaAsn: 2.829 ± 0.596
5.927AlaPro: 5.927 ± 1.218
3.367AlaGln: 3.367 ± 0.748
3.637AlaArg: 3.637 ± 0.565
4.445AlaSer: 4.445 ± 1.671
5.657AlaThr: 5.657 ± 0.615
6.196AlaVal: 6.196 ± 1.111
1.347AlaTrp: 1.347 ± 0.373
1.886AlaTyr: 1.886 ± 0.639
0.0AlaXaa: 0.0 ± 0.0
Cys
2.02CysAla: 2.02 ± 0.732
1.212CysCys: 1.212 ± 0.422
1.212CysAsp: 1.212 ± 0.374
1.482CysGlu: 1.482 ± 0.251
1.078CysPhe: 1.078 ± 1.025
3.502CysGly: 3.502 ± 0.708
0.539CysHis: 0.539 ± 0.199
1.616CysIle: 1.616 ± 0.758
1.078CysLys: 1.078 ± 0.744
4.445CysLeu: 4.445 ± 0.849
0.404CysMet: 0.404 ± 0.271
0.539CysAsn: 0.539 ± 0.199
1.347CysPro: 1.347 ± 0.384
0.673CysGln: 0.673 ± 0.304
1.751CysArg: 1.751 ± 0.389
1.078CysSer: 1.078 ± 0.612
2.02CysThr: 2.02 ± 0.458
1.482CysVal: 1.482 ± 0.445
1.751CysTrp: 1.751 ± 0.53
0.673CysTyr: 0.673 ± 0.196
0.0CysXaa: 0.0 ± 0.0
Asp
1.616AspAla: 1.616 ± 0.549
1.078AspCys: 1.078 ± 0.316
1.078AspAsp: 1.078 ± 0.323
2.963AspGlu: 2.963 ± 0.261
2.425AspPhe: 2.425 ± 0.485
3.233AspGly: 3.233 ± 0.628
0.269AspHis: 0.269 ± 0.621
2.29AspIle: 2.29 ± 1.111
1.886AspLys: 1.886 ± 0.495
5.523AspLeu: 5.523 ± 1.101
1.078AspMet: 1.078 ± 0.316
0.404AspAsn: 0.404 ± 0.329
4.176AspPro: 4.176 ± 1.218
1.482AspGln: 1.482 ± 0.445
2.829AspArg: 2.829 ± 0.632
2.963AspSer: 2.963 ± 0.724
2.963AspThr: 2.963 ± 0.564
2.559AspVal: 2.559 ± 0.656
1.212AspTrp: 1.212 ± 0.44
0.673AspTyr: 0.673 ± 0.446
0.0AspXaa: 0.0 ± 0.0
Glu
4.849GluAla: 4.849 ± 0.93
2.02GluCys: 2.02 ± 0.616
4.041GluAsp: 4.041 ± 0.921
3.098GluGlu: 3.098 ± 0.911
1.347GluPhe: 1.347 ± 0.676
3.098GluGly: 3.098 ± 0.466
1.347GluHis: 1.347 ± 0.442
2.29GluIle: 2.29 ± 0.469
2.02GluLys: 2.02 ± 0.429
4.445GluLeu: 4.445 ± 0.69
0.673GluMet: 0.673 ± 0.493
0.943GluAsn: 0.943 ± 0.48
3.098GluPro: 3.098 ± 0.697
2.02GluGln: 2.02 ± 0.465
1.482GluArg: 1.482 ± 0.439
1.886GluSer: 1.886 ± 0.563
2.155GluThr: 2.155 ± 0.384
3.637GluVal: 3.637 ± 0.59
0.808GluTrp: 0.808 ± 0.281
1.212GluTyr: 1.212 ± 0.374
0.0GluXaa: 0.0 ± 0.0
Phe
4.041PheAla: 4.041 ± 0.625
2.29PheCys: 2.29 ± 0.77
2.559PheAsp: 2.559 ± 0.59
2.425PheGlu: 2.425 ± 0.522
2.425PhePhe: 2.425 ± 0.538
2.559PheGly: 2.559 ± 0.31
0.673PheHis: 0.673 ± 0.723
1.078PheIle: 1.078 ± 0.323
2.155PheLys: 2.155 ± 0.659
5.388PheLeu: 5.388 ± 2.258
0.673PheMet: 0.673 ± 0.533
0.943PheAsn: 0.943 ± 0.222
3.098PhePro: 3.098 ± 0.318
1.212PheGln: 1.212 ± 0.693
1.616PheArg: 1.616 ± 0.28
3.233PheSer: 3.233 ± 0.839
3.502PheThr: 3.502 ± 1.355
3.098PheVal: 3.098 ± 1.191
0.673PheTrp: 0.673 ± 0.83
1.212PheTyr: 1.212 ± 0.383
0.0PheXaa: 0.0 ± 0.0
Gly
6.6GlyAla: 6.6 ± 1.356
2.155GlyCys: 2.155 ± 0.387
4.714GlyAsp: 4.714 ± 1.148
2.29GlyGlu: 2.29 ± 0.595
4.58GlyPhe: 4.58 ± 1.214
6.6GlyGly: 6.6 ± 1.425
2.02GlyHis: 2.02 ± 0.652
2.425GlyIle: 2.425 ± 0.406
4.31GlyLys: 4.31 ± 0.757
5.657GlyLeu: 5.657 ± 0.806
2.02GlyMet: 2.02 ± 0.477
3.233GlyAsn: 3.233 ± 0.872
4.58GlyPro: 4.58 ± 1.339
1.886GlyGln: 1.886 ± 0.544
3.906GlyArg: 3.906 ± 0.643
7.139GlySer: 7.139 ± 1.18
2.694GlyThr: 2.694 ± 0.526
6.466GlyVal: 6.466 ± 1.426
0.943GlyTrp: 0.943 ± 0.664
1.751GlyTyr: 1.751 ± 0.321
0.0GlyXaa: 0.0 ± 0.0
His
0.943HisAla: 0.943 ± 0.319
0.539HisCys: 0.539 ± 0.199
1.078HisAsp: 1.078 ± 1.525
1.078HisGlu: 1.078 ± 0.27
1.078HisPhe: 1.078 ± 0.643
1.482HisGly: 1.482 ± 0.405
1.078HisHis: 1.078 ± 0.27
0.943HisIle: 0.943 ± 0.816
1.078HisLys: 1.078 ± 0.27
3.233HisLeu: 3.233 ± 0.549
0.539HisMet: 0.539 ± 0.199
0.808HisAsn: 0.808 ± 0.312
1.886HisPro: 1.886 ± 0.632
1.482HisGln: 1.482 ± 0.944
1.886HisArg: 1.886 ± 0.383
0.539HisSer: 0.539 ± 0.533
1.616HisThr: 1.616 ± 0.471
2.425HisVal: 2.425 ± 0.663
0.808HisTrp: 0.808 ± 0.367
0.539HisTyr: 0.539 ± 0.174
0.0HisXaa: 0.0 ± 0.0
Ile
3.098IleAla: 3.098 ± 0.584
0.539IleCys: 0.539 ± 0.57
2.155IleAsp: 2.155 ± 0.342
2.02IleGlu: 2.02 ± 0.402
2.963IlePhe: 2.963 ± 1.491
2.425IleGly: 2.425 ± 0.834
0.404IleHis: 0.404 ± 0.125
2.559IleIle: 2.559 ± 0.942
1.078IleLys: 1.078 ± 0.474
4.849IleLeu: 4.849 ± 0.977
0.808IleMet: 0.808 ± 0.4
0.539IleAsn: 0.539 ± 0.41
1.616IlePro: 1.616 ± 0.54
1.347IleGln: 1.347 ± 0.246
2.29IleArg: 2.29 ± 0.635
2.425IleSer: 2.425 ± 1.017
2.02IleThr: 2.02 ± 0.682
3.233IleVal: 3.233 ± 1.15
0.269IleTrp: 0.269 ± 0.178
1.078IleTyr: 1.078 ± 0.561
0.0IleXaa: 0.0 ± 0.0
Lys
2.829LysAla: 2.829 ± 0.434
1.616LysCys: 1.616 ± 0.317
1.347LysAsp: 1.347 ± 0.442
2.829LysGlu: 2.829 ± 0.665
2.155LysPhe: 2.155 ± 0.69
2.559LysGly: 2.559 ± 0.462
1.212LysHis: 1.212 ± 0.356
2.963LysIle: 2.963 ± 0.933
1.886LysLys: 1.886 ± 0.639
4.849LysLeu: 4.849 ± 0.797
0.539LysMet: 0.539 ± 0.174
1.482LysAsn: 1.482 ± 0.47
3.098LysPro: 3.098 ± 0.744
0.808LysGln: 0.808 ± 0.312
1.212LysArg: 1.212 ± 0.22
1.482LysSer: 1.482 ± 0.36
2.155LysThr: 2.155 ± 0.616
3.367LysVal: 3.367 ± 0.322
0.673LysTrp: 0.673 ± 0.304
2.02LysTyr: 2.02 ± 0.396
0.0LysXaa: 0.0 ± 0.0
Leu
9.159LeuAla: 9.159 ± 1.311
2.963LeuCys: 2.963 ± 0.5
4.984LeuAsp: 4.984 ± 0.884
4.445LeuGlu: 4.445 ± 1.032
3.233LeuPhe: 3.233 ± 1.73
7.678LeuGly: 7.678 ± 0.582
3.233LeuHis: 3.233 ± 0.722
2.155LeuIle: 2.155 ± 1.57
4.176LeuLys: 4.176 ± 0.932
8.621LeuLeu: 8.621 ± 1.549
1.751LeuMet: 1.751 ± 0.318
4.445LeuAsn: 4.445 ± 0.558
8.351LeuPro: 8.351 ± 1.343
4.041LeuGln: 4.041 ± 0.808
6.061LeuArg: 6.061 ± 1.153
9.159LeuSer: 9.159 ± 1.194
8.351LeuThr: 8.351 ± 1.306
7.678LeuVal: 7.678 ± 1.493
1.616LeuTrp: 1.616 ± 1.26
1.482LeuTyr: 1.482 ± 0.473
0.0LeuXaa: 0.0 ± 0.0
Met
1.751MetAla: 1.751 ± 0.677
0.135MetCys: 0.135 ± 0.089
0.673MetAsp: 0.673 ± 0.196
0.673MetGlu: 0.673 ± 0.304
0.404MetPhe: 0.404 ± 0.125
1.886MetGly: 1.886 ± 0.516
0.404MetHis: 0.404 ± 0.125
0.673MetIle: 0.673 ± 0.196
0.539MetLys: 0.539 ± 0.357
2.829MetLeu: 2.829 ± 0.958
1.078MetMet: 1.078 ± 0.349
0.539MetAsn: 0.539 ± 0.174
0.539MetPro: 0.539 ± 0.357
0.135MetGln: 0.135 ± 0.578
1.212MetArg: 1.212 ± 0.375
2.425MetSer: 2.425 ± 0.484
1.886MetThr: 1.886 ± 0.428
2.29MetVal: 2.29 ± 0.31
0.404MetTrp: 0.404 ± 0.125
0.404MetTyr: 0.404 ± 0.327
0.0MetXaa: 0.0 ± 0.0
Asn
1.347AsnAla: 1.347 ± 0.766
1.347AsnCys: 1.347 ± 0.258
0.539AsnAsp: 0.539 ± 0.174
1.616AsnGlu: 1.616 ± 0.471
1.751AsnPhe: 1.751 ± 1.347
2.963AsnGly: 2.963 ± 0.597
0.539AsnHis: 0.539 ± 0.199
0.808AsnIle: 0.808 ± 0.281
1.616AsnLys: 1.616 ± 0.471
2.29AsnLeu: 2.29 ± 0.366
0.808AsnMet: 0.808 ± 0.246
0.539AsnAsn: 0.539 ± 0.329
1.482AsnPro: 1.482 ± 0.445
1.347AsnGln: 1.347 ± 0.297
2.694AsnArg: 2.694 ± 0.53
3.098AsnSer: 3.098 ± 0.798
2.02AsnThr: 2.02 ± 0.463
3.233AsnVal: 3.233 ± 0.772
0.539AsnTrp: 0.539 ± 0.311
0.539AsnTyr: 0.539 ± 0.754
0.0AsnXaa: 0.0 ± 0.0
Pro
7.139ProAla: 7.139 ± 1.799
1.616ProCys: 1.616 ± 0.39
1.886ProAsp: 1.886 ± 0.639
4.714ProGlu: 4.714 ± 0.694
2.829ProPhe: 2.829 ± 0.696
5.253ProGly: 5.253 ± 0.825
1.347ProHis: 1.347 ± 0.442
2.829ProIle: 2.829 ± 0.829
3.502ProLys: 3.502 ± 0.763
7.139ProLeu: 7.139 ± 0.675
1.078ProMet: 1.078 ± 0.625
2.425ProAsn: 2.425 ± 0.707
5.792ProPro: 5.792 ± 1.222
0.808ProGln: 0.808 ± 0.281
4.041ProArg: 4.041 ± 0.729
5.523ProSer: 5.523 ± 1.329
2.963ProThr: 2.963 ± 0.895
8.89ProVal: 8.89 ± 1.47
0.943ProTrp: 0.943 ± 0.276
1.347ProTyr: 1.347 ± 0.563
0.0ProXaa: 0.0 ± 0.0
Gln
3.098GlnAla: 3.098 ± 0.495
0.943GlnCys: 0.943 ± 0.439
0.404GlnAsp: 0.404 ± 0.387
1.078GlnGlu: 1.078 ± 0.349
1.616GlnPhe: 1.616 ± 0.562
2.694GlnGly: 2.694 ± 0.783
1.212GlnHis: 1.212 ± 0.923
0.673GlnIle: 0.673 ± 0.196
1.078GlnLys: 1.078 ± 0.471
4.041GlnLeu: 4.041 ± 0.955
0.539GlnMet: 0.539 ± 0.174
1.482GlnAsn: 1.482 ± 0.439
1.347GlnPro: 1.347 ± 0.392
0.808GlnGln: 0.808 ± 0.27
1.347GlnArg: 1.347 ± 0.356
1.212GlnSer: 1.212 ± 0.523
1.751GlnThr: 1.751 ± 0.509
4.714GlnVal: 4.714 ± 0.72
0.539GlnTrp: 0.539 ± 0.577
0.943GlnTyr: 0.943 ± 0.276
0.0GlnXaa: 0.0 ± 0.0
Arg
4.714ArgAla: 4.714 ± 0.644
1.616ArgCys: 1.616 ± 0.562
0.673ArgAsp: 0.673 ± 0.446
1.751ArgGlu: 1.751 ± 0.566
2.02ArgPhe: 2.02 ± 0.521
3.367ArgGly: 3.367 ± 0.81
2.02ArgHis: 2.02 ± 0.378
2.425ArgIle: 2.425 ± 0.826
1.482ArgLys: 1.482 ± 0.36
5.119ArgLeu: 5.119 ± 0.859
3.098ArgMet: 3.098 ± 0.791
1.616ArgAsn: 1.616 ± 0.345
3.367ArgPro: 3.367 ± 0.848
1.212ArgGln: 1.212 ± 0.375
3.502ArgArg: 3.502 ± 1.045
3.637ArgSer: 3.637 ± 0.778
3.098ArgThr: 3.098 ± 0.573
5.119ArgVal: 5.119 ± 0.796
1.751ArgTrp: 1.751 ± 0.477
1.751ArgTyr: 1.751 ± 0.998
0.0ArgXaa: 0.0 ± 0.0
Ser
5.657SerAla: 5.657 ± 0.713
2.425SerCys: 2.425 ± 0.853
3.502SerAsp: 3.502 ± 0.645
3.233SerGlu: 3.233 ± 0.605
2.29SerPhe: 2.29 ± 0.853
7.274SerGly: 7.274 ± 1.709
2.29SerHis: 2.29 ± 0.772
1.482SerIle: 1.482 ± 0.797
1.886SerLys: 1.886 ± 0.563
7.408SerLeu: 7.408 ± 0.597
2.155SerMet: 2.155 ± 0.916
2.425SerAsn: 2.425 ± 1.063
6.735SerPro: 6.735 ± 0.555
2.559SerGln: 2.559 ± 0.699
3.233SerArg: 3.233 ± 2.214
7.274SerSer: 7.274 ± 3.45
3.098SerThr: 3.098 ± 0.57
4.984SerVal: 4.984 ± 0.758
1.212SerTrp: 1.212 ± 0.803
2.29SerTyr: 2.29 ± 0.558
0.0SerXaa: 0.0 ± 0.0
Thr
4.984ThrAla: 4.984 ± 0.868
1.078ThrCys: 1.078 ± 0.287
2.559ThrAsp: 2.559 ± 0.351
1.482ThrGlu: 1.482 ± 0.467
1.347ThrPhe: 1.347 ± 0.775
4.041ThrGly: 4.041 ± 0.916
1.482ThrHis: 1.482 ± 0.298
3.098ThrIle: 3.098 ± 0.887
2.829ThrLys: 2.829 ± 0.404
4.445ThrLeu: 4.445 ± 1.194
0.943ThrMet: 0.943 ± 0.331
2.02ThrAsn: 2.02 ± 0.359
7.274ThrPro: 7.274 ± 1.086
2.425ThrGln: 2.425 ± 0.642
3.233ThrArg: 3.233 ± 0.729
5.253ThrSer: 5.253 ± 0.686
2.963ThrThr: 2.963 ± 0.431
5.388ThrVal: 5.388 ± 0.624
1.212ThrTrp: 1.212 ± 0.264
0.943ThrTyr: 0.943 ± 0.289
0.0ThrXaa: 0.0 ± 0.0
Val
7.274ValAla: 7.274 ± 0.686
1.212ValCys: 1.212 ± 1.026
3.772ValAsp: 3.772 ± 1.178
3.637ValGlu: 3.637 ± 0.884
4.984ValPhe: 4.984 ± 0.687
5.657ValGly: 5.657 ± 0.825
1.616ValHis: 1.616 ± 0.649
2.425ValIle: 2.425 ± 0.686
3.772ValLys: 3.772 ± 0.627
10.237ValLeu: 10.237 ± 0.669
1.482ValMet: 1.482 ± 0.445
2.963ValAsn: 2.963 ± 0.42
6.6ValPro: 6.6 ± 1.377
2.02ValGln: 2.02 ± 0.458
4.58ValArg: 4.58 ± 0.568
7.543ValSer: 7.543 ± 1.554
4.714ValThr: 4.714 ± 0.738
6.6ValVal: 6.6 ± 1.509
0.943ValTrp: 0.943 ± 0.276
2.963ValTyr: 2.963 ± 0.894
0.0ValXaa: 0.0 ± 0.0
Trp
0.943TrpAla: 0.943 ± 0.351
0.943TrpCys: 0.943 ± 0.705
0.943TrpAsp: 0.943 ± 0.276
0.808TrpGlu: 0.808 ± 0.249
0.943TrpPhe: 0.943 ± 0.774
1.482TrpGly: 1.482 ± 0.743
0.673TrpHis: 0.673 ± 0.304
0.269TrpIle: 0.269 ± 0.087
0.404TrpLys: 0.404 ± 0.339
3.637TrpLeu: 3.637 ± 1.601
0.135TrpMet: 0.135 ± 0.371
0.269TrpAsn: 0.269 ± 0.087
0.404TrpPro: 0.404 ± 0.339
0.404TrpGln: 0.404 ± 0.125
1.482TrpArg: 1.482 ± 0.493
0.539TrpSer: 0.539 ± 0.511
2.155TrpThr: 2.155 ± 0.697
0.808TrpVal: 0.808 ± 0.249
0.539TrpTrp: 0.539 ± 0.297
0.673TrpTyr: 0.673 ± 0.196
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.155TyrAla: 2.155 ± 0.46
0.673TyrCys: 0.673 ± 0.282
1.347TyrAsp: 1.347 ± 0.442
1.212TyrGlu: 1.212 ± 0.284
1.078TyrPhe: 1.078 ± 0.287
1.482TyrGly: 1.482 ± 0.251
0.808TyrHis: 0.808 ± 0.425
0.269TyrIle: 0.269 ± 0.384
0.808TyrLys: 0.808 ± 0.535
2.694TyrLeu: 2.694 ± 0.554
0.135TyrMet: 0.135 ± 0.089
0.673TyrAsn: 0.673 ± 0.535
1.347TyrPro: 1.347 ± 0.392
1.482TyrGln: 1.482 ± 0.305
1.347TyrArg: 1.347 ± 0.781
2.559TyrSer: 2.559 ± 0.351
1.212TyrThr: 1.212 ± 0.792
2.829TyrVal: 2.829 ± 0.337
0.269TyrTrp: 0.269 ± 0.087
0.808TyrTyr: 0.808 ± 0.503
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (7425 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski