Amino acid dipepetide frequency for Equine arteritis virus (strain Bucyrus) (EAV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.139AlaAla: 12.139 ± 1.074
3.773AlaCys: 3.773 ± 0.972
3.117AlaAsp: 3.117 ± 0.806
3.281AlaGlu: 3.281 ± 0.799
2.297AlaPhe: 2.297 ± 1.073
7.054AlaGly: 7.054 ± 1.19
1.476AlaHis: 1.476 ± 0.298
3.937AlaIle: 3.937 ± 2.975
3.445AlaLys: 3.445 ± 0.652
9.022AlaLeu: 9.022 ± 0.977
1.804AlaMet: 1.804 ± 0.552
3.773AlaAsn: 3.773 ± 0.891
4.593AlaPro: 4.593 ± 0.572
2.297AlaGln: 2.297 ± 0.492
4.101AlaArg: 4.101 ± 0.988
8.694AlaSer: 8.694 ± 0.979
6.726AlaThr: 6.726 ± 0.775
9.186AlaVal: 9.186 ± 1.41
1.312AlaTrp: 1.312 ± 0.364
3.773AlaTyr: 3.773 ± 1.272
0.0AlaXaa: 0.0 ± 0.0
Cys
1.804CysAla: 1.804 ± 0.393
1.476CysCys: 1.476 ± 0.344
3.117CysAsp: 3.117 ± 0.522
1.804CysGlu: 1.804 ± 0.585
1.969CysPhe: 1.969 ± 0.87
2.461CysGly: 2.461 ± 0.741
1.64CysHis: 1.64 ± 0.508
0.656CysIle: 0.656 ± 0.752
0.82CysLys: 0.82 ± 0.275
4.757CysLeu: 4.757 ± 0.88
0.328CysMet: 0.328 ± 0.527
0.328CysAsn: 0.328 ± 0.123
0.984CysPro: 0.984 ± 0.323
0.492CysGln: 0.492 ± 0.295
1.804CysArg: 1.804 ± 0.314
2.461CysSer: 2.461 ± 0.622
1.969CysThr: 1.969 ± 0.548
2.133CysVal: 2.133 ± 0.427
1.148CysTrp: 1.148 ± 0.537
1.64CysTyr: 1.64 ± 0.32
0.0CysXaa: 0.0 ± 0.0
Asp
3.609AspAla: 3.609 ± 0.716
1.312AspCys: 1.312 ± 0.542
2.789AspAsp: 2.789 ± 0.694
2.133AspGlu: 2.133 ± 0.586
3.281AspPhe: 3.281 ± 0.351
3.609AspGly: 3.609 ± 0.794
1.969AspHis: 1.969 ± 0.474
1.148AspIle: 1.148 ± 0.396
1.64AspLys: 1.64 ± 0.396
6.07AspLeu: 6.07 ± 1.199
0.492AspMet: 0.492 ± 0.303
0.656AspAsn: 0.656 ± 0.274
3.117AspPro: 3.117 ± 0.824
1.148AspGln: 1.148 ± 0.386
2.953AspArg: 2.953 ± 0.766
1.969AspSer: 1.969 ± 0.523
1.804AspThr: 1.804 ± 0.454
4.265AspVal: 4.265 ± 0.737
1.148AspTrp: 1.148 ± 0.432
1.476AspTyr: 1.476 ± 0.392
0.0AspXaa: 0.0 ± 0.0
Glu
3.609GluAla: 3.609 ± 0.705
0.492GluCys: 0.492 ± 0.161
0.82GluAsp: 0.82 ± 0.357
2.789GluGlu: 2.789 ± 0.714
0.656GluPhe: 0.656 ± 0.344
4.429GluGly: 4.429 ± 1.024
1.804GluHis: 1.804 ± 0.39
0.656GluIle: 0.656 ± 0.406
1.148GluLys: 1.148 ± 0.525
2.953GluLeu: 2.953 ± 0.743
0.656GluMet: 0.656 ± 0.258
0.0GluAsn: 0.0 ± 0.0
1.476GluPro: 1.476 ± 0.517
2.461GluGln: 2.461 ± 0.701
0.984GluArg: 0.984 ± 0.371
1.969GluSer: 1.969 ± 0.409
0.82GluThr: 0.82 ± 0.507
2.625GluVal: 2.625 ± 0.876
0.656GluTrp: 0.656 ± 0.24
0.82GluTyr: 0.82 ± 0.457
0.0GluXaa: 0.0 ± 0.0
Phe
5.085PheAla: 5.085 ± 1.366
1.312PheCys: 1.312 ± 0.347
2.297PheAsp: 2.297 ± 0.319
1.148PheGlu: 1.148 ± 0.525
1.969PhePhe: 1.969 ± 1.322
2.297PheGly: 2.297 ± 0.58
1.148PheHis: 1.148 ± 0.396
1.969PheIle: 1.969 ± 1.309
1.804PheLys: 1.804 ± 0.452
5.085PheLeu: 5.085 ± 1.105
1.64PheMet: 1.64 ± 0.462
0.492PheAsn: 0.492 ± 0.337
2.953PhePro: 2.953 ± 0.586
1.476PheGln: 1.476 ± 0.343
2.133PheArg: 2.133 ± 0.558
3.773PheSer: 3.773 ± 1.037
2.461PheThr: 2.461 ± 0.505
4.265PheVal: 4.265 ± 1.357
0.164PheTrp: 0.164 ± 0.54
1.312PheTyr: 1.312 ± 0.939
0.0PheXaa: 0.0 ± 0.0
Gly
6.234GlyAla: 6.234 ± 0.927
2.461GlyCys: 2.461 ± 0.651
5.577GlyAsp: 5.577 ± 1.117
1.312GlyGlu: 1.312 ± 0.348
2.625GlyPhe: 2.625 ± 0.68
4.101GlyGly: 4.101 ± 1.101
1.804GlyHis: 1.804 ± 0.617
2.625GlyIle: 2.625 ± 0.837
2.133GlyLys: 2.133 ± 0.603
9.35GlyLeu: 9.35 ± 1.019
1.476GlyMet: 1.476 ± 0.45
3.445GlyAsn: 3.445 ± 0.716
3.609GlyPro: 3.609 ± 0.431
2.789GlyGln: 2.789 ± 0.651
4.101GlyArg: 4.101 ± 0.802
6.89GlySer: 6.89 ± 0.819
3.445GlyThr: 3.445 ± 0.68
4.757GlyVal: 4.757 ± 0.618
1.969GlyTrp: 1.969 ± 0.653
3.609GlyTyr: 3.609 ± 0.502
0.0GlyXaa: 0.0 ± 0.0
His
2.133HisAla: 2.133 ± 0.557
0.984HisCys: 0.984 ± 0.255
0.492HisAsp: 0.492 ± 0.433
0.328HisGlu: 0.328 ± 0.546
2.461HisPhe: 2.461 ± 0.778
1.312HisGly: 1.312 ± 0.769
0.0HisHis: 0.0 ± 0.0
1.804HisIle: 1.804 ± 0.436
0.656HisLys: 0.656 ± 0.24
2.133HisLeu: 2.133 ± 0.966
0.164HisMet: 0.164 ± 0.101
0.164HisAsn: 0.164 ± 0.366
1.148HisPro: 1.148 ± 0.547
0.984HisGln: 0.984 ± 0.37
1.64HisArg: 1.64 ± 0.878
1.148HisSer: 1.148 ± 0.395
1.64HisThr: 1.64 ± 0.907
1.312HisVal: 1.312 ± 0.49
0.656HisTrp: 0.656 ± 0.274
1.476HisTyr: 1.476 ± 0.302
0.0HisXaa: 0.0 ± 0.0
Ile
2.953IleAla: 2.953 ± 0.71
2.133IleCys: 2.133 ± 0.758
2.789IleAsp: 2.789 ± 0.41
0.492IleGlu: 0.492 ± 0.161
1.312IlePhe: 1.312 ± 0.618
3.937IleGly: 3.937 ± 0.766
0.492IleHis: 0.492 ± 0.303
1.312IleIle: 1.312 ± 1.166
0.984IleLys: 0.984 ± 0.427
3.281IleLeu: 3.281 ± 1.671
0.984IleMet: 0.984 ± 0.462
1.312IleAsn: 1.312 ± 0.711
3.117IlePro: 3.117 ± 0.554
0.82IleGln: 0.82 ± 0.761
0.328IleArg: 0.328 ± 0.283
3.937IleSer: 3.937 ± 1.081
2.953IleThr: 2.953 ± 0.441
2.133IleVal: 2.133 ± 1.621
0.656IleTrp: 0.656 ± 0.467
1.969IleTyr: 1.969 ± 1.028
0.0IleXaa: 0.0 ± 0.0
Lys
2.133LysAla: 2.133 ± 0.715
0.492LysCys: 0.492 ± 0.161
2.789LysAsp: 2.789 ± 0.66
1.969LysGlu: 1.969 ± 0.469
1.312LysPhe: 1.312 ± 0.48
2.133LysGly: 2.133 ± 0.715
0.328LysHis: 0.328 ± 0.123
1.969LysIle: 1.969 ± 0.372
0.82LysLys: 0.82 ± 0.269
3.609LysLeu: 3.609 ± 0.713
0.492LysMet: 0.492 ± 0.253
0.82LysAsn: 0.82 ± 0.331
1.969LysPro: 1.969 ± 0.506
1.312LysGln: 1.312 ± 0.547
2.625LysArg: 2.625 ± 0.516
1.969LysSer: 1.969 ± 0.719
2.133LysThr: 2.133 ± 0.412
3.609LysVal: 3.609 ± 0.69
0.328LysTrp: 0.328 ± 0.123
1.969LysTyr: 1.969 ± 0.719
0.0LysXaa: 0.0 ± 0.0
Leu
11.647LeuAla: 11.647 ± 1.436
3.937LeuCys: 3.937 ± 0.614
5.413LeuAsp: 5.413 ± 1.206
3.937LeuGlu: 3.937 ± 0.636
5.085LeuPhe: 5.085 ± 2.053
6.89LeuGly: 6.89 ± 0.624
1.969LeuHis: 1.969 ± 1.574
4.101LeuIle: 4.101 ± 0.865
3.773LeuLys: 3.773 ± 1.23
16.24LeuLeu: 16.24 ± 3.95
1.476LeuMet: 1.476 ± 1.015
2.789LeuAsn: 2.789 ± 0.506
6.398LeuPro: 6.398 ± 0.622
3.937LeuGln: 3.937 ± 0.989
4.757LeuArg: 4.757 ± 0.951
6.89LeuSer: 6.89 ± 0.585
7.382LeuThr: 7.382 ± 0.842
9.022LeuVal: 9.022 ± 1.921
1.969LeuTrp: 1.969 ± 0.557
2.953LeuTyr: 2.953 ± 0.381
0.0LeuXaa: 0.0 ± 0.0
Met
1.969MetAla: 1.969 ± 0.548
0.984MetCys: 0.984 ± 0.279
0.328MetAsp: 0.328 ± 0.123
0.984MetGlu: 0.984 ± 0.255
0.82MetPhe: 0.82 ± 0.728
1.64MetGly: 1.64 ± 0.735
0.164MetHis: 0.164 ± 0.311
0.984MetIle: 0.984 ± 0.62
0.656MetLys: 0.656 ± 0.245
3.281MetLeu: 3.281 ± 0.607
0.82MetMet: 0.82 ± 0.32
0.656MetAsn: 0.656 ± 0.247
1.312MetPro: 1.312 ± 1.425
0.164MetGln: 0.164 ± 0.366
1.312MetArg: 1.312 ± 0.48
0.492MetSer: 0.492 ± 0.308
0.328MetThr: 0.328 ± 0.123
1.148MetVal: 1.148 ± 0.525
0.984MetTrp: 0.984 ± 0.37
0.164MetTyr: 0.164 ± 0.101
0.0MetXaa: 0.0 ± 0.0
Asn
2.789AsnAla: 2.789 ± 0.341
1.804AsnCys: 1.804 ± 0.535
1.148AsnAsp: 1.148 ± 0.508
0.656AsnGlu: 0.656 ± 0.24
1.148AsnPhe: 1.148 ± 0.487
1.148AsnGly: 1.148 ± 0.459
0.492AsnHis: 0.492 ± 0.313
1.476AsnIle: 1.476 ± 0.364
0.82AsnLys: 0.82 ± 0.431
3.117AsnLeu: 3.117 ± 0.81
0.656AsnMet: 0.656 ± 0.373
0.984AsnAsn: 0.984 ± 0.267
1.804AsnPro: 1.804 ± 0.429
1.148AsnGln: 1.148 ± 0.382
1.148AsnArg: 1.148 ± 0.363
1.969AsnSer: 1.969 ± 0.446
0.984AsnThr: 0.984 ± 0.299
3.117AsnVal: 3.117 ± 0.67
0.328AsnTrp: 0.328 ± 0.123
0.984AsnTyr: 0.984 ± 0.259
0.0AsnXaa: 0.0 ± 0.0
Pro
6.726ProAla: 6.726 ± 0.953
1.312ProCys: 1.312 ± 0.49
1.969ProAsp: 1.969 ± 0.469
0.984ProGlu: 0.984 ± 0.427
1.148ProPhe: 1.148 ± 0.515
4.921ProGly: 4.921 ± 0.733
1.148ProHis: 1.148 ± 0.525
3.281ProIle: 3.281 ± 0.878
3.773ProLys: 3.773 ± 1.228
3.609ProLeu: 3.609 ± 0.535
1.312ProMet: 1.312 ± 0.691
1.476ProAsn: 1.476 ± 0.423
3.937ProPro: 3.937 ± 0.85
2.133ProGln: 2.133 ± 0.426
3.281ProArg: 3.281 ± 0.578
4.265ProSer: 4.265 ± 1.24
5.085ProThr: 5.085 ± 1.051
6.234ProVal: 6.234 ± 1.046
0.328ProTrp: 0.328 ± 0.123
1.804ProTyr: 1.804 ± 0.696
0.0ProXaa: 0.0 ± 0.0
Gln
2.297GlnAla: 2.297 ± 0.78
1.476GlnCys: 1.476 ± 0.4
1.476GlnAsp: 1.476 ± 0.334
3.117GlnGlu: 3.117 ± 0.607
0.82GlnPhe: 0.82 ± 0.287
1.64GlnGly: 1.64 ± 0.555
0.984GlnHis: 0.984 ± 0.323
0.656GlnIle: 0.656 ± 0.515
0.82GlnLys: 0.82 ± 0.269
4.265GlnLeu: 4.265 ± 0.668
0.82GlnMet: 0.82 ± 0.501
0.164GlnAsn: 0.164 ± 0.101
1.804GlnPro: 1.804 ± 0.749
0.492GlnGln: 0.492 ± 0.433
2.953GlnArg: 2.953 ± 0.976
1.969GlnSer: 1.969 ± 0.469
0.984GlnThr: 0.984 ± 0.472
1.804GlnVal: 1.804 ± 0.347
0.0GlnTrp: 0.0 ± 0.0
0.984GlnTyr: 0.984 ± 0.312
0.0GlnXaa: 0.0 ± 0.0
Arg
5.413ArgAla: 5.413 ± 0.616
2.297ArgCys: 2.297 ± 0.662
1.64ArgAsp: 1.64 ± 0.537
1.476ArgGlu: 1.476 ± 0.451
2.953ArgPhe: 2.953 ± 0.384
2.953ArgGly: 2.953 ± 0.59
0.984ArgHis: 0.984 ± 0.602
0.328ArgIle: 0.328 ± 0.203
0.984ArgLys: 0.984 ± 1.23
5.577ArgLeu: 5.577 ± 0.864
0.984ArgMet: 0.984 ± 0.498
2.133ArgAsn: 2.133 ± 0.524
3.117ArgPro: 3.117 ± 0.819
1.476ArgGln: 1.476 ± 0.715
3.773ArgArg: 3.773 ± 1.329
4.757ArgSer: 4.757 ± 0.776
3.281ArgThr: 3.281 ± 0.688
6.398ArgVal: 6.398 ± 1.36
1.148ArgTrp: 1.148 ± 0.415
1.969ArgTyr: 1.969 ± 0.936
0.0ArgXaa: 0.0 ± 0.0
Ser
7.546SerAla: 7.546 ± 1.18
1.804SerCys: 1.804 ± 0.316
1.969SerAsp: 1.969 ± 0.465
1.476SerGlu: 1.476 ± 0.484
4.101SerPhe: 4.101 ± 1.03
7.546SerGly: 7.546 ± 0.85
1.312SerHis: 1.312 ± 0.76
2.953SerIle: 2.953 ± 0.932
3.281SerLys: 3.281 ± 0.681
7.71SerLeu: 7.71 ± 1.262
1.64SerMet: 1.64 ± 0.381
2.461SerAsn: 2.461 ± 0.824
3.773SerPro: 3.773 ± 0.972
0.82SerGln: 0.82 ± 0.419
3.281SerArg: 3.281 ± 1.539
4.921SerSer: 4.921 ± 1.64
4.757SerThr: 4.757 ± 0.969
5.413SerVal: 5.413 ± 0.418
0.82SerTrp: 0.82 ± 0.387
3.609SerTyr: 3.609 ± 0.47
0.0SerXaa: 0.0 ± 0.0
Thr
5.906ThrAla: 5.906 ± 1.437
0.82ThrCys: 0.82 ± 0.426
2.297ThrAsp: 2.297 ± 0.731
1.148ThrGlu: 1.148 ± 0.396
3.937ThrPhe: 3.937 ± 0.722
6.234ThrGly: 6.234 ± 1.697
1.148ThrHis: 1.148 ± 0.547
2.789ThrIle: 2.789 ± 0.639
2.133ThrLys: 2.133 ± 0.643
5.906ThrLeu: 5.906 ± 0.922
1.969ThrMet: 1.969 ± 0.465
1.804ThrAsn: 1.804 ± 0.549
4.265ThrPro: 4.265 ± 0.71
2.789ThrGln: 2.789 ± 0.561
3.773ThrArg: 3.773 ± 0.66
4.265ThrSer: 4.265 ± 0.626
5.085ThrThr: 5.085 ± 0.94
5.413ThrVal: 5.413 ± 1.058
0.164ThrTrp: 0.164 ± 0.366
0.984ThrTyr: 0.984 ± 0.578
0.0ThrXaa: 0.0 ± 0.0
Val
7.382ValAla: 7.382 ± 1.261
3.117ValCys: 3.117 ± 0.812
4.757ValAsp: 4.757 ± 0.818
1.476ValGlu: 1.476 ± 0.302
3.609ValPhe: 3.609 ± 0.502
7.382ValGly: 7.382 ± 0.831
1.969ValHis: 1.969 ± 1.269
2.789ValIle: 2.789 ± 0.733
3.937ValLys: 3.937 ± 1.124
8.53ValLeu: 8.53 ± 1.952
0.984ValMet: 0.984 ± 0.758
2.461ValAsn: 2.461 ± 0.573
6.726ValPro: 6.726 ± 0.863
2.133ValGln: 2.133 ± 0.457
3.773ValArg: 3.773 ± 0.333
4.757ValSer: 4.757 ± 1.1
8.53ValThr: 8.53 ± 1.38
11.319ValVal: 11.319 ± 1.12
1.148ValTrp: 1.148 ± 0.456
3.117ValTyr: 3.117 ± 0.498
0.0ValXaa: 0.0 ± 0.0
Trp
1.312TrpAla: 1.312 ± 0.317
0.82TrpCys: 0.82 ± 0.261
0.984TrpAsp: 0.984 ± 0.323
0.328TrpGlu: 0.328 ± 0.293
1.476TrpPhe: 1.476 ± 0.346
0.328TrpGly: 0.328 ± 0.293
0.656TrpHis: 0.656 ± 0.247
0.656TrpIle: 0.656 ± 0.247
0.328TrpLys: 0.328 ± 0.123
2.461TrpLeu: 2.461 ± 0.917
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.82TrpPro: 0.82 ± 0.269
0.0TrpGln: 0.0 ± 0.0
1.804TrpArg: 1.804 ± 0.327
1.64TrpSer: 1.64 ± 0.482
0.82TrpThr: 0.82 ± 0.269
0.984TrpVal: 0.984 ± 0.52
0.328TrpTrp: 0.328 ± 0.337
0.984TrpTyr: 0.984 ± 0.43
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.117TyrAla: 3.117 ± 1.839
0.984TyrCys: 0.984 ± 0.496
0.82TyrAsp: 0.82 ± 0.78
0.984TyrGlu: 0.984 ± 0.299
1.804TyrPhe: 1.804 ± 0.578
2.297TyrGly: 2.297 ± 0.818
1.148TyrHis: 1.148 ± 0.796
1.804TyrIle: 1.804 ± 0.499
0.82TyrLys: 0.82 ± 0.431
3.609TyrLeu: 3.609 ± 0.59
0.328TyrMet: 0.328 ± 0.337
1.969TyrAsn: 1.969 ± 0.404
1.804TyrPro: 1.804 ± 0.633
0.656TyrGln: 0.656 ± 0.247
3.117TyrArg: 3.117 ± 0.428
2.461TyrSer: 2.461 ± 0.637
1.64TyrThr: 1.64 ± 0.489
4.757TyrVal: 4.757 ± 0.827
1.476TyrTrp: 1.476 ± 0.484
1.969TyrTyr: 1.969 ± 0.463
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6097 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski