Amino acid dipeptide frequency for Equine arteritis virus (EAV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
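The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function name `dipeptide_per_mille`, the one-letter alphabet (the table uses three-letter codes), the choice to count overlapping pairs within each protein only, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Hypothetical helper: residues outside `alphabet` are mapped to 'X' (Xaa),
    and pairs are never counted across two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in alphabet else "X" for c in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    symbols = alphabet + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(symbols, repeat=2)
    }


# Uniform expectation for the 400 standard dipeptides: 0.25%, i.e. 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

toy = ["MALLVA", "LLAG"]  # made-up sequences, not EAV proteins
freqs = dipeptide_per_mille(toy)
overrepresented = sorted(d for d, f in freqs.items() if f > UNIFORM_PER_MILLE)
```

With real proteome sequences, comparing each value in `freqs` against `UNIFORM_PER_MILLE` gives the same over/under split that the red and blue marking conveys.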

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
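As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and expresses it relative to the uniform expectation of 2.5‰. The helper names are hypothetical; the LeuLeu value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value_per_mille * 0.1


leu_leu_per_mille = 16.068                     # LeuLeu entry from the table
leu_leu_fraction = per_mille_to_fraction(leu_leu_per_mille)
leu_leu_percent = per_mille_to_percent(leu_leu_per_mille)
fold_over_uniform = leu_leu_per_mille / 2.5    # relative to the 0.25% expectation
```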

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.589 ± 1.686
AlaCys: 3.81 ± 0.977
AlaAsp: 2.982 ± 0.656
AlaGlu: 3.313 ± 0.87
AlaPhe: 2.319 ± 1.194
AlaGly: 7.123 ± 0.994
AlaHis: 1.491 ± 0.385
AlaIle: 3.81 ± 1.946
AlaLys: 3.81 ± 0.662
AlaLeu: 8.614 ± 0.662
AlaMet: 1.822 ± 0.481
AlaAsn: 3.479 ± 0.736
AlaPro: 4.472 ± 0.463
AlaGln: 2.319 ± 0.436
AlaArg: 4.141 ± 0.961
AlaSer: 9.11 ± 0.978
AlaThr: 6.129 ± 0.4
AlaVal: 9.11 ± 1.513
AlaTrp: 1.325 ± 0.458
AlaTyr: 3.81 ± 1.385
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.656 ± 0.555
CysCys: 1.491 ± 0.329
CysAsp: 3.147 ± 0.744
CysGlu: 1.822 ± 0.606
CysPhe: 1.988 ± 0.873
CysGly: 2.485 ± 0.774
CysHis: 1.656 ± 0.49
CysIle: 0.497 ± 0.588
CysLys: 0.828 ± 0.31
CysLeu: 4.804 ± 0.899
CysMet: 0.331 ± 0.457
CysAsn: 0.497 ± 0.286
CysPro: 0.994 ± 0.336
CysGln: 0.331 ± 0.128
CysArg: 1.822 ± 0.337
CysSer: 2.319 ± 0.336
CysThr: 1.988 ± 0.62
CysVal: 2.153 ± 0.351
CysTrp: 1.16 ± 0.558
CysTyr: 1.656 ± 0.318
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.644 ± 0.618
AspCys: 1.16 ± 0.441
AspAsp: 2.816 ± 0.642
AspGlu: 2.153 ± 0.476
AspPhe: 3.313 ± 0.356
AspGly: 3.479 ± 0.858
AspHis: 1.656 ± 0.379
AspIle: 1.16 ± 0.417
AspLys: 1.491 ± 0.504
AspLeu: 5.963 ± 1.012
AspMet: 0.497 ± 0.334
AspAsn: 0.663 ± 0.317
AspPro: 3.313 ± 0.692
AspGln: 1.16 ± 0.398
AspArg: 2.982 ± 0.699
AspSer: 1.988 ± 0.488
AspThr: 1.822 ± 0.353
AspVal: 4.141 ± 0.834
AspTrp: 0.994 ± 0.287
AspTyr: 1.491 ± 0.339
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.644 ± 0.611
GluCys: 0.497 ± 0.168
GluAsp: 0.663 ± 0.227
GluGlu: 2.816 ± 0.688
GluPhe: 0.663 ± 0.376
GluGly: 4.472 ± 0.927
GluHis: 1.822 ± 0.409
GluIle: 0.663 ± 0.444
GluLys: 1.16 ± 0.567
GluLeu: 3.147 ± 0.693
GluMet: 0.663 ± 0.249
GluAsn: 0.166 ± 0.362
GluPro: 1.491 ± 0.476
GluGln: 2.485 ± 0.559
GluArg: 0.994 ± 0.289
GluSer: 1.988 ± 0.398
GluThr: 0.828 ± 0.554
GluVal: 2.485 ± 0.833
GluTrp: 0.663 ± 0.255
GluTyr: 0.828 ± 0.455
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.969 ± 1.007
PheCys: 1.325 ± 0.329
PheAsp: 2.319 ± 0.314
PheGlu: 1.16 ± 0.567
PhePhe: 1.822 ± 0.84
PheGly: 2.319 ± 0.664
PheHis: 1.16 ± 0.417
PheIle: 1.988 ± 1.577
PheLys: 1.822 ± 0.422
PheLeu: 4.969 ± 0.676
PheMet: 1.656 ± 0.409
PheAsn: 0.497 ± 0.352
PhePro: 2.982 ± 0.697
PheGln: 1.491 ± 0.407
PheArg: 2.153 ± 0.574
PheSer: 3.81 ± 1.23
PheThr: 2.485 ± 0.699
PheVal: 4.472 ± 1.697
PheTrp: 0.166 ± 0.29
PheTyr: 1.16 ± 0.821
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.295 ± 0.848
GlyCys: 2.319 ± 0.649
GlyAsp: 5.466 ± 1.097
GlyGlu: 1.325 ± 0.31
GlyPhe: 2.65 ± 0.651
GlyGly: 4.141 ± 1.174
GlyHis: 2.319 ± 0.558
GlyIle: 2.319 ± 0.825
GlyLys: 2.153 ± 0.581
GlyLeu: 9.442 ± 1.113
GlyMet: 1.16 ± 0.354
GlyAsn: 3.479 ± 0.516
GlyPro: 3.313 ± 0.468
GlyGln: 2.816 ± 0.625
GlyArg: 4.141 ± 0.809
GlySer: 6.957 ± 1.147
GlyThr: 3.81 ± 0.711
GlyVal: 5.135 ± 0.598
GlyTrp: 1.988 ± 0.674
GlyTyr: 3.644 ± 0.475
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.988 ± 0.62
HisCys: 0.994 ± 0.272
HisAsp: 0.497 ± 0.316
HisGlu: 0.331 ± 0.637
HisPhe: 2.485 ± 0.767
HisGly: 1.491 ± 1.255
HisHis: 0.0 ± 0.0
HisIle: 1.822 ± 0.533
HisLys: 0.663 ± 0.255
HisLeu: 1.988 ± 0.818
HisMet: 0.166 ± 0.111
HisAsn: 0.166 ± 0.424
HisPro: 1.16 ± 0.599
HisGln: 0.994 ± 0.383
HisArg: 1.325 ± 0.479
HisSer: 1.16 ± 0.479
HisThr: 1.656 ± 0.665
HisVal: 1.491 ± 0.551
HisTrp: 0.663 ± 0.317
HisTyr: 1.491 ± 0.303
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.816 ± 0.467
IleCys: 1.822 ± 0.665
IleAsp: 3.147 ± 0.422
IleGlu: 0.497 ± 0.168
IlePhe: 1.325 ± 0.762
IleGly: 3.975 ± 0.698
IleHis: 0.663 ± 0.573
IleIle: 1.325 ± 1.122
IleLys: 0.994 ± 0.46
IleLeu: 3.147 ± 1.806
IleMet: 0.828 ± 0.31
IleAsn: 1.325 ± 0.801
IlePro: 3.147 ± 0.451
IleGln: 0.828 ± 0.448
IleArg: 0.331 ± 0.233
IleSer: 3.81 ± 0.543
IleThr: 2.816 ± 0.387
IleVal: 2.485 ± 1.866
IleTrp: 0.663 ± 0.576
IleTyr: 1.656 ± 0.894
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.153 ± 0.749
LysCys: 0.497 ± 0.168
LysAsp: 2.816 ± 0.685
LysGlu: 1.988 ± 0.474
LysPhe: 1.325 ± 0.509
LysGly: 2.153 ± 0.749
LysHis: 0.331 ± 0.128
LysIle: 1.988 ± 0.32
LysLys: 0.828 ± 0.277
LysLeu: 3.644 ± 0.689
LysMet: 0.497 ± 0.31
LysAsn: 0.828 ± 0.355
LysPro: 2.319 ± 0.603
LysGln: 1.325 ± 0.51
LysArg: 2.65 ± 0.449
LysSer: 1.988 ± 0.764
LysThr: 2.153 ± 0.26
LysVal: 3.313 ± 0.565
LysTrp: 0.331 ± 0.128
LysTyr: 1.988 ± 0.764
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.761 ± 1.546
LeuCys: 3.975 ± 0.49
LeuAsp: 5.466 ± 0.939
LeuGlu: 3.975 ± 0.818
LeuPhe: 4.969 ± 1.815
LeuGly: 6.957 ± 0.713
LeuHis: 1.491 ± 0.928
LeuIle: 3.975 ± 0.695
LeuLys: 3.81 ± 1.27
LeuLeu: 16.068 ± 4.506
LeuMet: 1.656 ± 1.243
LeuAsn: 2.65 ± 0.533
LeuPro: 6.46 ± 0.604
LeuGln: 3.81 ± 0.752
LeuArg: 4.638 ± 1.05
LeuSer: 7.288 ± 1.172
LeuThr: 7.62 ± 0.776
LeuVal: 8.779 ± 1.509
LeuTrp: 1.988 ± 0.448
LeuTyr: 3.147 ± 0.666
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.988 ± 0.566
MetCys: 0.994 ± 0.224
MetAsp: 0.0 ± 0.0
MetGlu: 0.663 ± 0.277
MetPhe: 0.663 ± 0.471
MetGly: 1.988 ± 0.598
MetHis: 0.166 ± 0.236
MetIle: 0.994 ± 0.689
MetLys: 0.663 ± 0.277
MetLeu: 3.313 ± 0.561
MetMet: 0.828 ± 0.318
MetAsn: 0.663 ± 0.255
MetPro: 1.325 ± 1.667
MetGln: 0.166 ± 0.424
MetArg: 1.325 ± 0.525
MetSer: 0.497 ± 0.359
MetThr: 0.331 ± 0.128
MetVal: 1.16 ± 0.567
MetTrp: 0.994 ± 0.383
MetTyr: 0.166 ± 0.111
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.485 ± 0.321
AsnCys: 1.822 ± 0.62
AsnAsp: 1.16 ± 0.341
AsnGlu: 0.663 ± 0.255
AsnPhe: 1.16 ± 0.577
AsnGly: 1.16 ± 0.484
AsnHis: 0.497 ± 0.331
AsnIle: 1.491 ± 0.378
AsnLys: 0.994 ± 0.586
AsnLeu: 3.147 ± 1.043
AsnMet: 0.828 ± 0.39
AsnAsn: 0.994 ± 0.285
AsnPro: 1.822 ± 0.413
AsnGln: 1.16 ± 0.306
AsnArg: 1.16 ± 0.302
AsnSer: 1.988 ± 0.459
AsnThr: 0.994 ± 0.349
AsnVal: 3.147 ± 0.68
AsnTrp: 0.331 ± 0.128
AsnTyr: 0.994 ± 0.287
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.791 ± 0.704
ProCys: 1.325 ± 0.553
ProAsp: 1.988 ± 0.474
ProGlu: 0.994 ± 0.46
ProPhe: 0.994 ± 0.752
ProGly: 4.969 ± 0.649
ProHis: 1.16 ± 0.567
ProIle: 3.313 ± 0.847
ProLys: 3.81 ± 1.276
ProLeu: 3.644 ± 0.376
ProMet: 1.325 ± 0.7
ProAsn: 1.491 ± 0.402
ProPro: 3.975 ± 0.737
ProGln: 1.988 ± 0.502
ProArg: 3.479 ± 0.664
ProSer: 4.307 ± 1.447
ProThr: 4.969 ± 0.623
ProVal: 6.295 ± 0.834
ProTrp: 0.331 ± 0.128
ProTyr: 1.656 ± 0.609
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.319 ± 0.608
GlnCys: 1.656 ± 0.442
GlnAsp: 1.491 ± 0.369
GlnGlu: 3.147 ± 0.588
GlnPhe: 0.663 ± 0.255
GlnGly: 1.656 ± 0.583
GlnHis: 0.994 ± 0.336
GlnIle: 0.828 ± 0.743
GlnLys: 0.828 ± 0.277
GlnLeu: 4.141 ± 0.557
GlnMet: 0.663 ± 0.526
GlnAsn: 0.166 ± 0.111
GlnPro: 1.822 ± 0.552
GlnGln: 0.663 ± 0.467
GlnArg: 2.816 ± 0.785
GlnSer: 1.988 ± 0.474
GlnThr: 0.994 ± 0.336
GlnVal: 1.656 ± 0.461
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.994 ± 0.342
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.632 ± 0.631
ArgCys: 2.319 ± 0.695
ArgAsp: 1.656 ± 0.555
ArgGlu: 1.491 ± 0.381
ArgPhe: 2.982 ± 0.321
ArgGly: 2.982 ± 0.493
ArgHis: 0.994 ± 0.491
ArgIle: 0.331 ± 0.222
ArgLys: 0.994 ± 1.109
ArgLeu: 5.466 ± 0.86
ArgMet: 0.994 ± 0.535
ArgAsn: 2.153 ± 0.554
ArgPro: 3.147 ± 0.597
ArgGln: 1.491 ± 0.844
ArgArg: 3.81 ± 1.116
ArgSer: 4.472 ± 0.895
ArgThr: 3.147 ± 0.838
ArgVal: 6.791 ± 1.488
ArgTrp: 1.16 ± 0.406
ArgTyr: 1.822 ± 0.735
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.62 ± 1.214
SerCys: 1.822 ± 0.362
SerAsp: 1.988 ± 0.554
SerGlu: 1.491 ± 0.504
SerPhe: 4.307 ± 1.042
SerGly: 7.454 ± 0.784
SerHis: 1.16 ± 0.536
SerIle: 2.982 ± 1.068
SerLys: 3.313 ± 0.509
SerLeu: 7.951 ± 0.978
SerMet: 1.656 ± 0.36
SerAsn: 2.485 ± 0.492
SerPro: 3.81 ± 0.977
SerGln: 0.994 ± 0.768
SerArg: 3.147 ± 0.92
SerSer: 4.969 ± 1.83
SerThr: 4.804 ± 0.956
SerVal: 5.632 ± 0.38
SerTrp: 0.828 ± 0.313
SerTyr: 3.644 ± 0.49
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.798 ± 1.249
ThrCys: 0.994 ± 0.224
ThrAsp: 1.988 ± 0.974
ThrGlu: 1.16 ± 0.417
ThrPhe: 3.975 ± 0.561
ThrGly: 6.295 ± 1.844
ThrHis: 1.16 ± 0.599
ThrIle: 3.147 ± 0.484
ThrLys: 1.822 ± 0.617
ThrLeu: 6.129 ± 0.8
ThrMet: 1.988 ± 0.554
ThrAsn: 1.822 ± 0.663
ThrPro: 3.975 ± 0.579
ThrGln: 2.816 ± 0.594
ThrArg: 4.141 ± 0.657
ThrSer: 4.307 ± 0.693
ThrThr: 4.804 ± 0.913
ThrVal: 5.135 ± 1.05
ThrTrp: 0.166 ± 0.424
ThrTyr: 0.994 ± 0.641
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.288 ± 1.123
ValCys: 3.147 ± 0.937
ValAsp: 4.804 ± 0.645
ValGlu: 1.822 ± 0.373
ValPhe: 3.81 ± 0.482
ValGly: 7.288 ± 0.946
ValHis: 1.988 ± 1.043
ValIle: 2.485 ± 0.344
ValLys: 3.975 ± 1.094
ValLeu: 8.779 ± 2.428
ValMet: 0.828 ± 0.447
ValAsn: 2.485 ± 0.467
ValPro: 6.791 ± 0.669
ValGln: 2.153 ± 0.472
ValArg: 3.81 ± 0.407
ValSer: 4.804 ± 1.096
ValThr: 8.614 ± 1.582
ValVal: 11.264 ± 1.245
ValTrp: 1.16 ± 0.366
ValTyr: 2.982 ± 0.439
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.325 ± 0.322
TrpCys: 0.828 ± 0.306
TrpAsp: 0.994 ± 0.336
TrpGlu: 0.331 ± 0.326
TrpPhe: 1.491 ± 0.343
TrpGly: 0.331 ± 0.326
TrpHis: 0.663 ± 0.255
TrpIle: 0.663 ± 0.255
TrpLys: 0.331 ± 0.128
TrpLeu: 2.485 ± 0.912
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.828 ± 0.277
TrpGln: 0.0 ± 0.0
TrpArg: 1.822 ± 0.368
TrpSer: 1.656 ± 0.501
TrpThr: 0.828 ± 0.277
TrpVal: 0.994 ± 0.43
TrpTrp: 0.166 ± 0.111
TrpTyr: 0.828 ± 0.265
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.816 ± 1.568
TyrCys: 0.828 ± 0.443
TyrAsp: 0.497 ± 0.334
TyrGlu: 0.994 ± 0.349
TyrPhe: 1.822 ± 0.634
TyrGly: 2.153 ± 0.725
TyrHis: 1.16 ± 0.886
TyrIle: 1.822 ± 0.541
TyrLys: 0.828 ± 0.443
TyrLeu: 3.644 ± 0.685
TyrMet: 0.331 ± 0.389
TyrAsn: 1.988 ± 0.314
TyrPro: 1.656 ± 0.69
TyrGln: 0.663 ± 0.255
TyrArg: 3.147 ± 0.402
TyrSer: 2.65 ± 0.735
TyrThr: 1.822 ± 0.733
TyrVal: 4.638 ± 0.863
TyrTrp: 1.491 ± 0.504
TyrTyr: 1.988 ± 0.583
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (6038 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
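The page does not spell out the bootstrap procedure, so the following is only a sketch of one common reading of "bootstrapping at the protein level": whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicate estimates is reported as the error. The function names, the use of the sample standard deviation, the fixed seed, and the toy sequences are assumptions for illustration.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Per mille frequency of one dipeptide over overlapping adjacent pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each replicate draws len(sequences) whole proteins with
    replacement, so residues within a protein always stay together.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


toy = ["MALLVA", "LLAG", "GAVLL", "MKT"]  # made-up sequences, not EAV proteins
ll_error = bootstrap_error(toy, "LL")
identical_error = bootstrap_error(["LLA"] * 5, "LL")  # no variation between proteins
```

Resampling whole proteins rather than individual residues keeps the dependence between neighbouring positions intact, which is presumably why the error is quoted at the protein level. With only 9 proteins, the estimate is dominated by a few long sequences, which may explain the comparatively large error on entries such as LeuLeu.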

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski