Amino acid dipepetide frequency for Caprine arthritis encephalitis virus (CAEV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.176AlaAla: 5.176 ± 1.703
2.218AlaCys: 2.218 ± 0.634
1.109AlaAsp: 1.109 ± 0.719
4.067AlaGlu: 4.067 ± 0.612
1.109AlaPhe: 1.109 ± 0.385
4.806AlaGly: 4.806 ± 1.468
1.109AlaHis: 1.109 ± 0.45
2.957AlaIle: 2.957 ± 0.414
1.848AlaLys: 1.848 ± 0.587
4.436AlaLeu: 4.436 ± 0.815
2.957AlaMet: 2.957 ± 0.994
3.327AlaAsn: 3.327 ± 1.149
2.218AlaPro: 2.218 ± 0.691
3.327AlaGln: 3.327 ± 1.157
6.285AlaArg: 6.285 ± 0.808
1.848AlaSer: 1.848 ± 0.641
4.436AlaThr: 4.436 ± 0.844
4.436AlaVal: 4.436 ± 1.249
0.37AlaTrp: 0.37 ± 0.56
2.957AlaTyr: 2.957 ± 0.652
0.0AlaXaa: 0.0 ± 0.0
Cys
1.109CysAla: 1.109 ± 0.385
0.37CysCys: 0.37 ± 0.401
0.0CysAsp: 0.0 ± 0.0
1.848CysGlu: 1.848 ± 0.998
0.37CysPhe: 0.37 ± 0.281
1.479CysGly: 1.479 ± 0.607
1.109CysHis: 1.109 ± 0.85
1.109CysIle: 1.109 ± 0.385
1.479CysLys: 1.479 ± 0.36
1.109CysLeu: 1.109 ± 1.198
0.739CysMet: 0.739 ± 0.492
1.479CysAsn: 1.479 ± 1.157
0.0CysPro: 0.0 ± 0.0
1.848CysGln: 1.848 ± 0.339
1.848CysArg: 1.848 ± 1.173
1.848CysSer: 1.848 ± 0.891
2.218CysThr: 2.218 ± 1.282
2.218CysVal: 2.218 ± 1.735
1.109CysTrp: 1.109 ± 0.568
1.479CysTyr: 1.479 ± 0.802
0.0CysXaa: 0.0 ± 0.0
Asp
3.697AspAla: 3.697 ± 1.078
1.848AspCys: 1.848 ± 0.891
0.37AspAsp: 0.37 ± 0.281
2.218AspGlu: 2.218 ± 0.696
2.957AspPhe: 2.957 ± 1.582
3.697AspGly: 3.697 ± 1.424
0.739AspHis: 0.739 ± 0.58
1.848AspIle: 1.848 ± 1.073
2.957AspLys: 2.957 ± 0.732
3.697AspLeu: 3.697 ± 0.801
0.37AspMet: 0.37 ± 0.289
1.479AspAsn: 1.479 ± 0.459
2.588AspPro: 2.588 ± 1.552
1.109AspGln: 1.109 ± 0.75
2.957AspArg: 2.957 ± 0.919
2.218AspSer: 2.218 ± 0.449
2.218AspThr: 2.218 ± 0.52
1.479AspVal: 1.479 ± 0.952
1.479AspTrp: 1.479 ± 0.69
0.37AspTyr: 0.37 ± 0.281
0.0AspXaa: 0.0 ± 0.0
Glu
6.285GluAla: 6.285 ± 1.17
0.739GluCys: 0.739 ± 0.401
5.176GluAsp: 5.176 ± 1.102
8.133GluGlu: 8.133 ± 1.603
1.109GluPhe: 1.109 ± 0.433
8.133GluGly: 8.133 ± 1.716
0.739GluHis: 0.739 ± 0.561
5.176GluIle: 5.176 ± 1.497
7.763GluLys: 7.763 ± 1.269
4.067GluLeu: 4.067 ± 1.127
1.848GluMet: 1.848 ± 0.453
0.739GluAsn: 0.739 ± 0.578
4.067GluPro: 4.067 ± 1.384
2.218GluGln: 2.218 ± 0.427
3.327GluArg: 3.327 ± 1.35
2.588GluSer: 2.588 ± 0.499
3.327GluThr: 3.327 ± 1.201
4.067GluVal: 4.067 ± 0.826
2.588GluTrp: 2.588 ± 0.864
4.067GluTyr: 4.067 ± 0.927
0.0GluXaa: 0.0 ± 0.0
Phe
0.37PheAla: 0.37 ± 0.289
1.109PheCys: 1.109 ± 0.433
0.0PheAsp: 0.0 ± 0.0
1.109PheGlu: 1.109 ± 0.877
0.0PhePhe: 0.0 ± 0.0
0.739PheGly: 0.739 ± 0.561
0.37PheHis: 0.37 ± 0.401
1.109PheIle: 1.109 ± 0.565
0.739PheLys: 0.739 ± 0.578
1.109PheLeu: 1.109 ± 0.489
0.37PheMet: 0.37 ± 0.289
0.739PheAsn: 0.739 ± 0.578
0.739PhePro: 0.739 ± 0.415
1.479PheGln: 1.479 ± 0.817
1.848PheArg: 1.848 ± 0.724
1.109PheSer: 1.109 ± 0.385
2.588PheThr: 2.588 ± 0.676
2.588PheVal: 2.588 ± 1.03
1.109PheTrp: 1.109 ± 0.385
0.37PheTyr: 0.37 ± 0.281
0.0PheXaa: 0.0 ± 0.0
Gly
4.806GlyAla: 4.806 ± 1.357
1.848GlyCys: 1.848 ± 0.998
2.588GlyAsp: 2.588 ± 1.09
5.176GlyGlu: 5.176 ± 1.149
2.218GlyPhe: 2.218 ± 0.769
6.285GlyGly: 6.285 ± 0.866
2.588GlyHis: 2.588 ± 0.809
7.024GlyIle: 7.024 ± 2.129
9.242GlyLys: 9.242 ± 1.122
4.067GlyLeu: 4.067 ± 0.948
1.848GlyMet: 1.848 ± 0.588
5.915GlyAsn: 5.915 ± 1.461
3.327GlyPro: 3.327 ± 1.549
1.479GlyGln: 1.479 ± 0.477
5.176GlyArg: 5.176 ± 1.473
2.588GlySer: 2.588 ± 0.655
3.327GlyThr: 3.327 ± 1.35
2.957GlyVal: 2.957 ± 0.963
1.848GlyTrp: 1.848 ± 0.607
2.957GlyTyr: 2.957 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
0.37HisAla: 0.37 ± 0.399
0.0HisCys: 0.0 ± 0.0
0.739HisAsp: 0.739 ± 0.239
0.739HisGlu: 0.739 ± 0.239
0.0HisPhe: 0.0 ± 0.0
1.109HisGly: 1.109 ± 0.231
0.37HisHis: 0.37 ± 0.281
0.739HisIle: 0.739 ± 0.578
2.957HisLys: 2.957 ± 1.551
2.218HisLeu: 2.218 ± 0.634
0.739HisMet: 0.739 ± 0.401
0.37HisAsn: 0.37 ± 0.289
1.848HisPro: 1.848 ± 1.073
1.479HisGln: 1.479 ± 0.326
2.588HisArg: 2.588 ± 0.726
1.109HisSer: 1.109 ± 0.842
0.37HisThr: 0.37 ± 0.281
1.479HisVal: 1.479 ± 0.459
2.218HisTrp: 2.218 ± 0.673
0.739HisTyr: 0.739 ± 0.239
0.0HisXaa: 0.0 ± 0.0
Ile
2.218IleAla: 2.218 ± 0.603
1.848IleCys: 1.848 ± 0.936
2.588IleAsp: 2.588 ± 0.491
3.327IleGlu: 3.327 ± 1.094
1.848IlePhe: 1.848 ± 0.339
5.915IleGly: 5.915 ± 1.91
0.37IleHis: 0.37 ± 0.399
3.327IleIle: 3.327 ± 1.35
4.436IleLys: 4.436 ± 0.473
4.806IleLeu: 4.806 ± 1.464
2.218IleMet: 2.218 ± 0.857
2.218IleAsn: 2.218 ± 1.038
4.436IlePro: 4.436 ± 2.069
2.957IleGln: 2.957 ± 0.343
3.327IleArg: 3.327 ± 1.35
2.218IleSer: 2.218 ± 0.407
1.479IleThr: 1.479 ± 0.477
6.285IleVal: 6.285 ± 2.14
0.37IleTrp: 0.37 ± 0.289
1.848IleTyr: 1.848 ± 1.403
0.0IleXaa: 0.0 ± 0.0
Lys
4.436LysAla: 4.436 ± 1.751
1.109LysCys: 1.109 ± 0.549
4.806LysAsp: 4.806 ± 1.044
6.654LysGlu: 6.654 ± 2.303
2.218LysPhe: 2.218 ± 0.9
5.545LysGly: 5.545 ± 1.809
1.109LysHis: 1.109 ± 0.842
4.436LysIle: 4.436 ± 1.629
6.285LysLys: 6.285 ± 0.931
8.872LysLeu: 8.872 ± 2.278
1.109LysMet: 1.109 ± 0.706
2.588LysAsn: 2.588 ± 1.163
3.327LysPro: 3.327 ± 1.325
2.218LysGln: 2.218 ± 1.126
4.436LysArg: 4.436 ± 0.526
2.588LysSer: 2.588 ± 0.323
4.436LysThr: 4.436 ± 0.68
1.479LysVal: 1.479 ± 0.326
2.957LysTrp: 2.957 ± 1.428
1.848LysTyr: 1.848 ± 0.641
0.0LysXaa: 0.0 ± 0.0
Leu
7.394LeuAla: 7.394 ± 1.482
0.37LeuCys: 0.37 ± 0.401
2.957LeuAsp: 2.957 ± 0.414
7.024LeuGlu: 7.024 ± 0.961
0.739LeuPhe: 0.739 ± 0.561
6.654LeuGly: 6.654 ± 1.24
0.739LeuHis: 0.739 ± 0.561
3.327LeuIle: 3.327 ± 1.194
4.806LeuLys: 4.806 ± 1.163
5.545LeuLeu: 5.545 ± 1.857
1.109LeuMet: 1.109 ± 0.45
1.479LeuAsn: 1.479 ± 0.797
4.806LeuPro: 4.806 ± 1.956
6.654LeuGln: 6.654 ± 1.798
7.394LeuArg: 7.394 ± 1.588
1.848LeuSer: 1.848 ± 0.656
3.697LeuThr: 3.697 ± 0.858
5.176LeuVal: 5.176 ± 1.264
3.697LeuTrp: 3.697 ± 1.04
1.848LeuTyr: 1.848 ± 0.66
0.0LeuXaa: 0.0 ± 0.0
Met
0.739MetAla: 0.739 ± 0.397
0.37MetCys: 0.37 ± 0.401
1.848MetAsp: 1.848 ± 0.588
1.848MetGlu: 1.848 ± 0.724
0.0MetPhe: 0.0 ± 0.0
1.109MetGly: 1.109 ± 0.231
0.0MetHis: 0.0 ± 0.0
1.479MetIle: 1.479 ± 0.336
2.218MetLys: 2.218 ± 0.581
1.848MetLeu: 1.848 ± 0.641
1.109MetMet: 1.109 ± 0.231
1.109MetAsn: 1.109 ± 0.231
2.588MetPro: 2.588 ± 0.628
3.327MetGln: 3.327 ± 1.709
2.588MetArg: 2.588 ± 0.858
1.109MetSer: 1.109 ± 0.466
1.479MetThr: 1.479 ± 0.36
1.109MetVal: 1.109 ± 0.868
0.37MetTrp: 0.37 ± 0.281
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.479AsnAla: 1.479 ± 0.336
3.697AsnCys: 3.697 ± 1.65
0.0AsnAsp: 0.0 ± 0.0
0.739AsnGlu: 0.739 ± 0.239
0.739AsnPhe: 0.739 ± 0.397
2.218AsnGly: 2.218 ± 0.9
0.739AsnHis: 0.739 ± 0.553
3.327AsnIle: 3.327 ± 1.35
5.176AsnLys: 5.176 ± 0.42
2.218AsnLeu: 2.218 ± 0.9
2.588AsnMet: 2.588 ± 0.655
1.479AsnAsn: 1.479 ± 0.477
2.588AsnPro: 2.588 ± 0.452
1.109AsnGln: 1.109 ± 0.489
1.479AsnArg: 1.479 ± 0.36
1.479AsnSer: 1.479 ± 0.459
2.588AsnThr: 2.588 ± 0.89
2.218AsnVal: 2.218 ± 0.407
2.218AsnTrp: 2.218 ± 1.496
0.739AsnTyr: 0.739 ± 0.492
0.0AsnXaa: 0.0 ± 0.0
Pro
1.848ProAla: 1.848 ± 1.037
0.37ProCys: 0.37 ± 0.281
2.218ProAsp: 2.218 ± 0.931
5.545ProGlu: 5.545 ± 0.892
0.739ProPhe: 0.739 ± 0.492
5.545ProGly: 5.545 ± 1.69
1.848ProHis: 1.848 ± 0.641
2.218ProIle: 2.218 ± 0.461
1.479ProLys: 1.479 ± 0.924
3.697ProLeu: 3.697 ± 0.887
1.479ProMet: 1.479 ± 0.777
0.739ProAsn: 0.739 ± 0.561
2.588ProPro: 2.588 ± 0.699
4.436ProGln: 4.436 ± 1.882
1.848ProArg: 1.848 ± 0.563
2.588ProSer: 2.588 ± 0.499
2.957ProThr: 2.957 ± 1.15
2.957ProVal: 2.957 ± 0.906
3.327ProTrp: 3.327 ± 1.077
2.588ProTyr: 2.588 ± 0.884
0.0ProXaa: 0.0 ± 0.0
Gln
4.806GlnAla: 4.806 ± 2.703
1.109GlnCys: 1.109 ± 0.549
1.479GlnAsp: 1.479 ± 0.83
6.654GlnGlu: 6.654 ± 1.287
0.739GlnPhe: 0.739 ± 0.561
2.957GlnGly: 2.957 ± 0.808
2.218GlnHis: 2.218 ± 0.258
3.697GlnIle: 3.697 ± 0.701
6.285GlnLys: 6.285 ± 2.079
4.067GlnLeu: 4.067 ± 0.628
0.739GlnMet: 0.739 ± 0.709
3.697GlnAsn: 3.697 ± 0.493
1.479GlnPro: 1.479 ± 0.326
4.067GlnGln: 4.067 ± 0.634
2.218GlnArg: 2.218 ± 0.769
3.697GlnSer: 3.697 ± 0.674
1.848GlnThr: 1.848 ± 1.134
3.697GlnVal: 3.697 ± 0.711
1.848GlnTrp: 1.848 ± 0.432
1.479GlnTyr: 1.479 ± 0.952
0.0GlnXaa: 0.0 ± 0.0
Arg
3.327ArgAla: 3.327 ± 0.951
0.739ArgCys: 0.739 ± 0.578
4.067ArgAsp: 4.067 ± 1.425
5.545ArgGlu: 5.545 ± 0.316
1.109ArgPhe: 1.109 ± 0.466
5.176ArgGly: 5.176 ± 1.835
1.109ArgHis: 1.109 ± 0.549
4.806ArgIle: 4.806 ± 1.706
5.915ArgLys: 5.915 ± 0.826
2.957ArgLeu: 2.957 ± 0.605
1.109ArgMet: 1.109 ± 0.466
2.957ArgAsn: 2.957 ± 1.487
2.588ArgPro: 2.588 ± 0.499
5.545ArgGln: 5.545 ± 1.802
4.436ArgArg: 4.436 ± 3.163
3.697ArgSer: 3.697 ± 1.53
6.654ArgThr: 6.654 ± 1.655
4.806ArgVal: 4.806 ± 1.057
2.218ArgTrp: 2.218 ± 0.833
1.848ArgTyr: 1.848 ± 0.483
0.0ArgXaa: 0.0 ± 0.0
Ser
1.479SerAla: 1.479 ± 0.81
1.848SerCys: 1.848 ± 0.838
2.218SerAsp: 2.218 ± 1.065
3.327SerGlu: 3.327 ± 0.73
0.37SerPhe: 0.37 ± 0.289
2.957SerGly: 2.957 ± 0.582
1.109SerHis: 1.109 ± 0.565
2.218SerIle: 2.218 ± 1.168
0.37SerLys: 0.37 ± 0.399
6.285SerLeu: 6.285 ± 0.926
0.739SerMet: 0.739 ± 0.401
1.109SerAsn: 1.109 ± 0.45
2.588SerPro: 2.588 ± 0.603
1.479SerGln: 1.479 ± 0.607
4.436SerArg: 4.436 ± 1.114
1.479SerSer: 1.479 ± 0.81
3.697SerThr: 3.697 ± 0.323
1.109SerVal: 1.109 ± 0.842
1.109SerTrp: 1.109 ± 0.568
1.109SerTyr: 1.109 ± 0.742
0.0SerXaa: 0.0 ± 0.0
Thr
2.957ThrAla: 2.957 ± 0.774
2.957ThrCys: 2.957 ± 0.797
2.588ThrAsp: 2.588 ± 0.889
4.806ThrGlu: 4.806 ± 1.702
0.739ThrPhe: 0.739 ± 0.561
6.285ThrGly: 6.285 ± 0.75
1.848ThrHis: 1.848 ± 0.66
2.218ThrIle: 2.218 ± 0.508
1.479ThrLys: 1.479 ± 0.81
7.024ThrLeu: 7.024 ± 0.841
0.739ThrMet: 0.739 ± 0.247
2.218ThrAsn: 2.218 ± 0.461
1.479ThrPro: 1.479 ± 0.326
4.806ThrGln: 4.806 ± 0.616
4.067ThrArg: 4.067 ± 1.844
2.957ThrSer: 2.957 ± 0.92
1.479ThrThr: 1.479 ± 0.36
3.327ThrVal: 3.327 ± 0.692
1.848ThrTrp: 1.848 ± 0.585
2.218ThrTyr: 2.218 ± 0.641
0.0ThrXaa: 0.0 ± 0.0
Val
4.806ValAla: 4.806 ± 0.806
0.739ValCys: 0.739 ± 0.239
3.327ValAsp: 3.327 ± 0.977
3.327ValGlu: 3.327 ± 0.692
0.739ValPhe: 0.739 ± 0.401
2.957ValGly: 2.957 ± 0.854
2.218ValHis: 2.218 ± 1.235
3.327ValIle: 3.327 ± 1.383
2.218ValLys: 2.218 ± 0.828
4.806ValLeu: 4.806 ± 0.457
2.957ValMet: 2.957 ± 1.146
1.848ValAsn: 1.848 ± 0.246
2.957ValPro: 2.957 ± 0.454
4.067ValGln: 4.067 ± 0.634
3.327ValArg: 3.327 ± 0.613
2.218ValSer: 2.218 ± 0.822
4.067ValThr: 4.067 ± 1.347
4.067ValVal: 4.067 ± 0.948
2.957ValTrp: 2.957 ± 0.807
1.848ValTyr: 1.848 ± 0.66
0.0ValXaa: 0.0 ± 0.0
Trp
0.739TrpAla: 0.739 ± 0.578
1.109TrpCys: 1.109 ± 0.549
1.848TrpAsp: 1.848 ± 0.498
1.479TrpGlu: 1.479 ± 0.749
1.479TrpPhe: 1.479 ± 0.986
1.479TrpGly: 1.479 ± 0.539
1.109TrpHis: 1.109 ± 0.45
1.479TrpIle: 1.479 ± 1.122
2.588TrpLys: 2.588 ± 0.747
2.588TrpLeu: 2.588 ± 0.614
0.739TrpMet: 0.739 ± 0.401
1.848TrpAsn: 1.848 ± 0.563
1.109TrpPro: 1.109 ± 0.45
2.218TrpGln: 2.218 ± 1.004
4.067TrpArg: 4.067 ± 1.096
1.109TrpSer: 1.109 ± 0.385
3.327TrpThr: 3.327 ± 1.143
3.327TrpVal: 3.327 ± 0.455
0.0TrpTrp: 0.0 ± 0.0
1.109TrpTyr: 1.109 ± 0.7
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.957TyrAla: 2.957 ± 0.605
0.37TyrCys: 0.37 ± 0.289
0.37TyrAsp: 0.37 ± 0.281
2.588TyrGlu: 2.588 ± 0.722
0.37TyrPhe: 0.37 ± 0.281
2.218TyrGly: 2.218 ± 0.9
0.739TyrHis: 0.739 ± 0.578
2.218TyrIle: 2.218 ± 0.716
2.588TyrLys: 2.588 ± 0.876
1.848TyrLeu: 1.848 ± 0.563
0.37TyrMet: 0.37 ± 0.281
1.479TyrAsn: 1.479 ± 0.336
3.697TyrPro: 3.697 ± 1.317
2.957TyrGln: 2.957 ± 0.854
2.957TyrArg: 2.957 ± 1.213
0.739TyrSer: 0.739 ± 0.492
1.848TyrThr: 1.848 ± 0.563
0.0TyrVal: 0.0 ± 0.0
1.109TyrTrp: 1.109 ± 0.385
1.479TyrTyr: 1.479 ± 0.797
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2706 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski