Amino acid dipepetide frequency for Sendai virus (strain Z) (SeV) (Sendai virus (strain HVJ))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.03AlaAla: 4.03 ± 0.704
0.576AlaCys: 0.576 ± 0.344
2.591AlaAsp: 2.591 ± 1.276
4.318AlaGlu: 4.318 ± 1.057
2.591AlaPhe: 2.591 ± 0.739
4.03AlaGly: 4.03 ± 0.592
4.318AlaHis: 4.318 ± 1.065
2.879AlaIle: 2.879 ± 1.215
4.03AlaLys: 4.03 ± 0.698
5.757AlaLeu: 5.757 ± 1.921
2.879AlaMet: 2.879 ± 1.023
2.303AlaAsn: 2.303 ± 0.709
2.303AlaPro: 2.303 ± 0.432
1.151AlaGln: 1.151 ± 0.766
2.879AlaArg: 2.879 ± 0.701
4.893AlaSer: 4.893 ± 0.546
2.879AlaThr: 2.879 ± 0.759
4.893AlaVal: 4.893 ± 0.88
1.727AlaTrp: 1.727 ± 0.812
1.151AlaTyr: 1.151 ± 0.467
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.288CysGlu: 0.288 ± 0.267
0.288CysPhe: 0.288 ± 0.26
1.439CysGly: 1.439 ± 0.72
0.0CysHis: 0.0 ± 0.0
1.151CysIle: 1.151 ± 0.801
1.151CysLys: 1.151 ± 0.551
0.576CysLeu: 0.576 ± 0.344
0.288CysMet: 0.288 ± 0.254
0.288CysAsn: 0.288 ± 0.254
1.727CysPro: 1.727 ± 0.666
0.288CysGln: 0.288 ± 0.267
1.727CysArg: 1.727 ± 0.754
0.288CysSer: 0.288 ± 0.254
0.576CysThr: 0.576 ± 0.31
0.576CysVal: 0.576 ± 0.336
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.893AspAla: 4.893 ± 1.004
1.151AspCys: 1.151 ± 0.425
1.151AspAsp: 1.151 ± 0.329
5.469AspGlu: 5.469 ± 1.419
2.015AspPhe: 2.015 ± 0.656
2.015AspGly: 2.015 ± 0.527
0.0AspHis: 0.0 ± 0.0
4.893AspIle: 4.893 ± 0.35
2.879AspLys: 2.879 ± 0.73
3.454AspLeu: 3.454 ± 0.78
0.864AspMet: 0.864 ± 0.476
2.015AspAsn: 2.015 ± 0.367
2.015AspPro: 2.015 ± 0.597
6.333AspGln: 6.333 ± 1.653
2.879AspArg: 2.879 ± 1.034
5.181AspSer: 5.181 ± 0.894
3.454AspThr: 3.454 ± 0.907
2.303AspVal: 2.303 ± 0.419
0.0AspTrp: 0.0 ± 0.0
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
7.196GluAla: 7.196 ± 1.167
0.0GluCys: 0.0 ± 0.0
6.045GluAsp: 6.045 ± 1.418
8.348GluGlu: 8.348 ± 1.934
0.864GluPhe: 0.864 ± 0.477
6.045GluGly: 6.045 ± 1.814
1.151GluHis: 1.151 ± 0.597
1.727GluIle: 1.727 ± 0.399
6.045GluLys: 6.045 ± 1.381
4.893GluLeu: 4.893 ± 0.766
3.166GluMet: 3.166 ± 0.681
2.879GluAsn: 2.879 ± 1.302
4.03GluPro: 4.03 ± 0.834
3.454GluGln: 3.454 ± 1.031
5.469GluArg: 5.469 ± 0.876
7.772GluSer: 7.772 ± 1.276
5.757GluThr: 5.757 ± 1.085
4.893GluVal: 4.893 ± 1.002
0.576GluTrp: 0.576 ± 0.337
0.864GluTyr: 0.864 ± 0.359
0.0GluXaa: 0.0 ± 0.0
Phe
1.151PheAla: 1.151 ± 0.448
0.288PheCys: 0.288 ± 0.254
0.864PheAsp: 0.864 ± 0.307
0.864PheGlu: 0.864 ± 0.335
0.864PhePhe: 0.864 ± 0.355
2.015PheGly: 2.015 ± 0.707
0.288PheHis: 0.288 ± 0.26
1.727PheIle: 1.727 ± 0.672
0.0PheLys: 0.0 ± 0.0
3.166PheLeu: 3.166 ± 0.75
0.864PheMet: 0.864 ± 0.556
0.864PheAsn: 0.864 ± 0.359
0.576PhePro: 0.576 ± 0.52
1.727PheGln: 1.727 ± 0.684
1.439PheArg: 1.439 ± 0.501
1.151PheSer: 1.151 ± 0.558
0.288PheThr: 0.288 ± 0.251
1.439PheVal: 1.439 ± 0.642
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.318GlyAla: 4.318 ± 1.277
0.576GlyCys: 0.576 ± 0.533
4.318GlyAsp: 4.318 ± 1.452
10.363GlyGlu: 10.363 ± 1.43
2.303GlyPhe: 2.303 ± 0.551
7.484GlyGly: 7.484 ± 1.649
0.864GlyHis: 0.864 ± 0.352
3.454GlyIle: 3.454 ± 0.676
1.727GlyLys: 1.727 ± 0.498
4.03GlyLeu: 4.03 ± 0.786
0.288GlyMet: 0.288 ± 0.26
2.015GlyAsn: 2.015 ± 0.621
3.166GlyPro: 3.166 ± 0.436
1.151GlyGln: 1.151 ± 0.758
6.908GlyArg: 6.908 ± 1.468
7.196GlySer: 7.196 ± 1.55
6.621GlyThr: 6.621 ± 0.902
6.045GlyVal: 6.045 ± 1.95
0.288GlyTrp: 0.288 ± 0.251
1.439GlyTyr: 1.439 ± 0.537
0.0GlyXaa: 0.0 ± 0.0
His
1.151HisAla: 1.151 ± 0.386
0.0HisCys: 0.0 ± 0.0
0.576HisAsp: 0.576 ± 0.533
0.288HisGlu: 0.288 ± 0.267
0.288HisPhe: 0.288 ± 0.251
0.864HisGly: 0.864 ± 0.543
0.288HisHis: 0.288 ± 0.251
3.166HisIle: 3.166 ± 0.623
0.576HisLys: 0.576 ± 0.533
0.576HisLeu: 0.576 ± 0.344
1.727HisMet: 1.727 ± 0.603
0.864HisAsn: 0.864 ± 0.431
2.879HisPro: 2.879 ± 0.868
0.0HisGln: 0.0 ± 0.0
3.166HisArg: 3.166 ± 1.012
1.727HisSer: 1.727 ± 0.556
0.0HisThr: 0.0 ± 0.0
0.288HisVal: 0.288 ± 0.26
0.0HisTrp: 0.0 ± 0.0
0.288HisTyr: 0.288 ± 0.267
0.0HisXaa: 0.0 ± 0.0
Ile
2.591IleAla: 2.591 ± 1.534
0.864IleCys: 0.864 ± 0.355
2.591IleAsp: 2.591 ± 0.857
4.03IleGlu: 4.03 ± 0.534
0.864IlePhe: 0.864 ± 0.527
5.469IleGly: 5.469 ± 1.095
2.015IleHis: 2.015 ± 0.813
3.454IleIle: 3.454 ± 0.879
2.015IleLys: 2.015 ± 1.325
6.045IleLeu: 6.045 ± 1.055
1.727IleMet: 1.727 ± 0.556
3.454IleAsn: 3.454 ± 0.749
4.318IlePro: 4.318 ± 1.24
1.727IleGln: 1.727 ± 0.533
5.181IleArg: 5.181 ± 1.306
2.591IleSer: 2.591 ± 1.292
3.166IleThr: 3.166 ± 1.215
4.606IleVal: 4.606 ± 1.302
1.439IleTrp: 1.439 ± 0.663
3.166IleTyr: 3.166 ± 0.753
0.0IleXaa: 0.0 ± 0.0
Lys
3.454LysAla: 3.454 ± 0.98
0.576LysCys: 0.576 ± 0.309
4.318LysAsp: 4.318 ± 1.164
4.606LysGlu: 4.606 ± 1.257
0.576LysPhe: 0.576 ± 0.31
2.879LysGly: 2.879 ± 0.79
1.439LysHis: 1.439 ± 0.663
6.333LysIle: 6.333 ± 1.977
2.015LysLys: 2.015 ± 0.661
3.454LysLeu: 3.454 ± 0.975
2.015LysMet: 2.015 ± 0.847
0.576LysAsn: 0.576 ± 0.344
4.606LysPro: 4.606 ± 1.8
1.727LysGln: 1.727 ± 0.782
4.606LysArg: 4.606 ± 0.355
5.181LysSer: 5.181 ± 0.745
5.757LysThr: 5.757 ± 1.012
4.03LysVal: 4.03 ± 1.309
0.0LysTrp: 0.0 ± 0.0
1.151LysTyr: 1.151 ± 0.511
0.0LysXaa: 0.0 ± 0.0
Leu
4.606LeuAla: 4.606 ± 1.526
0.576LeuCys: 0.576 ± 0.509
3.454LeuAsp: 3.454 ± 0.629
6.333LeuGlu: 6.333 ± 0.879
0.864LeuPhe: 0.864 ± 0.752
7.484LeuGly: 7.484 ± 1.289
1.439LeuHis: 1.439 ± 0.514
7.196LeuIle: 7.196 ± 1.814
7.484LeuLys: 7.484 ± 1.856
6.621LeuLeu: 6.621 ± 1.509
2.591LeuMet: 2.591 ± 0.673
1.727LeuAsn: 1.727 ± 0.479
4.893LeuPro: 4.893 ± 1.663
3.166LeuGln: 3.166 ± 1.258
7.196LeuArg: 7.196 ± 1.309
8.636LeuSer: 8.636 ± 0.993
6.333LeuThr: 6.333 ± 1.218
5.469LeuVal: 5.469 ± 0.817
0.288LeuTrp: 0.288 ± 0.251
2.303LeuTyr: 2.303 ± 0.794
0.0LeuXaa: 0.0 ± 0.0
Met
2.879MetAla: 2.879 ± 0.508
0.0MetCys: 0.0 ± 0.0
1.151MetAsp: 1.151 ± 0.427
4.606MetGlu: 4.606 ± 1.083
0.288MetPhe: 0.288 ± 0.251
0.576MetGly: 0.576 ± 0.337
0.288MetHis: 0.288 ± 0.251
0.576MetIle: 0.576 ± 0.52
3.166MetLys: 3.166 ± 1.079
3.454MetLeu: 3.454 ± 1.473
0.0MetMet: 0.0 ± 0.0
0.864MetAsn: 0.864 ± 0.543
0.576MetPro: 0.576 ± 0.352
0.288MetGln: 0.288 ± 0.251
2.303MetArg: 2.303 ± 0.767
0.576MetSer: 0.576 ± 0.35
1.727MetThr: 1.727 ± 0.793
2.015MetVal: 2.015 ± 0.856
0.0MetTrp: 0.0 ± 0.0
0.288MetTyr: 0.288 ± 0.267
0.0MetXaa: 0.0 ± 0.0
Asn
1.727AsnAla: 1.727 ± 0.549
0.576AsnCys: 0.576 ± 0.533
0.864AsnAsp: 0.864 ± 0.554
0.864AsnGlu: 0.864 ± 0.431
0.864AsnPhe: 0.864 ± 0.516
4.318AsnGly: 4.318 ± 0.748
0.0AsnHis: 0.0 ± 0.0
2.879AsnIle: 2.879 ± 0.843
4.03AsnLys: 4.03 ± 0.89
2.591AsnLeu: 2.591 ± 0.443
0.576AsnMet: 0.576 ± 0.336
0.864AsnAsn: 0.864 ± 0.431
2.015AsnPro: 2.015 ± 0.598
1.151AsnGln: 1.151 ± 0.567
3.454AsnArg: 3.454 ± 1.106
4.318AsnSer: 4.318 ± 1.697
2.879AsnThr: 2.879 ± 1.029
0.576AsnVal: 0.576 ± 0.35
0.0AsnTrp: 0.0 ± 0.0
0.864AsnTyr: 0.864 ± 0.476
0.0AsnXaa: 0.0 ± 0.0
Pro
4.318ProAla: 4.318 ± 0.928
0.0ProCys: 0.0 ± 0.0
3.742ProAsp: 3.742 ± 0.724
4.03ProGlu: 4.03 ± 1.125
0.576ProPhe: 0.576 ± 0.31
6.333ProGly: 6.333 ± 2.242
0.576ProHis: 0.576 ± 0.52
2.015ProIle: 2.015 ± 0.683
5.757ProLys: 5.757 ± 1.913
4.893ProLeu: 4.893 ± 0.974
0.576ProMet: 0.576 ± 0.501
1.439ProAsn: 1.439 ± 0.56
4.606ProPro: 4.606 ± 2.085
1.727ProGln: 1.727 ± 0.256
2.879ProArg: 2.879 ± 0.617
5.469ProSer: 5.469 ± 1.548
2.591ProThr: 2.591 ± 0.851
1.727ProVal: 1.727 ± 0.426
0.288ProTrp: 0.288 ± 0.26
1.151ProTyr: 1.151 ± 0.567
0.0ProXaa: 0.0 ± 0.0
Gln
3.742GlnAla: 3.742 ± 0.921
0.864GlnCys: 0.864 ± 0.331
4.893GlnAsp: 4.893 ± 1.425
3.742GlnGlu: 3.742 ± 0.968
0.0GlnPhe: 0.0 ± 0.0
1.727GlnGly: 1.727 ± 0.503
1.439GlnHis: 1.439 ± 0.538
2.015GlnIle: 2.015 ± 1.023
3.166GlnLys: 3.166 ± 1.067
2.015GlnLeu: 2.015 ± 0.551
1.439GlnMet: 1.439 ± 0.59
2.303GlnAsn: 2.303 ± 0.708
0.576GlnPro: 0.576 ± 0.344
1.151GlnGln: 1.151 ± 0.758
2.303GlnArg: 2.303 ± 0.852
1.151GlnSer: 1.151 ± 0.498
1.439GlnThr: 1.439 ± 0.663
4.606GlnVal: 4.606 ± 1.698
0.0GlnTrp: 0.0 ± 0.0
0.576GlnTyr: 0.576 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
4.893ArgAla: 4.893 ± 0.994
0.288ArgCys: 0.288 ± 0.267
5.469ArgAsp: 5.469 ± 0.839
5.469ArgGlu: 5.469 ± 1.644
1.151ArgPhe: 1.151 ± 0.504
5.757ArgGly: 5.757 ± 0.97
0.864ArgHis: 0.864 ± 0.531
3.166ArgIle: 3.166 ± 0.933
2.591ArgLys: 2.591 ± 0.451
4.03ArgLeu: 4.03 ± 1.289
1.151ArgMet: 1.151 ± 0.471
1.151ArgAsn: 1.151 ± 0.588
3.166ArgPro: 3.166 ± 0.515
2.879ArgGln: 2.879 ± 1.106
7.772ArgArg: 7.772 ± 0.916
9.499ArgSer: 9.499 ± 2.517
4.318ArgThr: 4.318 ± 0.817
5.181ArgVal: 5.181 ± 1.407
1.727ArgTrp: 1.727 ± 0.556
3.166ArgTyr: 3.166 ± 0.579
0.0ArgXaa: 0.0 ± 0.0
Ser
4.318SerAla: 4.318 ± 1.217
1.439SerCys: 1.439 ± 0.538
3.454SerAsp: 3.454 ± 0.689
5.469SerGlu: 5.469 ± 1.038
1.727SerPhe: 1.727 ± 0.416
6.908SerGly: 6.908 ± 2.148
1.439SerHis: 1.439 ± 0.564
2.879SerIle: 2.879 ± 0.596
4.318SerLys: 4.318 ± 0.93
11.802SerLeu: 11.802 ± 1.049
3.166SerMet: 3.166 ± 0.729
2.591SerAsn: 2.591 ± 0.792
3.742SerPro: 3.742 ± 1.407
4.893SerGln: 4.893 ± 0.858
4.893SerArg: 4.893 ± 1.182
5.469SerSer: 5.469 ± 1.065
9.211SerThr: 9.211 ± 1.973
4.606SerVal: 4.606 ± 0.999
1.151SerTrp: 1.151 ± 0.597
1.439SerTyr: 1.439 ± 0.488
0.0SerXaa: 0.0 ± 0.0
Thr
2.015ThrAla: 2.015 ± 1.325
0.864ThrCys: 0.864 ± 0.551
4.606ThrAsp: 4.606 ± 1.197
4.318ThrGlu: 4.318 ± 1.103
1.439ThrPhe: 1.439 ± 0.721
6.045ThrGly: 6.045 ± 0.617
0.576ThrHis: 0.576 ± 0.533
2.879ThrIle: 2.879 ± 0.843
4.03ThrLys: 4.03 ± 0.68
9.499ThrLeu: 9.499 ± 2.168
0.576ThrMet: 0.576 ± 0.286
3.742ThrAsn: 3.742 ± 0.69
6.045ThrPro: 6.045 ± 1.702
2.303ThrGln: 2.303 ± 0.851
2.879ThrArg: 2.879 ± 0.99
4.893ThrSer: 4.893 ± 0.656
3.742ThrThr: 3.742 ± 1.237
4.318ThrVal: 4.318 ± 1.179
0.288ThrTrp: 0.288 ± 0.267
1.439ThrTyr: 1.439 ± 0.593
0.0ThrXaa: 0.0 ± 0.0
Val
2.303ValAla: 2.303 ± 0.984
1.439ValCys: 1.439 ± 0.768
2.591ValAsp: 2.591 ± 0.997
6.045ValGlu: 6.045 ± 0.945
2.015ValPhe: 2.015 ± 0.62
1.727ValGly: 1.727 ± 0.794
1.727ValHis: 1.727 ± 0.684
6.908ValIle: 6.908 ± 1.68
3.166ValLys: 3.166 ± 0.782
6.045ValLeu: 6.045 ± 0.582
0.864ValMet: 0.864 ± 0.307
3.742ValAsn: 3.742 ± 1.186
1.727ValPro: 1.727 ± 0.774
3.454ValGln: 3.454 ± 1.172
4.893ValArg: 4.893 ± 1.375
5.757ValSer: 5.757 ± 1.467
3.742ValThr: 3.742 ± 1.204
2.879ValVal: 2.879 ± 1.747
0.288ValTrp: 0.288 ± 0.251
0.576ValTyr: 0.576 ± 0.35
0.0ValXaa: 0.0 ± 0.0
Trp
1.727TrpAla: 1.727 ± 0.684
0.288TrpCys: 0.288 ± 0.254
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.576TrpGly: 0.576 ± 0.337
0.0TrpHis: 0.0 ± 0.0
0.576TrpIle: 0.576 ± 0.358
0.288TrpLys: 0.288 ± 0.26
2.303TrpLeu: 2.303 ± 0.541
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.727TrpSer: 1.727 ± 0.684
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.576TrpTyr: 0.576 ± 0.533
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.864TyrAla: 0.864 ± 0.307
0.288TyrCys: 0.288 ± 0.26
0.288TyrAsp: 0.288 ± 0.267
2.015TyrGlu: 2.015 ± 0.683
0.0TyrPhe: 0.0 ± 0.0
0.864TyrGly: 0.864 ± 0.543
0.0TyrHis: 0.0 ± 0.0
1.439TyrIle: 1.439 ± 0.779
0.864TyrLys: 0.864 ± 0.496
3.454TyrLeu: 3.454 ± 1.36
0.576TyrMet: 0.576 ± 0.31
1.727TyrAsn: 1.727 ± 0.491
1.727TyrPro: 1.727 ± 0.458
0.576TyrGln: 0.576 ± 0.502
0.864TyrArg: 0.864 ± 0.355
1.439TyrSer: 1.439 ± 0.617
2.015TyrThr: 2.015 ± 0.806
1.439TyrVal: 1.439 ± 0.53
0.0TyrTrp: 0.0 ± 0.0
0.576TyrTyr: 0.576 ± 0.52
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3475 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski