Amino acid dipeptide frequency for Sendai virus (strain Ohita) (SeV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
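
A minimal sketch of how such a frequency could be computed, assuming overlapping pairs are counted within each protein and never across protein boundaries. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, the sketch uses one-letter codes rather than the three-letter codes of the table, and it is not necessarily how Proteome-pI derives its values.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein separately.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only, not taken from the SeV proteome.
freqs = dipeptide_per_mille(["MALLS", "LSLL"])
```

Here the two toy sequences contain seven overlapping pairs in total, and the resulting frequencies sum to 1000 per mille.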

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
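
The comparison against the uniform expectation can be sketched as follows. The function `representation` is hypothetical, and the exact rule the page uses for its red and blue colouring (for example, whether the error estimate is taken into account) is not stated in the source, so a plain comparison with 2.5‰ is assumed here. The three example values are taken from the table.

```python
AMINO_ACIDS = 20
# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / AMINO_ACIDS**2


def representation(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide frequency relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


# Values (in per mille) copied from the table.
examples = {"SerLeu": 11.837, "LeuSer": 10.912, "CysCys": 0.185}
labels = {pair: representation(value) for pair, value in examples.items()}
```

With these inputs, SerLeu and LeuSer fall above the 2.5‰ expectation and CysCys falls below it.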

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
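
As a small worked example of the unit conversion, the sketch below turns a per-mille value from the table into a fraction and a percentage. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


leu_ser = 10.912  # LeuSer frequency in per mille, from the table
fraction = per_mille_to_fraction(leu_ser)
percent = per_mille_to_percent(leu_ser)
```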

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.809 ± 1.227
AlaCys: 0.555 ± 0.254
AlaAsp: 3.884 ± 0.746
AlaGlu: 3.514 ± 1.027
AlaPhe: 1.295 ± 0.374
AlaGly: 3.144 ± 0.911
AlaHis: 1.665 ± 0.53
AlaIle: 3.699 ± 0.744
AlaLys: 2.404 ± 0.64
AlaLeu: 8.323 ± 1.065
AlaMet: 2.034 ± 0.701
AlaAsn: 2.959 ± 1.015
AlaPro: 1.665 ± 0.538
AlaGln: 1.849 ± 0.639
AlaArg: 3.144 ± 0.811
AlaSer: 3.884 ± 1.064
AlaThr: 4.069 ± 1.015
AlaVal: 4.439 ± 0.765
AlaTrp: 1.11 ± 0.603
AlaTyr: 1.849 ± 0.595
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.555 ± 0.262
CysCys: 0.185 ± 0.115
CysAsp: 1.295 ± 0.655
CysGlu: 0.555 ± 0.218
CysPhe: 0.74 ± 0.274
CysGly: 0.925 ± 0.718
CysHis: 0.37 ± 0.23
CysIle: 2.034 ± 0.66
CysLys: 0.925 ± 0.268
CysLeu: 1.11 ± 0.389
CysMet: 0.185 ± 0.115
CysAsn: 0.925 ± 0.622
CysPro: 0.925 ± 0.513
CysGln: 0.74 ± 0.234
CysArg: 0.37 ± 0.212
CysSer: 1.11 ± 0.524
CysThr: 0.925 ± 0.318
CysVal: 0.555 ± 0.235
CysTrp: 0.0 ± 0.0
CysTyr: 0.185 ± 0.115
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.514 ± 1.761
AspCys: 0.37 ± 0.288
AspAsp: 2.034 ± 0.764
AspGlu: 4.624 ± 0.778
AspPhe: 1.48 ± 0.521
AspGly: 2.589 ± 0.547
AspHis: 1.11 ± 0.234
AspIle: 4.254 ± 0.503
AspLys: 2.774 ± 0.793
AspLeu: 5.548 ± 0.902
AspMet: 1.48 ± 0.376
AspAsn: 3.144 ± 0.683
AspPro: 3.884 ± 0.907
AspGln: 2.959 ± 0.955
AspArg: 2.959 ± 0.698
AspSer: 4.809 ± 0.567
AspThr: 3.884 ± 0.919
AspVal: 3.884 ± 0.529
AspTrp: 0.925 ± 0.576
AspTyr: 0.555 ± 0.415
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.733 ± 0.772
GluCys: 1.11 ± 0.547
GluAsp: 4.624 ± 1.315
GluGlu: 5.363 ± 1.826
GluPhe: 1.11 ± 0.374
GluGly: 4.439 ± 0.621
GluHis: 0.555 ± 0.275
GluIle: 4.254 ± 1.392
GluLys: 5.178 ± 1.046
GluLeu: 5.733 ± 0.965
GluMet: 1.665 ± 0.647
GluAsn: 2.219 ± 0.885
GluPro: 2.959 ± 0.593
GluGln: 1.849 ± 1.054
GluArg: 3.884 ± 1.223
GluSer: 6.658 ± 1.503
GluThr: 4.254 ± 0.91
GluVal: 3.884 ± 0.433
GluTrp: 0.74 ± 0.238
GluTyr: 1.295 ± 0.356
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.48 ± 0.54
PheCys: 0.37 ± 0.23
PheAsp: 0.925 ± 0.254
PheGlu: 1.665 ± 0.751
PhePhe: 0.925 ± 0.263
PheGly: 2.589 ± 0.712
PheHis: 0.555 ± 0.246
PheIle: 2.034 ± 0.842
PheLys: 1.295 ± 0.655
PheLeu: 3.514 ± 0.548
PheMet: 1.11 ± 0.416
PheAsn: 1.849 ± 0.638
PhePro: 1.11 ± 0.434
PheGln: 1.11 ± 0.509
PheArg: 1.295 ± 0.513
PheSer: 2.034 ± 0.499
PheThr: 0.555 ± 0.277
PheVal: 1.11 ± 0.398
PheTrp: 0.74 ± 0.345
PheTyr: 0.185 ± 0.115
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.069 ± 1.198
GlyCys: 1.11 ± 0.685
GlyAsp: 3.329 ± 0.499
GlyGlu: 5.733 ± 1.226
GlyPhe: 2.034 ± 0.544
GlyGly: 5.363 ± 1.48
GlyHis: 1.11 ± 0.333
GlyIle: 4.254 ± 0.604
GlyLys: 2.589 ± 0.769
GlyLeu: 3.699 ± 0.711
GlyMet: 1.11 ± 0.389
GlyAsn: 1.295 ± 0.369
GlyPro: 2.404 ± 0.668
GlyGln: 2.589 ± 0.637
GlyArg: 4.254 ± 0.754
GlySer: 4.994 ± 0.602
GlyThr: 3.514 ± 1.056
GlyVal: 5.363 ± 1.316
GlyTrp: 0.37 ± 0.327
GlyTyr: 3.514 ± 1.001
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.925 ± 0.303
HisCys: 0.185 ± 0.115
HisAsp: 0.925 ± 0.464
HisGlu: 1.665 ± 0.463
HisPhe: 0.37 ± 0.327
HisGly: 1.295 ± 0.499
HisHis: 0.37 ± 0.327
HisIle: 1.295 ± 0.602
HisLys: 0.925 ± 0.379
HisLeu: 1.48 ± 0.585
HisMet: 1.11 ± 0.476
HisAsn: 0.74 ± 0.337
HisPro: 2.034 ± 0.52
HisGln: 0.555 ± 0.345
HisArg: 1.295 ± 0.416
HisSer: 1.48 ± 0.519
HisThr: 0.925 ± 0.288
HisVal: 1.295 ± 0.677
HisTrp: 0.185 ± 0.201
HisTyr: 0.37 ± 0.179
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.363 ± 1.215
IleCys: 1.295 ± 0.229
IleAsp: 2.589 ± 0.591
IleGlu: 4.254 ± 0.503
IlePhe: 1.849 ± 0.761
IleGly: 3.884 ± 0.819
IleHis: 1.48 ± 0.532
IleIle: 3.884 ± 0.789
IleLys: 3.144 ± 0.642
IleLeu: 5.733 ± 0.92
IleMet: 1.48 ± 0.479
IleAsn: 2.589 ± 0.649
IlePro: 4.254 ± 0.813
IleGln: 1.849 ± 0.644
IleArg: 5.548 ± 0.733
IleSer: 5.363 ± 1.043
IleThr: 5.178 ± 0.857
IleVal: 3.884 ± 1.653
IleTrp: 1.11 ± 0.432
IleTyr: 3.514 ± 0.312
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.884 ± 0.685
LysCys: 0.74 ± 0.263
LysAsp: 4.069 ± 0.945
LysGlu: 3.699 ± 0.561
LysPhe: 1.295 ± 0.495
LysGly: 3.699 ± 0.578
LysHis: 0.925 ± 0.451
LysIle: 5.548 ± 1.447
LysLys: 2.034 ± 0.641
LysLeu: 4.439 ± 0.648
LysMet: 1.849 ± 0.597
LysAsn: 1.11 ± 0.288
LysPro: 1.48 ± 0.54
LysGln: 1.849 ± 0.475
LysArg: 4.069 ± 0.609
LysSer: 4.624 ± 0.839
LysThr: 4.994 ± 0.805
LysVal: 3.514 ± 0.896
LysTrp: 0.37 ± 0.23
LysTyr: 1.11 ± 0.425
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.363 ± 0.973
LeuCys: 0.925 ± 0.622
LeuAsp: 4.994 ± 1.168
LeuGlu: 6.288 ± 0.943
LeuPhe: 2.774 ± 1.154
LeuGly: 7.398 ± 0.908
LeuHis: 1.665 ± 0.568
LeuIle: 6.658 ± 1.232
LeuLys: 8.323 ± 1.674
LeuLeu: 7.583 ± 0.977
LeuMet: 2.219 ± 0.457
LeuAsn: 3.329 ± 1.095
LeuPro: 3.699 ± 1.245
LeuGln: 4.439 ± 0.755
LeuArg: 7.583 ± 1.299
LeuSer: 10.912 ± 1.148
LeuThr: 8.138 ± 1.093
LeuVal: 6.843 ± 0.843
LeuTrp: 0.74 ± 0.308
LeuTyr: 3.144 ± 0.535
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.849 ± 0.912
MetCys: 0.185 ± 0.115
MetAsp: 0.925 ± 0.268
MetGlu: 2.589 ± 0.913
MetPhe: 0.925 ± 0.308
MetGly: 0.925 ± 0.744
MetHis: 0.555 ± 0.345
MetIle: 0.925 ± 0.407
MetLys: 2.034 ± 0.791
MetLeu: 2.404 ± 0.843
MetMet: 0.0 ± 0.0
MetAsn: 1.11 ± 0.425
MetPro: 0.555 ± 0.425
MetGln: 0.37 ± 0.21
MetArg: 2.034 ± 0.437
MetSer: 1.11 ± 0.384
MetThr: 1.849 ± 0.685
MetVal: 1.665 ± 0.478
MetTrp: 0.185 ± 0.115
MetTyr: 0.925 ± 0.283
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.665 ± 0.376
AsnCys: 0.555 ± 0.371
AsnAsp: 1.849 ± 0.435
AsnGlu: 1.48 ± 0.531
AsnPhe: 0.555 ± 0.25
AsnGly: 3.514 ± 0.768
AsnHis: 0.185 ± 0.115
AsnIle: 4.809 ± 0.988
AsnLys: 2.959 ± 0.972
AsnLeu: 4.069 ± 1.014
AsnMet: 0.925 ± 0.293
AsnAsn: 1.295 ± 0.45
AsnPro: 2.959 ± 0.782
AsnGln: 1.665 ± 0.532
AsnArg: 1.849 ± 0.636
AsnSer: 3.144 ± 0.772
AsnThr: 2.404 ± 1.097
AsnVal: 1.295 ± 0.399
AsnTrp: 0.74 ± 0.345
AsnTyr: 1.48 ± 0.376
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.144 ± 0.615
ProCys: 0.37 ± 0.23
ProAsp: 3.514 ± 0.882
ProGlu: 3.514 ± 0.67
ProPhe: 0.925 ± 0.308
ProGly: 2.219 ± 0.638
ProHis: 1.11 ± 0.503
ProIle: 1.665 ± 0.443
ProLys: 2.589 ± 0.91
ProLeu: 5.178 ± 0.88
ProMet: 0.37 ± 0.327
ProAsn: 1.11 ± 0.476
ProPro: 1.665 ± 1.181
ProGln: 1.11 ± 0.339
ProArg: 3.144 ± 0.683
ProSer: 4.069 ± 1.515
ProThr: 2.774 ± 1.249
ProVal: 3.329 ± 0.519
ProTrp: 0.185 ± 0.195
ProTyr: 2.034 ± 0.689
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.034 ± 0.871
GlnCys: 0.555 ± 0.25
GlnAsp: 2.589 ± 0.812
GlnGlu: 3.884 ± 0.671
GlnPhe: 0.185 ± 0.115
GlnGly: 1.849 ± 0.803
GlnHis: 0.37 ± 0.329
GlnIle: 2.404 ± 0.952
GlnLys: 2.959 ± 0.716
GlnLeu: 4.254 ± 1.085
GlnMet: 0.74 ± 0.416
GlnAsn: 1.849 ± 0.36
GlnPro: 0.555 ± 0.246
GlnGln: 1.295 ± 0.402
GlnArg: 2.404 ± 0.679
GlnSer: 1.665 ± 0.641
GlnThr: 1.849 ± 0.512
GlnVal: 4.069 ± 1.519
GlnTrp: 0.37 ± 0.212
GlnTyr: 0.74 ± 0.238
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.884 ± 0.61
ArgCys: 1.11 ± 0.364
ArgAsp: 4.809 ± 0.834
ArgGlu: 4.069 ± 1.361
ArgPhe: 2.034 ± 0.672
ArgGly: 3.884 ± 0.77
ArgHis: 2.034 ± 0.482
ArgIle: 3.329 ± 0.57
ArgLys: 2.219 ± 0.366
ArgLeu: 5.918 ± 0.965
ArgMet: 0.925 ± 0.402
ArgAsn: 2.404 ± 0.89
ArgPro: 2.959 ± 0.699
ArgGln: 2.959 ± 0.909
ArgArg: 4.254 ± 1.506
ArgSer: 6.288 ± 1.361
ArgThr: 2.959 ± 0.677
ArgVal: 3.699 ± 0.698
ArgTrp: 1.295 ± 0.453
ArgTyr: 3.144 ± 0.468
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.514 ± 1.039
SerCys: 1.665 ± 0.645
SerAsp: 3.514 ± 0.723
SerGlu: 4.069 ± 0.735
SerPhe: 2.589 ± 0.267
SerGly: 5.178 ± 1.071
SerHis: 1.849 ± 0.631
SerIle: 3.884 ± 0.783
SerLys: 4.809 ± 0.738
SerLeu: 11.837 ± 0.528
SerMet: 2.219 ± 0.733
SerAsn: 3.884 ± 0.758
SerPro: 3.699 ± 1.277
SerGln: 2.589 ± 0.555
SerArg: 6.103 ± 1.292
SerSer: 5.733 ± 0.711
SerThr: 7.768 ± 1.194
SerVal: 4.809 ± 0.66
SerTrp: 0.74 ± 0.238
SerTyr: 2.589 ± 0.321
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.589 ± 0.717
ThrCys: 0.925 ± 0.513
ThrAsp: 4.624 ± 1.204
ThrGlu: 4.254 ± 1.181
ThrPhe: 2.034 ± 0.735
ThrGly: 4.624 ± 0.77
ThrHis: 1.295 ± 0.549
ThrIle: 5.733 ± 0.825
ThrLys: 3.329 ± 0.603
ThrLeu: 8.692 ± 1.061
ThrMet: 0.925 ± 0.254
ThrAsn: 1.849 ± 0.491
ThrPro: 2.589 ± 0.872
ThrGln: 2.404 ± 0.803
ThrArg: 4.069 ± 0.986
ThrSer: 5.733 ± 1.075
ThrThr: 3.884 ± 1.618
ThrVal: 3.329 ± 0.917
ThrTrp: 1.295 ± 0.451
ThrTyr: 2.774 ± 0.672
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.959 ± 0.646
ValCys: 1.11 ± 0.448
ValAsp: 3.699 ± 0.682
ValGlu: 4.254 ± 1.1
ValPhe: 2.404 ± 0.53
ValGly: 2.774 ± 0.608
ValHis: 1.48 ± 0.479
ValIle: 4.994 ± 1.386
ValLys: 2.959 ± 0.695
ValLeu: 7.768 ± 1.071
ValMet: 1.11 ± 0.252
ValAsn: 3.884 ± 0.547
ValPro: 2.219 ± 0.805
ValGln: 2.774 ± 1.09
ValArg: 4.439 ± 1.217
ValSer: 3.884 ± 1.096
ValThr: 3.514 ± 0.858
ValVal: 4.254 ± 1.108
ValTrp: 0.185 ± 0.243
ValTyr: 2.404 ± 0.878
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.849 ± 0.395
TrpCys: 0.0 ± 0.0
TrpAsp: 0.74 ± 0.461
TrpGlu: 0.185 ± 0.115
TrpPhe: 0.37 ± 0.23
TrpGly: 0.74 ± 0.238
TrpHis: 0.185 ± 0.219
TrpIle: 0.925 ± 0.313
TrpLys: 0.185 ± 0.195
TrpLeu: 1.849 ± 0.494
TrpMet: 0.555 ± 0.345
TrpAsn: 0.555 ± 0.345
TrpPro: 0.185 ± 0.115
TrpGln: 0.185 ± 0.115
TrpArg: 0.37 ± 0.212
TrpSer: 1.48 ± 0.479
TrpThr: 0.555 ± 0.262
TrpVal: 0.37 ± 0.23
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.74 ± 0.391
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.11 ± 0.438
TyrCys: 1.295 ± 0.526
TyrAsp: 1.849 ± 0.779
TyrGlu: 2.034 ± 0.549
TyrPhe: 0.555 ± 0.345
TyrGly: 1.665 ± 0.575
TyrHis: 0.555 ± 0.254
TyrIle: 1.665 ± 0.572
TyrLys: 1.665 ± 0.611
TyrLeu: 3.884 ± 1.004
TyrMet: 0.925 ± 0.318
TyrAsn: 1.295 ± 0.341
TyrPro: 2.219 ± 0.397
TyrGln: 1.48 ± 0.619
TyrArg: 1.11 ± 0.4
TyrSer: 4.069 ± 1.16
TyrThr: 2.959 ± 0.682
TyrVal: 1.48 ± 0.459
TyrTrp: 0.74 ± 0.345
TyrTyr: 0.74 ± 0.359
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (5408 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
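
A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation across replicates. The source does not say which spread measure Proteome-pI reports, and the function names and toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy proteins for illustration only, not taken from the SeV proteome.
proteins = ["MALLSLL", "MKLSAA", "MSLLLK"]
error = bootstrap_error(proteins, "LS")
absent = bootstrap_error(proteins, "WW")
```

A dipeptide that never occurs has the same zero frequency in every resample, so its estimated error is zero, which is consistent with the 0.0 ± 0.0 entries in the table.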

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski