Amino acid dipepetide frequency for Influenza D virus (D/bovine/France/2986/2012)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.147AlaAla: 5.147 ± 0.878
1.225AlaCys: 1.225 ± 0.573
1.961AlaAsp: 1.961 ± 0.831
5.882AlaGlu: 5.882 ± 0.986
2.451AlaPhe: 2.451 ± 0.531
5.147AlaGly: 5.147 ± 1.868
0.98AlaHis: 0.98 ± 0.376
3.676AlaIle: 3.676 ± 0.771
6.373AlaLys: 6.373 ± 1.282
5.147AlaLeu: 5.147 ± 1.195
3.922AlaMet: 3.922 ± 1.101
2.696AlaAsn: 2.696 ± 0.645
1.471AlaPro: 1.471 ± 0.549
1.961AlaGln: 1.961 ± 0.675
2.941AlaArg: 2.941 ± 0.51
5.392AlaSer: 5.392 ± 1.259
2.941AlaThr: 2.941 ± 1.323
4.412AlaVal: 4.412 ± 1.35
0.735AlaTrp: 0.735 ± 0.451
1.961AlaTyr: 1.961 ± 0.472
0.0AlaXaa: 0.0 ± 0.0
Cys
0.735CysAla: 0.735 ± 0.451
0.735CysCys: 0.735 ± 0.329
0.49CysAsp: 0.49 ± 0.29
1.961CysGlu: 1.961 ± 0.679
1.961CysPhe: 1.961 ± 0.743
1.471CysGly: 1.471 ± 0.658
0.0CysHis: 0.0 ± 0.0
1.471CysIle: 1.471 ± 0.739
1.961CysLys: 1.961 ± 0.288
2.696CysLeu: 2.696 ± 0.467
0.49CysMet: 0.49 ± 0.246
1.716CysAsn: 1.716 ± 0.864
0.245CysPro: 0.245 ± 0.24
0.735CysGln: 0.735 ± 0.488
2.206CysArg: 2.206 ± 0.931
0.98CysSer: 0.98 ± 0.304
0.49CysThr: 0.49 ± 0.364
0.49CysVal: 0.49 ± 0.317
0.49CysTrp: 0.49 ± 0.246
0.98CysTyr: 0.98 ± 0.233
0.0CysXaa: 0.0 ± 0.0
Asp
3.186AspAla: 3.186 ± 0.572
0.49AspCys: 0.49 ± 0.305
3.431AspAsp: 3.431 ± 1.021
2.696AspGlu: 2.696 ± 1.043
1.961AspPhe: 1.961 ± 0.566
4.167AspGly: 4.167 ± 0.733
0.245AspHis: 0.245 ± 0.212
3.186AspIle: 3.186 ± 0.403
3.676AspLys: 3.676 ± 0.723
4.167AspLeu: 4.167 ± 0.917
0.98AspMet: 0.98 ± 0.596
2.451AspAsn: 2.451 ± 0.486
1.716AspPro: 1.716 ± 0.672
0.49AspGln: 0.49 ± 0.471
1.225AspArg: 1.225 ± 0.545
3.676AspSer: 3.676 ± 0.637
2.206AspThr: 2.206 ± 1.079
3.676AspVal: 3.676 ± 1.514
1.225AspTrp: 1.225 ± 0.375
1.225AspTyr: 1.225 ± 0.638
0.0AspXaa: 0.0 ± 0.0
Glu
7.108GluAla: 7.108 ± 1.023
1.716GluCys: 1.716 ± 0.888
4.902GluAsp: 4.902 ± 0.836
7.843GluGlu: 7.843 ± 1.86
2.206GluPhe: 2.206 ± 0.794
5.147GluGly: 5.147 ± 0.786
0.49GluHis: 0.49 ± 0.425
4.657GluIle: 4.657 ± 1.237
6.863GluLys: 6.863 ± 1.15
6.863GluLeu: 6.863 ± 1.499
4.412GluMet: 4.412 ± 1.376
4.902GluAsn: 4.902 ± 0.994
1.716GluPro: 1.716 ± 0.784
3.186GluGln: 3.186 ± 0.747
3.922GluArg: 3.922 ± 1.11
5.147GluSer: 5.147 ± 0.704
5.392GluThr: 5.392 ± 0.905
2.941GluVal: 2.941 ± 0.897
0.735GluTrp: 0.735 ± 0.329
2.206GluTyr: 2.206 ± 0.477
0.0GluXaa: 0.0 ± 0.0
Phe
0.98PheAla: 0.98 ± 0.501
2.696PheCys: 2.696 ± 0.782
1.225PheAsp: 1.225 ± 0.51
3.922PheGlu: 3.922 ± 1.023
2.206PhePhe: 2.206 ± 0.645
5.147PheGly: 5.147 ± 1.419
0.735PheHis: 0.735 ± 0.436
1.716PheIle: 1.716 ± 0.615
2.206PheLys: 2.206 ± 0.801
3.676PheLeu: 3.676 ± 1.666
1.961PheMet: 1.961 ± 0.474
3.922PheAsn: 3.922 ± 0.633
1.471PhePro: 1.471 ± 0.3
0.49PheGln: 0.49 ± 0.257
1.716PheArg: 1.716 ± 0.571
3.186PheSer: 3.186 ± 0.793
4.657PheThr: 4.657 ± 1.127
2.451PheVal: 2.451 ± 0.787
0.735PheTrp: 0.735 ± 0.407
1.225PheTyr: 1.225 ± 0.51
0.0PheXaa: 0.0 ± 0.0
Gly
4.412GlyAla: 4.412 ± 1.27
1.961GlyCys: 1.961 ± 0.905
3.186GlyAsp: 3.186 ± 0.521
5.147GlyGlu: 5.147 ± 1.104
3.922GlyPhe: 3.922 ± 1.075
5.637GlyGly: 5.637 ± 2.045
0.245GlyHis: 0.245 ± 0.219
5.637GlyIle: 5.637 ± 0.917
5.147GlyLys: 5.147 ± 0.826
5.392GlyLeu: 5.392 ± 0.576
3.186GlyMet: 3.186 ± 0.693
4.167GlyAsn: 4.167 ± 1.541
2.206GlyPro: 2.206 ± 0.735
1.471GlyGln: 1.471 ± 0.659
3.186GlyArg: 3.186 ± 0.677
4.902GlySer: 4.902 ± 1.297
5.637GlyThr: 5.637 ± 0.949
4.167GlyVal: 4.167 ± 1.002
0.245GlyTrp: 0.245 ± 0.212
1.471GlyTyr: 1.471 ± 0.762
0.0GlyXaa: 0.0 ± 0.0
His
0.245HisAla: 0.245 ± 0.325
0.49HisCys: 0.49 ± 0.329
0.735HisAsp: 0.735 ± 0.254
0.735HisGlu: 0.735 ± 0.637
0.735HisPhe: 0.735 ± 0.274
0.49HisGly: 0.49 ± 0.294
0.245HisHis: 0.245 ± 0.219
0.735HisIle: 0.735 ± 0.32
1.225HisLys: 1.225 ± 0.605
1.225HisLeu: 1.225 ± 0.641
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.245HisPro: 0.245 ± 0.224
0.245HisGln: 0.245 ± 0.23
0.735HisArg: 0.735 ± 0.381
0.735HisSer: 0.735 ± 0.329
0.49HisThr: 0.49 ± 0.425
0.98HisVal: 0.98 ± 0.319
0.245HisTrp: 0.245 ± 0.224
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.412IleAla: 4.412 ± 0.987
2.206IleCys: 2.206 ± 0.605
2.206IleAsp: 2.206 ± 0.947
4.167IleGlu: 4.167 ± 0.626
2.941IlePhe: 2.941 ± 1.213
3.922IleGly: 3.922 ± 0.666
0.49IleHis: 0.49 ± 0.425
3.186IleIle: 3.186 ± 0.575
5.637IleLys: 5.637 ± 1.335
4.167IleLeu: 4.167 ± 1.022
1.961IleMet: 1.961 ± 1.038
2.696IleAsn: 2.696 ± 0.911
3.431IlePro: 3.431 ± 0.71
2.451IleGln: 2.451 ± 0.476
3.922IleArg: 3.922 ± 0.713
3.676IleSer: 3.676 ± 0.848
2.696IleThr: 2.696 ± 0.928
3.431IleVal: 3.431 ± 0.598
0.245IleTrp: 0.245 ± 0.24
2.451IleTyr: 2.451 ± 0.531
0.0IleXaa: 0.0 ± 0.0
Lys
3.676LysAla: 3.676 ± 1.571
1.225LysCys: 1.225 ± 0.305
5.392LysAsp: 5.392 ± 1.054
6.863LysGlu: 6.863 ± 1.417
3.431LysPhe: 3.431 ± 0.79
3.676LysGly: 3.676 ± 0.498
1.225LysHis: 1.225 ± 0.305
6.618LysIle: 6.618 ± 0.911
8.088LysLys: 8.088 ± 1.661
5.392LysLeu: 5.392 ± 1.273
2.941LysMet: 2.941 ± 0.724
4.657LysAsn: 4.657 ± 1.261
2.451LysPro: 2.451 ± 0.777
1.471LysGln: 1.471 ± 0.681
7.353LysArg: 7.353 ± 1.443
3.676LysSer: 3.676 ± 1.145
4.412LysThr: 4.412 ± 0.666
3.431LysVal: 3.431 ± 0.475
1.225LysTrp: 1.225 ± 0.411
2.941LysTyr: 2.941 ± 0.486
0.0LysXaa: 0.0 ± 0.0
Leu
7.598LeuAla: 7.598 ± 1.234
1.961LeuCys: 1.961 ± 0.536
1.961LeuAsp: 1.961 ± 0.638
7.843LeuGlu: 7.843 ± 2.838
4.167LeuPhe: 4.167 ± 0.85
6.373LeuGly: 6.373 ± 2.188
1.225LeuHis: 1.225 ± 0.247
5.637LeuIle: 5.637 ± 0.841
5.882LeuLys: 5.882 ± 1.013
5.882LeuLeu: 5.882 ± 1.634
4.657LeuMet: 4.657 ± 1.641
3.676LeuAsn: 3.676 ± 0.868
2.696LeuPro: 2.696 ± 0.631
1.961LeuGln: 1.961 ± 0.825
5.147LeuArg: 5.147 ± 1.223
4.412LeuSer: 4.412 ± 0.781
3.676LeuThr: 3.676 ± 0.741
3.922LeuVal: 3.922 ± 1.045
1.225LeuTrp: 1.225 ± 0.451
2.451LeuTyr: 2.451 ± 0.561
0.0LeuXaa: 0.0 ± 0.0
Met
3.431MetAla: 3.431 ± 1.029
0.49MetCys: 0.49 ± 0.289
2.941MetAsp: 2.941 ± 0.443
2.696MetGlu: 2.696 ± 0.761
1.716MetPhe: 1.716 ± 0.373
2.941MetGly: 2.941 ± 1.187
0.245MetHis: 0.245 ± 0.224
2.206MetIle: 2.206 ± 0.968
2.451MetLys: 2.451 ± 0.652
3.922MetLeu: 3.922 ± 1.027
1.225MetMet: 1.225 ± 0.624
1.471MetAsn: 1.471 ± 0.555
0.735MetPro: 0.735 ± 0.308
0.49MetGln: 0.49 ± 0.308
2.941MetArg: 2.941 ± 0.469
3.186MetSer: 3.186 ± 0.692
1.961MetThr: 1.961 ± 0.506
2.206MetVal: 2.206 ± 0.957
0.735MetTrp: 0.735 ± 0.436
1.961MetTyr: 1.961 ± 0.814
0.0MetXaa: 0.0 ± 0.0
Asn
1.961AsnAla: 1.961 ± 0.93
1.225AsnCys: 1.225 ± 0.663
2.696AsnAsp: 2.696 ± 0.532
3.676AsnGlu: 3.676 ± 1.154
2.451AsnPhe: 2.451 ± 0.537
3.676AsnGly: 3.676 ± 0.677
0.735AsnHis: 0.735 ± 0.25
3.922AsnIle: 3.922 ± 0.721
4.412AsnLys: 4.412 ± 0.938
4.167AsnLeu: 4.167 ± 0.86
1.961AsnMet: 1.961 ± 1.084
1.471AsnAsn: 1.471 ± 0.724
3.186AsnPro: 3.186 ± 0.464
2.206AsnGln: 2.206 ± 0.577
1.471AsnArg: 1.471 ± 0.336
2.696AsnSer: 2.696 ± 1.074
2.696AsnThr: 2.696 ± 0.893
1.225AsnVal: 1.225 ± 0.669
0.49AsnTrp: 0.49 ± 0.48
0.98AsnTyr: 0.98 ± 0.304
0.0AsnXaa: 0.0 ± 0.0
Pro
2.206ProAla: 2.206 ± 0.712
0.49ProCys: 0.49 ± 0.341
0.245ProAsp: 0.245 ± 0.24
4.412ProGlu: 4.412 ± 1.021
1.716ProPhe: 1.716 ± 0.675
2.941ProGly: 2.941 ± 0.415
0.245ProHis: 0.245 ± 0.212
2.696ProIle: 2.696 ± 0.683
2.696ProLys: 2.696 ± 0.448
2.451ProLeu: 2.451 ± 0.752
1.961ProMet: 1.961 ± 0.439
1.225ProAsn: 1.225 ± 0.43
1.716ProPro: 1.716 ± 0.292
1.225ProGln: 1.225 ± 0.573
1.716ProArg: 1.716 ± 0.316
2.206ProSer: 2.206 ± 0.431
2.451ProThr: 2.451 ± 0.548
1.225ProVal: 1.225 ± 0.319
0.49ProTrp: 0.49 ± 0.308
2.206ProTyr: 2.206 ± 0.822
0.0ProXaa: 0.0 ± 0.0
Gln
2.696GlnAla: 2.696 ± 1.089
0.245GlnCys: 0.245 ± 0.219
0.245GlnAsp: 0.245 ± 0.236
3.431GlnGlu: 3.431 ± 0.725
1.716GlnPhe: 1.716 ± 0.569
2.451GlnGly: 2.451 ± 0.358
0.0GlnHis: 0.0 ± 0.0
2.206GlnIle: 2.206 ± 0.748
2.451GlnLys: 2.451 ± 0.486
3.922GlnLeu: 3.922 ± 1.227
0.245GlnMet: 0.245 ± 0.23
0.735GlnAsn: 0.735 ± 0.351
1.225GlnPro: 1.225 ± 0.44
0.735GlnGln: 0.735 ± 0.329
2.206GlnArg: 2.206 ± 0.778
1.471GlnSer: 1.471 ± 0.603
1.471GlnThr: 1.471 ± 0.553
1.716GlnVal: 1.716 ± 0.539
0.245GlnTrp: 0.245 ± 0.23
0.245GlnTyr: 0.245 ± 0.23
0.0GlnXaa: 0.0 ± 0.0
Arg
4.902ArgAla: 4.902 ± 0.877
0.735ArgCys: 0.735 ± 0.468
3.186ArgAsp: 3.186 ± 1.014
4.902ArgGlu: 4.902 ± 0.98
1.961ArgPhe: 1.961 ± 0.46
3.431ArgGly: 3.431 ± 0.455
0.245ArgHis: 0.245 ± 0.224
2.206ArgIle: 2.206 ± 0.814
3.922ArgLys: 3.922 ± 1.054
3.676ArgLeu: 3.676 ± 1.01
2.941ArgMet: 2.941 ± 0.949
2.941ArgAsn: 2.941 ± 0.862
1.961ArgPro: 1.961 ± 0.378
2.451ArgGln: 2.451 ± 0.341
4.167ArgArg: 4.167 ± 0.82
3.922ArgSer: 3.922 ± 1.503
3.922ArgThr: 3.922 ± 1.171
2.696ArgVal: 2.696 ± 0.451
0.49ArgTrp: 0.49 ± 0.459
1.225ArgTyr: 1.225 ± 0.336
0.0ArgXaa: 0.0 ± 0.0
Ser
3.431SerAla: 3.431 ± 0.762
0.735SerCys: 0.735 ± 0.429
3.431SerAsp: 3.431 ± 0.817
4.167SerGlu: 4.167 ± 0.801
2.451SerPhe: 2.451 ± 0.603
5.637SerGly: 5.637 ± 1.544
0.735SerHis: 0.735 ± 0.274
1.716SerIle: 1.716 ± 0.64
7.598SerLys: 7.598 ± 1.911
6.127SerLeu: 6.127 ± 1.165
3.186SerMet: 3.186 ± 0.841
2.451SerAsn: 2.451 ± 0.854
3.676SerPro: 3.676 ± 0.62
1.471SerGln: 1.471 ± 0.725
3.186SerArg: 3.186 ± 0.642
5.882SerSer: 5.882 ± 1.497
4.657SerThr: 4.657 ± 0.98
3.922SerVal: 3.922 ± 0.734
0.98SerTrp: 0.98 ± 0.466
1.716SerTyr: 1.716 ± 0.725
0.0SerXaa: 0.0 ± 0.0
Thr
3.922ThrAla: 3.922 ± 0.969
0.735ThrCys: 0.735 ± 0.333
3.431ThrAsp: 3.431 ± 0.871
3.922ThrGlu: 3.922 ± 1.059
3.676ThrPhe: 3.676 ± 1.309
3.431ThrGly: 3.431 ± 1.249
0.98ThrHis: 0.98 ± 0.376
4.167ThrIle: 4.167 ± 0.664
4.167ThrLys: 4.167 ± 1.294
5.882ThrLeu: 5.882 ± 0.617
1.471ThrMet: 1.471 ± 0.663
2.451ThrAsn: 2.451 ± 0.787
2.696ThrPro: 2.696 ± 0.644
1.961ThrGln: 1.961 ± 0.523
2.206ThrArg: 2.206 ± 0.704
3.922ThrSer: 3.922 ± 0.575
3.676ThrThr: 3.676 ± 0.901
4.412ThrVal: 4.412 ± 0.694
0.735ThrTrp: 0.735 ± 0.36
1.716ThrTyr: 1.716 ± 0.786
0.0ThrXaa: 0.0 ± 0.0
Val
4.412ValAla: 4.412 ± 1.101
1.225ValCys: 1.225 ± 0.495
2.451ValAsp: 2.451 ± 0.771
4.657ValGlu: 4.657 ± 1.334
2.696ValPhe: 2.696 ± 0.419
3.676ValGly: 3.676 ± 1.142
0.98ValHis: 0.98 ± 0.437
2.206ValIle: 2.206 ± 0.661
2.941ValLys: 2.941 ± 0.574
4.657ValLeu: 4.657 ± 1.42
1.225ValMet: 1.225 ± 0.672
2.941ValAsn: 2.941 ± 0.883
2.206ValPro: 2.206 ± 0.795
1.961ValGln: 1.961 ± 0.971
2.696ValArg: 2.696 ± 1.068
3.922ValSer: 3.922 ± 0.688
3.431ValThr: 3.431 ± 1.162
3.431ValVal: 3.431 ± 0.545
0.0ValTrp: 0.0 ± 0.0
1.716ValTyr: 1.716 ± 0.735
0.0ValXaa: 0.0 ± 0.0
Trp
0.98TrpAla: 0.98 ± 0.461
0.0TrpCys: 0.0 ± 0.0
0.49TrpAsp: 0.49 ± 0.263
0.49TrpGlu: 0.49 ± 0.382
0.735TrpPhe: 0.735 ± 0.637
0.98TrpGly: 0.98 ± 0.376
0.0TrpHis: 0.0 ± 0.0
0.98TrpIle: 0.98 ± 0.611
0.245TrpLys: 0.245 ± 0.23
0.735TrpLeu: 0.735 ± 0.332
0.49TrpMet: 0.49 ± 0.418
0.49TrpAsn: 0.49 ± 0.275
0.49TrpPro: 0.49 ± 0.448
0.245TrpGln: 0.245 ± 0.212
1.225TrpArg: 1.225 ± 0.729
1.471TrpSer: 1.471 ± 0.547
0.98TrpThr: 0.98 ± 0.405
0.49TrpVal: 0.49 ± 0.459
0.0TrpTrp: 0.0 ± 0.0
0.49TrpTyr: 0.49 ± 0.48
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.98TyrAla: 0.98 ± 0.254
1.716TyrCys: 1.716 ± 0.53
1.716TyrAsp: 1.716 ± 0.248
2.941TyrGlu: 2.941 ± 1.158
1.225TyrPhe: 1.225 ± 0.579
0.98TyrGly: 0.98 ± 0.331
0.49TyrHis: 0.49 ± 0.284
1.471TyrIle: 1.471 ± 0.165
2.206TyrLys: 2.206 ± 0.686
2.206TyrLeu: 2.206 ± 0.621
0.49TyrMet: 0.49 ± 0.284
0.245TyrAsn: 0.245 ± 0.212
0.98TyrPro: 0.98 ± 0.422
2.451TyrGln: 2.451 ± 0.603
1.716TyrArg: 1.716 ± 0.825
2.696TyrSer: 2.696 ± 0.778
1.716TyrThr: 1.716 ± 0.75
2.206TyrVal: 2.206 ± 0.512
0.735TyrTrp: 0.735 ± 0.455
1.225TyrTyr: 1.225 ± 0.801
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4081 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski