Amino acid dipeptide frequency for Rotavirus G chicken/03V0567/DEU/2003

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
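The calculation described above can be sketched as follows. This is a minimal illustration, not the database's own code: it assumes one-letter amino acid codes, overlapping dipeptide windows counted within each protein (not across protein boundaries), and any non-standard letter mapped to X (Xaa). The function names and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids

# Uniform expectation: 1 of 400 dipeptides = 0.25 % = 2.5 per mille
UNIFORM_EXPECTED = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping windows, counted separately per protein.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa)
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq, expected=UNIFORM_EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy input: dipeptides are AA, AC (first protein) and AC (second)
freqs = dipeptide_per_mille(["AAC", "AC"])
```

With real data, `classify` applied to each value would correspond to the red (over) and blue (under) marking mentioned above, assuming the marking is relative to the uniform 2.5‰ expectation.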

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
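As a small worked example of the unit conversion, the sketch below converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


ala_ala = 4.015  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.004015
percent = per_mille_to_percent(ala_ala)    # about 0.4015 %
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, so AlaAla at 4.015‰ lies above it.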

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue. Second-residue order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.015 ± 1.106
AlaCys: 0.873 ± 0.494
AlaAsp: 3.316 ± 0.842
AlaGlu: 2.967 ± 0.643
AlaPhe: 2.967 ± 0.266
AlaGly: 0.698 ± 0.356
AlaHis: 0.524 ± 0.304
AlaIle: 6.109 ± 1.325
AlaLys: 5.062 ± 0.51
AlaLeu: 3.491 ± 0.423
AlaMet: 2.095 ± 0.468
AlaAsn: 2.967 ± 0.935
AlaPro: 2.095 ± 0.595
AlaGln: 2.967 ± 0.743
AlaArg: 2.618 ± 0.63
AlaSer: 3.142 ± 0.963
AlaThr: 3.666 ± 1.058
AlaVal: 3.666 ± 0.768
AlaTrp: 0.873 ± 0.253
AlaTyr: 2.269 ± 0.454
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.524 ± 0.335
CysCys: 0.0 ± 0.0
CysAsp: 0.873 ± 0.359
CysGlu: 0.698 ± 0.487
CysPhe: 0.524 ± 0.307
CysGly: 1.047 ± 0.492
CysHis: 0.0 ± 0.0
CysIle: 0.349 ± 0.265
CysLys: 1.222 ± 0.459
CysLeu: 0.349 ± 0.228
CysMet: 0.175 ± 0.153
CysAsn: 0.698 ± 0.291
CysPro: 0.349 ± 0.335
CysGln: 0.698 ± 0.316
CysArg: 0.349 ± 0.307
CysSer: 1.222 ± 0.387
CysThr: 0.698 ± 0.388
CysVal: 1.222 ± 0.34
CysTrp: 0.0 ± 0.0
CysTyr: 0.349 ± 0.256
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.316 ± 0.587
AspCys: 0.175 ± 0.176
AspAsp: 5.586 ± 1.094
AspGlu: 3.666 ± 1.071
AspPhe: 2.618 ± 0.932
AspGly: 3.84 ± 0.59
AspHis: 0.873 ± 0.371
AspIle: 5.935 ± 1.254
AspLys: 6.109 ± 0.84
AspLeu: 6.633 ± 1.117
AspMet: 1.571 ± 0.336
AspAsn: 4.189 ± 0.922
AspPro: 1.571 ± 0.474
AspGln: 4.015 ± 0.404
AspArg: 1.92 ± 0.443
AspSer: 2.967 ± 0.746
AspThr: 4.189 ± 0.878
AspVal: 5.062 ± 0.699
AspTrp: 0.698 ± 0.316
AspTyr: 2.618 ± 0.507
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.84 ± 0.581
GluCys: 0.873 ± 0.36
GluAsp: 4.538 ± 0.845
GluGlu: 4.713 ± 0.964
GluPhe: 3.491 ± 0.476
GluGly: 1.746 ± 0.735
GluHis: 1.571 ± 0.693
GluIle: 5.76 ± 1.126
GluLys: 5.935 ± 0.845
GluLeu: 4.887 ± 1.04
GluMet: 3.666 ± 0.739
GluAsn: 4.713 ± 0.819
GluPro: 2.095 ± 0.73
GluGln: 1.396 ± 0.618
GluArg: 4.887 ± 0.767
GluSer: 3.84 ± 0.802
GluThr: 3.84 ± 0.429
GluVal: 3.84 ± 0.711
GluTrp: 0.698 ± 0.411
GluTyr: 2.269 ± 0.539
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.444 ± 0.975
PheCys: 0.524 ± 0.232
PheAsp: 3.316 ± 0.98
PheGlu: 3.491 ± 0.764
PhePhe: 0.873 ± 0.49
PheGly: 2.095 ± 0.512
PheHis: 0.349 ± 0.214
PheIle: 2.793 ± 0.455
PheLys: 2.618 ± 0.504
PheLeu: 2.793 ± 0.754
PheMet: 0.524 ± 0.255
PheAsn: 3.316 ± 0.65
PhePro: 0.873 ± 0.547
PheGln: 0.698 ± 0.318
PheArg: 1.571 ± 0.41
PheSer: 3.142 ± 0.848
PheThr: 2.618 ± 0.652
PheVal: 2.095 ± 0.685
PheTrp: 0.349 ± 0.307
PheTyr: 1.396 ± 0.538
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.666 ± 0.607
GlyCys: 0.175 ± 0.206
GlyAsp: 3.142 ± 1.018
GlyGlu: 2.095 ± 0.768
GlyPhe: 1.746 ± 0.403
GlyGly: 2.967 ± 0.855
GlyHis: 1.047 ± 0.499
GlyIle: 3.666 ± 0.87
GlyLys: 2.967 ± 0.481
GlyLeu: 2.269 ± 0.504
GlyMet: 1.571 ± 0.412
GlyAsn: 2.444 ± 0.56
GlyPro: 1.222 ± 0.308
GlyGln: 1.92 ± 0.544
GlyArg: 2.095 ± 0.557
GlySer: 1.222 ± 0.459
GlyThr: 2.095 ± 0.566
GlyVal: 2.967 ± 0.517
GlyTrp: 0.698 ± 0.442
GlyTyr: 1.92 ± 0.517
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.047 ± 0.408
HisCys: 0.0 ± 0.0
HisAsp: 1.047 ± 0.374
HisGlu: 1.746 ± 0.464
HisPhe: 0.349 ± 0.278
HisGly: 1.047 ± 0.533
HisHis: 0.175 ± 0.151
HisIle: 1.571 ± 1.143
HisLys: 0.349 ± 0.279
HisLeu: 1.222 ± 0.367
HisMet: 0.524 ± 0.239
HisAsn: 0.698 ± 0.208
HisPro: 0.873 ± 0.401
HisGln: 0.0 ± 0.0
HisArg: 0.873 ± 0.443
HisSer: 0.698 ± 0.311
HisThr: 0.873 ± 0.327
HisVal: 1.222 ± 0.485
HisTrp: 0.175 ± 0.153
HisTyr: 0.698 ± 0.375
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.538 ± 0.907
IleCys: 1.047 ± 0.445
IleAsp: 5.062 ± 0.841
IleGlu: 6.633 ± 1.025
IlePhe: 4.189 ± 0.415
IleGly: 3.142 ± 0.564
IleHis: 1.047 ± 0.219
IleIle: 5.586 ± 0.704
IleLys: 6.633 ± 0.864
IleLeu: 6.807 ± 0.833
IleMet: 2.269 ± 0.506
IleAsn: 5.062 ± 0.869
IlePro: 2.967 ± 0.783
IleGln: 4.189 ± 0.902
IleArg: 4.538 ± 1.023
IleSer: 4.713 ± 0.839
IleThr: 4.015 ± 0.818
IleVal: 5.586 ± 1.308
IleTrp: 0.349 ± 0.197
IleTyr: 2.967 ± 0.974
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.269 ± 0.594
LysCys: 0.873 ± 0.51
LysAsp: 4.189 ± 0.995
LysGlu: 7.157 ± 0.978
LysPhe: 3.491 ± 1.268
LysGly: 2.095 ± 0.433
LysHis: 1.396 ± 0.436
LysIle: 8.902 ± 1.035
LysLys: 8.728 ± 1.643
LysLeu: 5.586 ± 0.776
LysMet: 2.967 ± 0.906
LysAsn: 4.887 ± 0.533
LysPro: 3.491 ± 0.838
LysGln: 3.316 ± 0.81
LysArg: 3.491 ± 0.847
LysSer: 3.316 ± 0.701
LysThr: 4.189 ± 0.882
LysVal: 4.713 ± 0.663
LysTrp: 1.222 ± 0.495
LysTyr: 1.92 ± 0.61
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.062 ± 0.878
LeuCys: 1.222 ± 0.487
LeuAsp: 4.364 ± 0.851
LeuGlu: 5.586 ± 1.513
LeuPhe: 1.571 ± 0.496
LeuGly: 3.666 ± 1.088
LeuHis: 1.746 ± 0.365
LeuIle: 7.331 ± 1.366
LeuLys: 5.935 ± 1.025
LeuLeu: 5.586 ± 0.861
LeuMet: 2.444 ± 0.829
LeuAsn: 4.887 ± 0.696
LeuPro: 2.444 ± 0.533
LeuGln: 3.84 ± 0.933
LeuArg: 5.586 ± 1.13
LeuSer: 6.109 ± 0.318
LeuThr: 4.189 ± 0.606
LeuVal: 2.793 ± 0.616
LeuTrp: 0.349 ± 0.231
LeuTyr: 2.967 ± 0.888
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.095 ± 0.672
MetCys: 0.175 ± 0.151
MetAsp: 1.92 ± 0.46
MetGlu: 2.269 ± 0.473
MetPhe: 0.873 ± 0.317
MetGly: 1.396 ± 0.357
MetHis: 0.349 ± 0.197
MetIle: 2.095 ± 0.528
MetLys: 3.316 ± 0.811
MetLeu: 4.015 ± 0.731
MetMet: 0.698 ± 0.354
MetAsn: 2.618 ± 0.359
MetPro: 1.047 ± 0.306
MetGln: 0.524 ± 0.263
MetArg: 1.571 ± 0.266
MetSer: 2.793 ± 0.618
MetThr: 1.746 ± 0.303
MetVal: 1.746 ± 0.478
MetTrp: 0.0 ± 0.0
MetTyr: 1.396 ± 0.615
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.538 ± 0.81
AsnCys: 1.222 ± 0.412
AsnAsp: 3.666 ± 1.025
AsnGlu: 4.887 ± 0.843
AsnPhe: 2.095 ± 0.357
AsnGly: 2.618 ± 0.586
AsnHis: 0.349 ± 0.221
AsnIle: 4.015 ± 0.576
AsnLys: 4.015 ± 0.926
AsnLeu: 4.713 ± 0.644
AsnMet: 1.396 ± 0.502
AsnAsn: 3.142 ± 0.76
AsnPro: 1.396 ± 0.471
AsnGln: 2.095 ± 0.721
AsnArg: 3.491 ± 1.25
AsnSer: 5.237 ± 0.585
AsnThr: 2.967 ± 0.543
AsnVal: 6.109 ± 1.208
AsnTrp: 1.047 ± 0.43
AsnTyr: 4.015 ± 0.97
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.698 ± 0.326
ProCys: 0.349 ± 0.206
ProAsp: 2.269 ± 0.685
ProGlu: 2.269 ± 0.4
ProPhe: 0.698 ± 0.333
ProGly: 1.571 ± 0.508
ProHis: 0.349 ± 0.164
ProIle: 1.92 ± 0.466
ProLys: 2.269 ± 0.543
ProLeu: 2.444 ± 0.467
ProMet: 1.571 ± 0.37
ProAsn: 1.746 ± 0.498
ProPro: 0.873 ± 0.306
ProGln: 1.222 ± 0.256
ProArg: 1.047 ± 0.371
ProSer: 2.793 ± 0.756
ProThr: 2.967 ± 0.759
ProVal: 2.618 ± 0.527
ProTrp: 0.524 ± 0.22
ProTyr: 2.269 ± 0.638
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.571 ± 0.583
GlnCys: 1.047 ± 0.475
GlnAsp: 2.095 ± 0.523
GlnGlu: 2.793 ± 0.888
GlnPhe: 2.095 ± 0.654
GlnGly: 1.571 ± 0.34
GlnHis: 0.524 ± 0.292
GlnIle: 3.142 ± 0.882
GlnLys: 2.444 ± 0.738
GlnLeu: 4.713 ± 1.013
GlnMet: 2.444 ± 0.424
GlnAsn: 3.142 ± 0.812
GlnPro: 1.396 ± 0.421
GlnGln: 1.746 ± 0.352
GlnArg: 2.444 ± 0.719
GlnSer: 2.967 ± 0.486
GlnThr: 2.095 ± 0.625
GlnVal: 1.92 ± 0.487
GlnTrp: 0.349 ± 0.197
GlnTyr: 1.571 ± 0.543
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.793 ± 0.802
ArgCys: 0.873 ± 0.29
ArgAsp: 3.666 ± 0.547
ArgGlu: 3.316 ± 0.766
ArgPhe: 1.746 ± 0.665
ArgGly: 2.269 ± 0.714
ArgHis: 0.524 ± 0.43
ArgIle: 4.887 ± 0.848
ArgLys: 3.666 ± 0.528
ArgLeu: 4.887 ± 0.966
ArgMet: 1.746 ± 0.314
ArgAsn: 2.444 ± 0.436
ArgPro: 1.92 ± 0.484
ArgGln: 2.618 ± 0.83
ArgArg: 2.793 ± 0.896
ArgSer: 2.967 ± 0.712
ArgThr: 4.538 ± 0.859
ArgVal: 3.316 ± 0.956
ArgTrp: 0.524 ± 0.217
ArgTyr: 1.92 ± 0.407
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.189 ± 1.399
SerCys: 0.175 ± 0.139
SerAsp: 5.586 ± 0.829
SerGlu: 4.364 ± 1.489
SerPhe: 2.444 ± 0.734
SerGly: 4.538 ± 1.174
SerHis: 1.047 ± 0.539
SerIle: 4.538 ± 0.875
SerLys: 4.713 ± 0.692
SerLeu: 3.142 ± 0.666
SerMet: 1.746 ± 0.542
SerAsn: 4.538 ± 0.894
SerPro: 1.92 ± 0.513
SerGln: 2.444 ± 0.84
SerArg: 4.538 ± 0.855
SerSer: 3.666 ± 0.818
SerThr: 3.491 ± 0.431
SerVal: 3.142 ± 0.559
SerTrp: 0.873 ± 0.337
SerTyr: 2.967 ± 0.96
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.793 ± 0.649
ThrCys: 0.349 ± 0.227
ThrAsp: 4.538 ± 0.493
ThrGlu: 3.142 ± 0.42
ThrPhe: 1.746 ± 0.283
ThrGly: 3.142 ± 0.452
ThrHis: 1.222 ± 0.435
ThrIle: 5.062 ± 0.619
ThrLys: 3.84 ± 0.776
ThrLeu: 4.713 ± 0.984
ThrMet: 1.571 ± 0.462
ThrAsn: 3.316 ± 0.42
ThrPro: 2.269 ± 0.666
ThrGln: 3.666 ± 1.004
ThrArg: 2.444 ± 0.638
ThrSer: 5.411 ± 1.007
ThrThr: 3.666 ± 0.625
ThrVal: 4.713 ± 0.532
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.142 ± 0.666
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.142 ± 0.419
ValCys: 0.698 ± 0.356
ValAsp: 5.935 ± 1.08
ValGlu: 2.967 ± 0.484
ValPhe: 2.618 ± 0.956
ValGly: 0.873 ± 0.387
ValHis: 0.349 ± 0.267
ValIle: 2.967 ± 0.867
ValLys: 4.189 ± 1.019
ValLeu: 6.982 ± 0.978
ValMet: 2.095 ± 0.587
ValAsn: 5.237 ± 1.397
ValPro: 2.095 ± 0.774
ValGln: 2.444 ± 0.326
ValArg: 3.84 ± 0.616
ValSer: 5.237 ± 0.65
ValThr: 4.364 ± 0.878
ValVal: 3.666 ± 1.033
ValTrp: 0.524 ± 0.227
ValTyr: 2.967 ± 0.961
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.873 ± 0.367
TrpCys: 0.175 ± 0.153
TrpAsp: 0.175 ± 0.139
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.349 ± 0.228
TrpHis: 0.175 ± 0.153
TrpIle: 0.698 ± 0.349
TrpLys: 1.571 ± 0.484
TrpLeu: 0.524 ± 0.224
TrpMet: 0.175 ± 0.139
TrpAsn: 0.524 ± 0.253
TrpPro: 0.175 ± 0.153
TrpGln: 1.047 ± 0.388
TrpArg: 1.047 ± 0.284
TrpSer: 0.698 ± 0.407
TrpThr: 0.349 ± 0.197
TrpVal: 0.175 ± 0.176
TrpTrp: 0.175 ± 0.153
TrpTyr: 0.698 ± 0.483
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.967 ± 0.627
TyrCys: 0.524 ± 0.386
TyrAsp: 2.269 ± 0.632
TyrGlu: 3.84 ± 0.685
TyrPhe: 1.746 ± 0.736
TyrGly: 1.222 ± 0.427
TyrHis: 1.571 ± 0.474
TyrIle: 4.015 ± 1.095
TyrLys: 2.618 ± 0.66
TyrLeu: 2.095 ± 0.486
TyrMet: 1.222 ± 0.392
TyrAsn: 2.444 ± 0.565
TyrPro: 1.396 ± 0.308
TyrGln: 1.222 ± 0.447
TyrArg: 2.269 ± 0.789
TyrSer: 1.92 ± 0.711
TyrThr: 4.189 ± 1.362
TyrVal: 2.618 ± 0.574
TyrTrp: 0.175 ± 0.157
TyrTyr: 2.444 ± 0.669
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 12 proteins (5730 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
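A protein-level bootstrap of this kind can be sketched as below. The page does not describe the exact procedure, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the sample standard deviation across replicates. All function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def frequency_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping windows."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    if total == 0:
        return 0.0
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == dipeptide
    )
    return 1000.0 * hits / total


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the spread across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome using one-letter codes
toy = ["AAKIL", "KKIAA", "LKIKK"]
error = bootstrap_error(toy, "KI")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins are in the set. For a proteome of only 12 proteins that dependence can be substantial.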

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski