Amino acid dipepetide frequency for Rotavirus J

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.302AlaAla: 5.302 ± 1.31
0.166AlaCys: 0.166 ± 0.198
3.313AlaAsp: 3.313 ± 1.035
3.81AlaGlu: 3.81 ± 0.499
3.148AlaPhe: 3.148 ± 0.67
1.988AlaGly: 1.988 ± 0.581
0.828AlaHis: 0.828 ± 0.341
5.136AlaIle: 5.136 ± 0.778
4.142AlaLys: 4.142 ± 1.089
5.964AlaLeu: 5.964 ± 1.419
1.325AlaMet: 1.325 ± 0.42
2.651AlaAsn: 2.651 ± 0.879
1.822AlaPro: 1.822 ± 0.753
4.142AlaGln: 4.142 ± 0.801
4.639AlaArg: 4.639 ± 0.498
4.307AlaSer: 4.307 ± 1.023
4.639AlaThr: 4.639 ± 0.669
2.651AlaVal: 2.651 ± 0.641
0.994AlaTrp: 0.994 ± 0.469
3.148AlaTyr: 3.148 ± 0.675
0.0AlaXaa: 0.0 ± 0.0
Cys
0.497CysAla: 0.497 ± 0.438
0.166CysCys: 0.166 ± 0.164
0.828CysAsp: 0.828 ± 0.414
0.828CysGlu: 0.828 ± 0.505
0.828CysPhe: 0.828 ± 0.436
0.663CysGly: 0.663 ± 0.681
0.331CysHis: 0.331 ± 0.199
0.497CysIle: 0.497 ± 0.308
0.497CysLys: 0.497 ± 0.33
0.828CysLeu: 0.828 ± 0.479
0.0CysMet: 0.0 ± 0.0
0.663CysAsn: 0.663 ± 0.264
0.0CysPro: 0.0 ± 0.0
0.663CysGln: 0.663 ± 0.308
0.166CysArg: 0.166 ± 0.157
0.331CysSer: 0.331 ± 0.278
0.828CysThr: 0.828 ± 0.398
0.497CysVal: 0.497 ± 0.274
0.0CysTrp: 0.0 ± 0.0
0.828CysTyr: 0.828 ± 0.414
0.0CysXaa: 0.0 ± 0.0
Asp
5.136AspAla: 5.136 ± 1.322
0.331AspCys: 0.331 ± 0.229
4.307AspAsp: 4.307 ± 0.693
4.805AspGlu: 4.805 ± 1.101
2.319AspPhe: 2.319 ± 0.846
2.154AspGly: 2.154 ± 0.769
0.828AspHis: 0.828 ± 0.313
4.142AspIle: 4.142 ± 1.205
2.982AspLys: 2.982 ± 0.787
6.627AspLeu: 6.627 ± 0.904
0.994AspMet: 0.994 ± 0.419
4.473AspAsn: 4.473 ± 1.091
2.816AspPro: 2.816 ± 0.842
2.319AspGln: 2.319 ± 0.626
3.313AspArg: 3.313 ± 0.808
2.651AspSer: 2.651 ± 0.792
1.988AspThr: 1.988 ± 0.641
4.307AspVal: 4.307 ± 0.67
0.994AspTrp: 0.994 ± 0.374
2.485AspTyr: 2.485 ± 0.721
0.0AspXaa: 0.0 ± 0.0
Glu
2.982GluAla: 2.982 ± 0.925
0.497GluCys: 0.497 ± 0.275
4.805GluAsp: 4.805 ± 0.99
4.805GluGlu: 4.805 ± 1.148
1.988GluPhe: 1.988 ± 0.752
1.988GluGly: 1.988 ± 0.56
1.16GluHis: 1.16 ± 0.576
6.627GluIle: 6.627 ± 0.938
4.639GluLys: 4.639 ± 0.811
4.97GluLeu: 4.97 ± 1.042
1.988GluMet: 1.988 ± 0.69
3.645GluAsn: 3.645 ± 0.888
1.325GluPro: 1.325 ± 0.401
2.154GluGln: 2.154 ± 0.848
4.307GluArg: 4.307 ± 1.261
5.136GluSer: 5.136 ± 0.906
4.142GluThr: 4.142 ± 0.693
3.148GluVal: 3.148 ± 0.671
0.994GluTrp: 0.994 ± 0.395
2.485GluTyr: 2.485 ± 0.491
0.0GluXaa: 0.0 ± 0.0
Phe
2.319PheAla: 2.319 ± 0.428
0.663PheCys: 0.663 ± 0.533
3.479PheAsp: 3.479 ± 1.101
2.485PheGlu: 2.485 ± 0.467
0.994PhePhe: 0.994 ± 0.393
1.822PheGly: 1.822 ± 0.417
0.828PheHis: 0.828 ± 0.516
3.148PheIle: 3.148 ± 0.413
2.485PheLys: 2.485 ± 0.392
3.479PheLeu: 3.479 ± 0.644
0.497PheMet: 0.497 ± 0.306
3.313PheAsn: 3.313 ± 0.556
1.657PhePro: 1.657 ± 0.541
1.657PheGln: 1.657 ± 0.546
2.651PheArg: 2.651 ± 0.73
3.645PheSer: 3.645 ± 1.075
2.816PheThr: 2.816 ± 0.783
1.491PheVal: 1.491 ± 0.636
0.331PheTrp: 0.331 ± 0.251
1.988PheTyr: 1.988 ± 0.52
0.0PheXaa: 0.0 ± 0.0
Gly
2.485GlyAla: 2.485 ± 0.524
0.663GlyCys: 0.663 ± 0.32
1.657GlyAsp: 1.657 ± 0.52
1.491GlyGlu: 1.491 ± 0.494
2.154GlyPhe: 2.154 ± 0.813
1.822GlyGly: 1.822 ± 0.845
1.16GlyHis: 1.16 ± 0.44
3.148GlyIle: 3.148 ± 0.481
3.479GlyLys: 3.479 ± 0.855
3.976GlyLeu: 3.976 ± 0.67
0.828GlyMet: 0.828 ± 0.494
3.479GlyAsn: 3.479 ± 1.075
1.657GlyPro: 1.657 ± 0.464
0.828GlyGln: 0.828 ± 0.396
2.154GlyArg: 2.154 ± 0.444
1.822GlySer: 1.822 ± 0.431
2.319GlyThr: 2.319 ± 0.773
2.485GlyVal: 2.485 ± 0.415
0.497GlyTrp: 0.497 ± 0.324
1.657GlyTyr: 1.657 ± 0.307
0.0GlyXaa: 0.0 ± 0.0
His
0.994HisAla: 0.994 ± 0.363
0.0HisCys: 0.0 ± 0.0
0.663HisAsp: 0.663 ± 0.309
1.16HisGlu: 1.16 ± 0.437
0.663HisPhe: 0.663 ± 0.398
1.491HisGly: 1.491 ± 0.616
0.497HisHis: 0.497 ± 0.33
0.994HisIle: 0.994 ± 0.439
0.331HisLys: 0.331 ± 0.217
1.988HisLeu: 1.988 ± 0.6
0.828HisMet: 0.828 ± 0.316
0.994HisAsn: 0.994 ± 0.404
1.16HisPro: 1.16 ± 0.505
0.331HisGln: 0.331 ± 0.287
0.663HisArg: 0.663 ± 0.319
1.16HisSer: 1.16 ± 0.337
0.994HisThr: 0.994 ± 0.338
1.491HisVal: 1.491 ± 0.554
0.166HisTrp: 0.166 ± 0.164
0.497HisTyr: 0.497 ± 0.309
0.0HisXaa: 0.0 ± 0.0
Ile
4.307IleAla: 4.307 ± 0.717
0.828IleCys: 0.828 ± 0.516
4.805IleAsp: 4.805 ± 0.628
5.467IleGlu: 5.467 ± 0.958
3.976IlePhe: 3.976 ± 0.678
3.479IleGly: 3.479 ± 1.24
1.325IleHis: 1.325 ± 0.579
5.136IleIle: 5.136 ± 1.035
4.307IleLys: 4.307 ± 0.956
5.136IleLeu: 5.136 ± 0.696
2.154IleMet: 2.154 ± 0.877
4.97IleAsn: 4.97 ± 0.782
4.307IlePro: 4.307 ± 0.975
3.313IleGln: 3.313 ± 0.828
3.645IleArg: 3.645 ± 0.603
5.633IleSer: 5.633 ± 1.253
5.799IleThr: 5.799 ± 1.431
4.307IleVal: 4.307 ± 1.139
0.331IleTrp: 0.331 ± 0.245
2.651IleTyr: 2.651 ± 0.538
0.0IleXaa: 0.0 ± 0.0
Lys
4.142LysAla: 4.142 ± 0.873
0.663LysCys: 0.663 ± 0.296
2.651LysAsp: 2.651 ± 0.505
3.976LysGlu: 3.976 ± 0.962
1.657LysPhe: 1.657 ± 0.751
1.988LysGly: 1.988 ± 0.714
1.16LysHis: 1.16 ± 0.261
8.449LysIle: 8.449 ± 1.284
5.799LysLys: 5.799 ± 1.373
7.124LysLeu: 7.124 ± 1.205
1.491LysMet: 1.491 ± 0.612
5.302LysAsn: 5.302 ± 1.103
2.485LysPro: 2.485 ± 0.92
3.313LysGln: 3.313 ± 0.753
3.148LysArg: 3.148 ± 0.806
3.313LysSer: 3.313 ± 0.849
4.639LysThr: 4.639 ± 1.042
4.142LysVal: 4.142 ± 0.781
1.491LysTrp: 1.491 ± 0.466
2.154LysTyr: 2.154 ± 0.666
0.0LysXaa: 0.0 ± 0.0
Leu
6.958LeuAla: 6.958 ± 1.529
1.491LeuCys: 1.491 ± 0.397
4.639LeuAsp: 4.639 ± 0.951
5.964LeuGlu: 5.964 ± 1.678
4.307LeuPhe: 4.307 ± 0.571
2.982LeuGly: 2.982 ± 0.606
1.988LeuHis: 1.988 ± 0.385
5.136LeuIle: 5.136 ± 0.989
4.142LeuLys: 4.142 ± 0.592
9.609LeuLeu: 9.609 ± 1.411
3.313LeuMet: 3.313 ± 0.987
4.473LeuAsn: 4.473 ± 1.086
3.313LeuPro: 3.313 ± 0.559
4.473LeuGln: 4.473 ± 0.981
5.964LeuArg: 5.964 ± 0.979
7.952LeuSer: 7.952 ± 1.227
4.473LeuThr: 4.473 ± 0.882
5.136LeuVal: 5.136 ± 1.05
0.497LeuTrp: 0.497 ± 0.211
3.479LeuTyr: 3.479 ± 0.876
0.0LeuXaa: 0.0 ± 0.0
Met
2.319MetAla: 2.319 ± 0.597
0.331MetCys: 0.331 ± 0.248
1.16MetAsp: 1.16 ± 0.421
0.994MetGlu: 0.994 ± 0.501
0.994MetPhe: 0.994 ± 0.411
1.16MetGly: 1.16 ± 0.397
0.497MetHis: 0.497 ± 0.317
1.988MetIle: 1.988 ± 0.463
1.988MetLys: 1.988 ± 0.651
2.319MetLeu: 2.319 ± 0.485
0.663MetMet: 0.663 ± 0.485
1.16MetAsn: 1.16 ± 0.443
0.663MetPro: 0.663 ± 0.211
1.16MetGln: 1.16 ± 0.708
1.822MetArg: 1.822 ± 0.529
2.154MetSer: 2.154 ± 0.859
2.485MetThr: 2.485 ± 0.591
1.657MetVal: 1.657 ± 0.529
0.331MetTrp: 0.331 ± 0.341
1.16MetTyr: 1.16 ± 0.545
0.0MetXaa: 0.0 ± 0.0
Asn
4.473AsnAla: 4.473 ± 0.998
0.497AsnCys: 0.497 ± 0.228
3.976AsnAsp: 3.976 ± 0.802
4.805AsnGlu: 4.805 ± 0.739
2.154AsnPhe: 2.154 ± 0.626
3.313AsnGly: 3.313 ± 0.971
0.663AsnHis: 0.663 ± 0.231
2.982AsnIle: 2.982 ± 0.673
4.805AsnLys: 4.805 ± 1.114
4.307AsnLeu: 4.307 ± 0.89
1.988AsnMet: 1.988 ± 0.92
3.976AsnAsn: 3.976 ± 1.266
2.982AsnPro: 2.982 ± 0.779
2.651AsnGln: 2.651 ± 0.478
2.651AsnArg: 2.651 ± 0.717
4.639AsnSer: 4.639 ± 1.106
3.645AsnThr: 3.645 ± 0.647
4.805AsnVal: 4.805 ± 1.03
0.663AsnTrp: 0.663 ± 0.394
2.154AsnTyr: 2.154 ± 0.554
0.0AsnXaa: 0.0 ± 0.0
Pro
1.657ProAla: 1.657 ± 0.314
0.166ProCys: 0.166 ± 0.17
2.982ProAsp: 2.982 ± 0.617
2.154ProGlu: 2.154 ± 0.661
1.822ProPhe: 1.822 ± 0.452
2.154ProGly: 2.154 ± 0.498
0.994ProHis: 0.994 ± 0.472
4.307ProIle: 4.307 ± 0.636
3.148ProLys: 3.148 ± 0.558
2.154ProLeu: 2.154 ± 0.469
0.994ProMet: 0.994 ± 0.424
2.319ProAsn: 2.319 ± 0.646
1.325ProPro: 1.325 ± 0.405
1.822ProGln: 1.822 ± 0.739
0.497ProArg: 0.497 ± 0.217
2.982ProSer: 2.982 ± 0.609
3.313ProThr: 3.313 ± 0.637
1.822ProVal: 1.822 ± 0.628
0.497ProTrp: 0.497 ± 0.187
1.325ProTyr: 1.325 ± 0.408
0.0ProXaa: 0.0 ± 0.0
Gln
1.491GlnAla: 1.491 ± 0.609
0.994GlnCys: 0.994 ± 0.571
1.988GlnAsp: 1.988 ± 0.339
1.822GlnGlu: 1.822 ± 0.852
0.994GlnPhe: 0.994 ± 0.304
1.16GlnGly: 1.16 ± 0.558
0.994GlnHis: 0.994 ± 0.397
3.81GlnIle: 3.81 ± 0.698
2.982GlnLys: 2.982 ± 0.735
5.467GlnLeu: 5.467 ± 1.557
1.491GlnMet: 1.491 ± 0.455
3.313GlnAsn: 3.313 ± 0.646
1.491GlnPro: 1.491 ± 0.389
2.319GlnGln: 2.319 ± 0.796
3.81GlnArg: 3.81 ± 0.95
3.645GlnSer: 3.645 ± 0.917
2.651GlnThr: 2.651 ± 0.49
1.822GlnVal: 1.822 ± 0.387
0.331GlnTrp: 0.331 ± 0.182
1.491GlnTyr: 1.491 ± 0.331
0.0GlnXaa: 0.0 ± 0.0
Arg
2.485ArgAla: 2.485 ± 0.572
0.663ArgCys: 0.663 ± 0.274
3.645ArgAsp: 3.645 ± 0.739
3.645ArgGlu: 3.645 ± 0.546
2.485ArgPhe: 2.485 ± 0.644
2.651ArgGly: 2.651 ± 0.64
0.663ArgHis: 0.663 ± 0.256
4.142ArgIle: 4.142 ± 0.709
5.136ArgLys: 5.136 ± 0.763
5.799ArgLeu: 5.799 ± 1.008
2.982ArgMet: 2.982 ± 0.713
2.816ArgAsn: 2.816 ± 0.661
1.491ArgPro: 1.491 ± 0.466
2.485ArgGln: 2.485 ± 0.75
4.307ArgArg: 4.307 ± 1.217
3.976ArgSer: 3.976 ± 0.607
2.816ArgThr: 2.816 ± 0.737
2.319ArgVal: 2.319 ± 0.653
0.663ArgTrp: 0.663 ± 0.33
1.491ArgTyr: 1.491 ± 0.452
0.0ArgXaa: 0.0 ± 0.0
Ser
5.799SerAla: 5.799 ± 1.148
0.497SerCys: 0.497 ± 0.345
5.136SerAsp: 5.136 ± 1.006
3.81SerGlu: 3.81 ± 1.538
5.136SerPhe: 5.136 ± 0.67
3.148SerGly: 3.148 ± 0.548
0.497SerHis: 0.497 ± 0.292
5.964SerIle: 5.964 ± 0.793
5.467SerLys: 5.467 ± 1.171
4.97SerLeu: 4.97 ± 0.801
1.16SerMet: 1.16 ± 0.414
3.976SerAsn: 3.976 ± 1.034
3.479SerPro: 3.479 ± 0.809
3.81SerGln: 3.81 ± 1.013
3.645SerArg: 3.645 ± 0.699
4.473SerSer: 4.473 ± 0.641
3.313SerThr: 3.313 ± 0.665
5.136SerVal: 5.136 ± 0.677
0.0SerTrp: 0.0 ± 0.0
2.154SerTyr: 2.154 ± 0.525
0.0SerXaa: 0.0 ± 0.0
Thr
3.645ThrAla: 3.645 ± 1.134
0.497ThrCys: 0.497 ± 0.249
2.154ThrAsp: 2.154 ± 0.498
4.639ThrGlu: 4.639 ± 1.322
2.651ThrPhe: 2.651 ± 0.593
2.651ThrGly: 2.651 ± 1.021
0.828ThrHis: 0.828 ± 0.292
4.805ThrIle: 4.805 ± 0.825
3.976ThrLys: 3.976 ± 0.755
6.461ThrLeu: 6.461 ± 1.276
1.822ThrMet: 1.822 ± 0.496
2.319ThrAsn: 2.319 ± 0.475
2.982ThrPro: 2.982 ± 0.47
2.651ThrGln: 2.651 ± 0.624
2.982ThrArg: 2.982 ± 0.709
4.97ThrSer: 4.97 ± 0.7
5.302ThrThr: 5.302 ± 1.496
4.805ThrVal: 4.805 ± 1.081
0.497ThrTrp: 0.497 ± 0.402
1.822ThrTyr: 1.822 ± 0.551
0.0ThrXaa: 0.0 ± 0.0
Val
4.473ValAla: 4.473 ± 1.11
0.331ValCys: 0.331 ± 0.229
3.976ValAsp: 3.976 ± 0.629
3.645ValGlu: 3.645 ± 0.748
2.154ValPhe: 2.154 ± 0.555
1.657ValGly: 1.657 ± 0.493
0.663ValHis: 0.663 ± 0.252
2.816ValIle: 2.816 ± 0.828
5.136ValLys: 5.136 ± 0.48
4.805ValLeu: 4.805 ± 0.856
1.325ValMet: 1.325 ± 0.412
4.97ValAsn: 4.97 ± 0.576
2.154ValPro: 2.154 ± 0.761
2.651ValGln: 2.651 ± 0.661
3.976ValArg: 3.976 ± 0.698
4.307ValSer: 4.307 ± 0.832
3.148ValThr: 3.148 ± 0.777
2.982ValVal: 2.982 ± 0.526
0.331ValTrp: 0.331 ± 0.192
2.651ValTyr: 2.651 ± 0.726
0.0ValXaa: 0.0 ± 0.0
Trp
0.497TrpAla: 0.497 ± 0.22
0.0TrpCys: 0.0 ± 0.0
0.994TrpAsp: 0.994 ± 0.332
0.663TrpGlu: 0.663 ± 0.483
0.166TrpPhe: 0.166 ± 0.17
0.166TrpGly: 0.166 ± 0.158
0.0TrpHis: 0.0 ± 0.0
0.828TrpIle: 0.828 ± 0.341
1.491TrpLys: 1.491 ± 0.613
1.325TrpLeu: 1.325 ± 0.33
0.0TrpMet: 0.0 ± 0.0
0.828TrpAsn: 0.828 ± 0.309
0.166TrpPro: 0.166 ± 0.148
0.497TrpGln: 0.497 ± 0.261
1.325TrpArg: 1.325 ± 0.358
0.331TrpSer: 0.331 ± 0.193
0.166TrpThr: 0.166 ± 0.185
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.497TrpTyr: 0.497 ± 0.339
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.154TyrAla: 2.154 ± 0.698
0.497TyrCys: 0.497 ± 0.323
2.816TyrAsp: 2.816 ± 0.611
2.816TyrGlu: 2.816 ± 0.811
1.16TyrPhe: 1.16 ± 0.414
1.325TyrGly: 1.325 ± 0.413
1.16TyrHis: 1.16 ± 0.55
1.988TyrIle: 1.988 ± 0.642
2.154TyrLys: 2.154 ± 0.467
2.982TyrLeu: 2.982 ± 0.687
0.994TyrMet: 0.994 ± 0.381
2.485TyrAsn: 2.485 ± 0.434
1.16TyrPro: 1.16 ± 0.517
0.828TyrGln: 0.828 ± 0.482
0.994TyrArg: 0.994 ± 0.4
4.142TyrSer: 4.142 ± 0.901
2.982TyrThr: 2.982 ± 0.938
3.148TyrVal: 3.148 ± 0.836
0.331TyrTrp: 0.331 ± 0.268
1.822TyrTyr: 1.822 ± 0.426
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (6037 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski