Amino acid dipeptide frequency for Rotavirus X (strain RVX/Human/China/NADRV-J19/1997/GXP[X]) (RV ADRV-N) (Rotavirus (isolate novel adult diarrhea rotavirus-J19))

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
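The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used to build the table: it assumes overlapping dipeptides are counted within each protein (never across protein boundaries) and normalized by the total number of dipeptides. The function name `dipeptide_permille` and the toy sequences are hypothetical, and one-letter residue codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings.

    Assumption: overlapping windows of length 2 are counted inside each
    protein, and frequencies are normalized by the total dipeptide count.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of two residues along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the Rotavirus X proteome.
toy = ["MKILLA", "MKIA"]
freqs = dipeptide_permille(toy)

# Expectation if all 400 standard dipeptides were equally likely.
uniform_expectation = 1000.0 / 400

for pair, value in sorted(freqs.items()):
    label = "over" if value > uniform_expectation else "under"
    print(f"{pair}: {value:.3f} per mille ({label}represented)")
```

With real data, the same comparison against 2.5‰ separates the dipeptides that would be shown in red from those shown in blue.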

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.051 ± 1.053
AlaCys: 0.528 ± 0.409
AlaAsp: 2.466 ± 0.975
AlaGlu: 4.227 ± 0.94
AlaPhe: 2.818 ± 0.72
AlaGly: 1.937 ± 0.92
AlaHis: 0.352 ± 0.168
AlaIle: 4.755 ± 0.917
AlaLys: 4.403 ± 1.059
AlaLeu: 4.755 ± 0.555
AlaMet: 1.585 ± 0.58
AlaAsn: 4.051 ± 0.914
AlaPro: 1.585 ± 0.642
AlaGln: 2.642 ± 0.466
AlaArg: 2.113 ± 0.628
AlaSer: 5.107 ± 0.641
AlaThr: 4.755 ± 0.912
AlaVal: 2.818 ± 0.725
AlaTrp: 0.528 ± 0.251
AlaTyr: 2.113 ± 0.591
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.352 ± 0.202
CysCys: 0.176 ± 0.2
CysAsp: 0.881 ± 0.464
CysGlu: 1.585 ± 0.447
CysPhe: 0.528 ± 0.22
CysGly: 1.233 ± 0.797
CysHis: 0.0 ± 0.0
CysIle: 1.409 ± 0.539
CysLys: 0.881 ± 0.312
CysLeu: 0.528 ± 0.467
CysMet: 0.528 ± 0.409
CysAsn: 1.233 ± 0.359
CysPro: 0.176 ± 0.196
CysGln: 0.528 ± 0.345
CysArg: 0.176 ± 0.211
CysSer: 0.528 ± 0.335
CysThr: 0.528 ± 0.418
CysVal: 0.352 ± 0.217
CysTrp: 0.0 ± 0.0
CysTyr: 0.704 ± 0.434
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.17 ± 1.083
AspCys: 0.352 ± 0.297
AspAsp: 3.346 ± 0.532
AspGlu: 4.051 ± 0.892
AspPhe: 1.409 ± 0.469
AspGly: 2.642 ± 0.58
AspHis: 0.352 ± 0.207
AspIle: 5.284 ± 0.881
AspLys: 4.579 ± 0.859
AspLeu: 6.34 ± 1.096
AspMet: 1.761 ± 0.561
AspAsn: 4.755 ± 0.95
AspPro: 2.113 ± 0.692
AspGln: 2.113 ± 0.653
AspArg: 4.051 ± 0.95
AspSer: 4.579 ± 1.106
AspThr: 2.818 ± 0.544
AspVal: 3.875 ± 0.809
AspTrp: 0.881 ± 0.48
AspTyr: 2.466 ± 0.658
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.17 ± 0.935
GluCys: 0.352 ± 0.238
GluAsp: 2.466 ± 0.683
GluGlu: 3.522 ± 0.725
GluPhe: 2.994 ± 0.819
GluGly: 2.466 ± 0.783
GluHis: 1.233 ± 0.705
GluIle: 4.227 ± 0.657
GluLys: 4.755 ± 0.691
GluLeu: 5.636 ± 0.566
GluMet: 2.29 ± 0.727
GluAsn: 4.051 ± 0.851
GluPro: 1.937 ± 0.539
GluGln: 2.29 ± 0.441
GluArg: 2.113 ± 0.775
GluSer: 5.812 ± 1.148
GluThr: 3.17 ± 0.758
GluVal: 2.994 ± 0.858
GluTrp: 0.352 ± 0.182
GluTyr: 2.642 ± 0.765
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.937 ± 0.312
PheCys: 0.881 ± 0.628
PheAsp: 4.403 ± 1.253
PheGlu: 2.994 ± 0.553
PhePhe: 1.409 ± 0.448
PheGly: 3.17 ± 0.733
PheHis: 1.057 ± 0.435
PheIle: 2.994 ± 0.8
PheLys: 3.522 ± 0.917
PheLeu: 2.642 ± 0.531
PheMet: 0.528 ± 0.302
PheAsn: 2.466 ± 0.31
PhePro: 1.409 ± 0.468
PheGln: 1.233 ± 0.289
PheArg: 3.17 ± 0.728
PheSer: 4.227 ± 0.716
PheThr: 3.17 ± 0.436
PheVal: 2.113 ± 0.615
PheTrp: 0.352 ± 0.255
PheTyr: 1.761 ± 0.576
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.818 ± 0.793
GlyCys: 0.528 ± 0.274
GlyAsp: 2.29 ± 0.607
GlyGlu: 1.761 ± 0.703
GlyPhe: 2.29 ± 0.599
GlyGly: 2.29 ± 0.947
GlyHis: 1.761 ± 0.632
GlyIle: 4.051 ± 0.42
GlyLys: 2.466 ± 0.772
GlyLeu: 2.818 ± 0.913
GlyMet: 0.704 ± 0.207
GlyAsn: 2.994 ± 0.802
GlyPro: 1.233 ± 0.588
GlyGln: 1.057 ± 0.412
GlyArg: 1.585 ± 0.486
GlySer: 1.585 ± 0.255
GlyThr: 2.818 ± 0.718
GlyVal: 2.466 ± 0.834
GlyTrp: 0.176 ± 0.196
GlyTyr: 1.585 ± 0.41
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.585 ± 0.5
HisCys: 0.176 ± 0.2
HisAsp: 1.233 ± 0.398
HisGlu: 1.057 ± 0.3
HisPhe: 0.528 ± 0.301
HisGly: 0.881 ± 0.378
HisHis: 0.176 ± 0.2
HisIle: 1.057 ± 0.531
HisLys: 0.528 ± 0.441
HisLeu: 2.29 ± 0.67
HisMet: 1.057 ± 0.34
HisAsn: 0.528 ± 0.22
HisPro: 0.881 ± 0.319
HisGln: 0.704 ± 0.467
HisArg: 0.881 ± 0.387
HisSer: 2.113 ± 0.469
HisThr: 1.057 ± 0.437
HisVal: 0.881 ± 0.322
HisTrp: 0.176 ± 0.16
HisTyr: 0.704 ± 0.342
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.107 ± 1.074
IleCys: 1.057 ± 0.455
IleAsp: 5.284 ± 0.713
IleGlu: 6.692 ± 0.963
IlePhe: 3.522 ± 0.733
IleGly: 3.522 ± 0.766
IleHis: 1.409 ± 0.349
IleIle: 5.46 ± 0.559
IleLys: 5.636 ± 1.104
IleLeu: 5.636 ± 0.925
IleMet: 1.761 ± 0.428
IleAsn: 5.812 ± 0.886
IlePro: 3.17 ± 0.338
IleGln: 3.522 ± 0.836
IleArg: 4.227 ± 0.887
IleSer: 7.221 ± 0.761
IleThr: 4.931 ± 0.699
IleVal: 5.284 ± 1.236
IleTrp: 0.176 ± 0.166
IleTyr: 2.642 ± 0.635
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.875 ± 0.881
LysCys: 1.233 ± 0.575
LysAsp: 2.466 ± 0.81
LysGlu: 5.107 ± 0.909
LysPhe: 2.113 ± 0.828
LysGly: 1.761 ± 0.69
LysHis: 0.881 ± 0.311
LysIle: 8.806 ± 1.272
LysLys: 6.692 ± 1.038
LysLeu: 5.988 ± 0.779
LysMet: 1.233 ± 0.179
LysAsn: 4.579 ± 1.009
LysPro: 1.409 ± 0.602
LysGln: 2.642 ± 0.458
LysArg: 3.698 ± 0.856
LysSer: 3.346 ± 0.682
LysThr: 4.931 ± 0.813
LysVal: 4.227 ± 0.532
LysTrp: 0.881 ± 0.322
LysTyr: 3.346 ± 0.902
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.46 ± 0.898
LeuCys: 1.409 ± 0.581
LeuAsp: 6.692 ± 0.686
LeuGlu: 3.698 ± 1.134
LeuPhe: 4.755 ± 0.893
LeuGly: 2.466 ± 0.733
LeuHis: 1.761 ± 0.434
LeuIle: 6.34 ± 0.722
LeuLys: 6.164 ± 0.955
LeuLeu: 8.806 ± 1.026
LeuMet: 1.937 ± 0.665
LeuAsn: 5.636 ± 1.007
LeuPro: 3.17 ± 0.66
LeuGln: 4.579 ± 0.762
LeuArg: 3.698 ± 0.728
LeuSer: 7.221 ± 1.092
LeuThr: 5.812 ± 0.648
LeuVal: 4.051 ± 0.817
LeuTrp: 0.352 ± 0.243
LeuTyr: 3.346 ± 1.005
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.761 ± 0.417
MetCys: 0.528 ± 0.306
MetAsp: 1.057 ± 0.649
MetGlu: 1.937 ± 0.515
MetPhe: 1.937 ± 0.558
MetGly: 1.585 ± 0.458
MetHis: 0.176 ± 0.16
MetIle: 2.466 ± 0.407
MetLys: 1.409 ± 0.296
MetLeu: 2.113 ± 0.573
MetMet: 1.233 ± 0.386
MetAsn: 1.937 ± 0.479
MetPro: 0.881 ± 0.311
MetGln: 0.528 ± 0.342
MetArg: 1.409 ± 0.592
MetSer: 3.17 ± 0.753
MetThr: 1.409 ± 0.552
MetVal: 1.761 ± 0.464
MetTrp: 0.0 ± 0.0
MetTyr: 0.704 ± 0.506
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.107 ± 1.225
AsnCys: 0.881 ± 0.382
AsnAsp: 4.931 ± 0.871
AsnGlu: 2.642 ± 0.49
AsnPhe: 2.29 ± 0.407
AsnGly: 1.761 ± 0.397
AsnHis: 1.937 ± 0.626
AsnIle: 4.403 ± 0.724
AsnLys: 3.17 ± 0.921
AsnLeu: 6.164 ± 0.709
AsnMet: 2.29 ± 0.604
AsnAsn: 4.051 ± 1.216
AsnPro: 2.466 ± 1.176
AsnGln: 1.937 ± 0.588
AsnArg: 4.227 ± 1.082
AsnSer: 6.516 ± 1.148
AsnThr: 3.522 ± 0.907
AsnVal: 3.698 ± 0.603
AsnTrp: 0.881 ± 0.31
AsnTyr: 3.522 ± 0.969
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.761 ± 0.372
ProCys: 0.352 ± 0.238
ProAsp: 1.233 ± 0.258
ProGlu: 1.937 ± 0.417
ProPhe: 1.233 ± 0.321
ProGly: 1.409 ± 0.456
ProHis: 0.704 ± 0.418
ProIle: 2.29 ± 0.532
ProLys: 2.642 ± 0.825
ProLeu: 3.17 ± 0.913
ProMet: 0.881 ± 0.339
ProAsn: 3.17 ± 0.795
ProPro: 1.233 ± 0.452
ProGln: 2.113 ± 0.342
ProArg: 1.057 ± 0.345
ProSer: 3.17 ± 0.733
ProThr: 2.29 ± 0.78
ProVal: 3.346 ± 0.711
ProTrp: 0.352 ± 0.217
ProTyr: 2.113 ± 0.539
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.704 ± 0.419
GlnCys: 0.704 ± 0.475
GlnAsp: 1.409 ± 0.341
GlnGlu: 1.937 ± 0.61
GlnPhe: 3.346 ± 0.648
GlnGly: 1.233 ± 0.322
GlnHis: 1.409 ± 0.319
GlnIle: 3.698 ± 0.831
GlnLys: 2.466 ± 0.715
GlnLeu: 5.284 ± 0.609
GlnMet: 0.704 ± 0.302
GlnAsn: 2.466 ± 0.518
GlnPro: 2.466 ± 0.366
GlnGln: 1.761 ± 0.521
GlnArg: 2.642 ± 0.745
GlnSer: 1.409 ± 0.368
GlnThr: 1.761 ± 0.287
GlnVal: 2.642 ± 0.509
GlnTrp: 0.176 ± 0.127
GlnTyr: 1.057 ± 0.46
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.994 ± 0.698
ArgCys: 0.881 ± 0.282
ArgAsp: 4.051 ± 0.907
ArgGlu: 3.522 ± 0.842
ArgPhe: 2.466 ± 0.855
ArgGly: 1.233 ± 0.311
ArgHis: 0.352 ± 0.247
ArgIle: 3.698 ± 0.691
ArgLys: 3.17 ± 0.692
ArgLeu: 3.522 ± 0.623
ArgMet: 2.818 ± 0.631
ArgAsn: 2.994 ± 0.614
ArgPro: 1.937 ± 0.525
ArgGln: 2.818 ± 0.51
ArgArg: 2.642 ± 0.981
ArgSer: 3.346 ± 0.536
ArgThr: 4.051 ± 0.725
ArgVal: 2.113 ± 0.531
ArgTrp: 0.176 ± 0.166
ArgTyr: 1.233 ± 0.352
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.755 ± 0.965
SerCys: 0.704 ± 0.238
SerAsp: 5.636 ± 1.241
SerGlu: 3.875 ± 0.746
SerPhe: 4.051 ± 0.648
SerGly: 2.642 ± 0.378
SerHis: 1.233 ± 0.437
SerIle: 7.221 ± 1.148
SerLys: 4.931 ± 0.689
SerLeu: 8.454 ± 1.06
SerMet: 0.881 ± 0.479
SerAsn: 3.346 ± 0.751
SerPro: 3.522 ± 0.724
SerGln: 3.875 ± 0.955
SerArg: 3.346 ± 0.608
SerSer: 5.46 ± 0.56
SerThr: 5.988 ± 1.041
SerVal: 6.164 ± 1.179
SerTrp: 0.881 ± 0.443
SerTyr: 2.466 ± 1.114
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.937 ± 0.593
ThrCys: 0.352 ± 0.247
ThrAsp: 4.051 ± 0.601
ThrGlu: 2.642 ± 1.0
ThrPhe: 2.994 ± 0.906
ThrGly: 2.29 ± 0.76
ThrHis: 1.233 ± 0.345
ThrIle: 7.045 ± 0.971
ThrLys: 5.284 ± 0.862
ThrLeu: 4.403 ± 1.075
ThrMet: 2.466 ± 0.395
ThrAsn: 3.522 ± 0.659
ThrPro: 2.466 ± 0.593
ThrGln: 1.937 ± 0.446
ThrArg: 2.818 ± 0.848
ThrSer: 5.988 ± 1.014
ThrThr: 4.051 ± 0.666
ThrVal: 4.051 ± 0.572
ThrTrp: 0.528 ± 0.418
ThrTyr: 2.818 ± 0.668
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.403 ± 0.603
ValCys: 1.057 ± 0.268
ValAsp: 4.051 ± 0.773
ValGlu: 2.642 ± 0.493
ValPhe: 3.522 ± 0.782
ValGly: 2.113 ± 0.797
ValHis: 1.057 ± 0.366
ValIle: 3.346 ± 1.254
ValLys: 3.522 ± 0.75
ValLeu: 5.46 ± 0.993
ValMet: 1.761 ± 0.437
ValAsn: 3.875 ± 0.695
ValPro: 2.994 ± 0.623
ValGln: 1.585 ± 0.597
ValArg: 4.051 ± 0.677
ValSer: 4.227 ± 0.648
ValThr: 3.17 ± 0.974
ValVal: 3.17 ± 1.027
ValTrp: 0.528 ± 0.303
ValTyr: 2.466 ± 0.401
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.528 ± 0.212
TrpCys: 0.0 ± 0.0
TrpAsp: 0.176 ± 0.127
TrpGlu: 0.352 ± 0.202
TrpPhe: 0.0 ± 0.0
TrpGly: 0.176 ± 0.2
TrpHis: 0.0 ± 0.0
TrpIle: 0.528 ± 0.273
TrpLys: 0.881 ± 0.343
TrpLeu: 1.233 ± 0.316
TrpMet: 0.176 ± 0.2
TrpAsn: 0.528 ± 0.231
TrpPro: 0.176 ± 0.16
TrpGln: 0.176 ± 0.127
TrpArg: 0.704 ± 0.351
TrpSer: 0.528 ± 0.19
TrpThr: 0.528 ± 0.353
TrpVal: 0.704 ± 0.414
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.352 ± 0.243
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.113 ± 0.495
TyrCys: 0.176 ± 0.146
TyrAsp: 2.818 ± 0.524
TyrGlu: 2.466 ± 0.813
TyrPhe: 1.233 ± 0.433
TyrGly: 2.466 ± 0.504
TyrHis: 1.409 ± 0.294
TyrIle: 2.994 ± 0.867
TyrLys: 2.113 ± 0.8
TyrLeu: 2.113 ± 0.589
TyrMet: 1.233 ± 0.359
TyrAsn: 4.403 ± 0.84
TyrPro: 1.233 ± 0.425
TyrGln: 1.233 ± 0.543
TyrArg: 1.409 ± 0.458
TyrSer: 3.875 ± 0.887
TyrThr: 2.113 ± 0.66
TyrVal: 2.29 ± 0.468
TyrTrp: 0.352 ± 0.209
TyrTyr: 1.585 ± 0.366
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (5679 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
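A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual code: it assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each of the 100 resamples, and that the reported ± value is the standard deviation across resamples. The names `pair_permille` and `bootstrap_sd` and the toy sequences (one-letter codes) are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_sd(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy proteome, not the 11 Rotavirus X proteins.
toy = ["MKILLA", "MKIA", "MSLLKIA"]
mean = pair_permille(toy, "KI")
error = bootstrap_sd(toy, "KI")
print(f"KI: {mean:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.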

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski