Amino acid dipepetide frequency for Rotavirus I

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.037AlaAla: 4.037 ± 1.1
0.176AlaCys: 0.176 ± 0.146
2.457AlaAsp: 2.457 ± 0.708
4.915AlaGlu: 4.915 ± 0.96
2.282AlaPhe: 2.282 ± 0.935
1.404AlaGly: 1.404 ± 0.382
0.527AlaHis: 0.527 ± 0.235
4.037AlaIle: 4.037 ± 0.696
2.808AlaLys: 2.808 ± 1.042
4.915AlaLeu: 4.915 ± 1.121
1.931AlaMet: 1.931 ± 0.495
3.335AlaAsn: 3.335 ± 0.6
1.404AlaPro: 1.404 ± 0.907
2.984AlaGln: 2.984 ± 0.748
2.457AlaArg: 2.457 ± 0.383
4.915AlaSer: 4.915 ± 1.088
3.686AlaThr: 3.686 ± 0.994
3.686AlaVal: 3.686 ± 0.585
0.351AlaTrp: 0.351 ± 0.181
2.106AlaTyr: 2.106 ± 0.854
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.755CysAsp: 1.755 ± 0.497
0.527CysGlu: 0.527 ± 0.338
0.176CysPhe: 0.176 ± 0.21
1.053CysGly: 1.053 ± 0.625
0.0CysHis: 0.0 ± 0.0
1.229CysIle: 1.229 ± 0.618
1.229CysLys: 1.229 ± 0.659
1.053CysLeu: 1.053 ± 0.413
0.351CysMet: 0.351 ± 0.232
0.702CysAsn: 0.702 ± 0.322
0.0CysPro: 0.0 ± 0.0
0.351CysGln: 0.351 ± 0.295
0.176CysArg: 0.176 ± 0.184
0.878CysSer: 0.878 ± 0.521
0.878CysThr: 0.878 ± 0.606
0.878CysVal: 0.878 ± 0.403
0.0CysTrp: 0.0 ± 0.0
0.878CysTyr: 0.878 ± 0.44
0.0CysXaa: 0.0 ± 0.0
Asp
2.984AspAla: 2.984 ± 0.936
0.878AspCys: 0.878 ± 0.487
6.67AspAsp: 6.67 ± 1.034
3.335AspGlu: 3.335 ± 0.75
2.808AspPhe: 2.808 ± 0.64
1.931AspGly: 1.931 ± 0.593
1.229AspHis: 1.229 ± 0.52
7.197AspIle: 7.197 ± 1.177
3.862AspLys: 3.862 ± 0.642
5.09AspLeu: 5.09 ± 0.834
1.755AspMet: 1.755 ± 0.589
4.915AspAsn: 4.915 ± 0.795
3.335AspPro: 3.335 ± 0.874
3.16AspGln: 3.16 ± 0.81
2.984AspArg: 2.984 ± 0.652
2.984AspSer: 2.984 ± 0.68
3.335AspThr: 3.335 ± 0.856
4.388AspVal: 4.388 ± 0.614
0.702AspTrp: 0.702 ± 0.187
1.931AspTyr: 1.931 ± 0.407
0.0AspXaa: 0.0 ± 0.0
Glu
2.808GluAla: 2.808 ± 0.595
1.58GluCys: 1.58 ± 0.63
4.213GluAsp: 4.213 ± 1.004
4.564GluGlu: 4.564 ± 1.06
1.58GluPhe: 1.58 ± 0.447
2.457GluGly: 2.457 ± 0.5
0.702GluHis: 0.702 ± 0.4
5.266GluIle: 5.266 ± 0.75
6.144GluLys: 6.144 ± 0.847
3.16GluLeu: 3.16 ± 0.553
1.58GluMet: 1.58 ± 0.516
4.037GluAsn: 4.037 ± 1.022
1.404GluPro: 1.404 ± 0.571
2.457GluGln: 2.457 ± 1.002
2.282GluArg: 2.282 ± 0.783
3.335GluSer: 3.335 ± 0.624
3.686GluThr: 3.686 ± 0.703
2.984GluVal: 2.984 ± 0.439
0.527GluTrp: 0.527 ± 0.315
2.282GluTyr: 2.282 ± 0.665
0.0GluXaa: 0.0 ± 0.0
Phe
2.106PheAla: 2.106 ± 0.532
0.702PheCys: 0.702 ± 0.292
2.633PheAsp: 2.633 ± 0.417
2.282PheGlu: 2.282 ± 0.462
2.808PhePhe: 2.808 ± 0.675
1.404PheGly: 1.404 ± 0.425
0.878PheHis: 0.878 ± 0.318
4.037PheIle: 4.037 ± 0.45
3.335PheLys: 3.335 ± 1.122
3.335PheLeu: 3.335 ± 0.685
0.878PheMet: 0.878 ± 0.289
4.213PheAsn: 4.213 ± 0.817
1.58PhePro: 1.58 ± 0.45
1.229PheGln: 1.229 ± 0.505
2.984PheArg: 2.984 ± 0.563
3.511PheSer: 3.511 ± 0.63
4.388PheThr: 4.388 ± 0.99
2.457PheVal: 2.457 ± 0.585
0.527PheTrp: 0.527 ± 0.309
0.527PheTyr: 0.527 ± 0.249
0.0PheXaa: 0.0 ± 0.0
Gly
2.633GlyAla: 2.633 ± 0.637
0.176GlyCys: 0.176 ± 0.21
2.984GlyAsp: 2.984 ± 0.616
0.878GlyGlu: 0.878 ± 0.677
2.633GlyPhe: 2.633 ± 0.663
1.755GlyGly: 1.755 ± 0.529
1.404GlyHis: 1.404 ± 0.491
4.915GlyIle: 4.915 ± 0.433
3.335GlyLys: 3.335 ± 0.869
2.808GlyLeu: 2.808 ± 0.508
0.878GlyMet: 0.878 ± 0.358
3.335GlyAsn: 3.335 ± 0.816
1.931GlyPro: 1.931 ± 0.554
1.755GlyGln: 1.755 ± 0.525
2.106GlyArg: 2.106 ± 0.592
2.106GlySer: 2.106 ± 0.634
2.282GlyThr: 2.282 ± 0.55
1.58GlyVal: 1.58 ± 0.386
0.702GlyTrp: 0.702 ± 0.412
2.106GlyTyr: 2.106 ± 0.496
0.0GlyXaa: 0.0 ± 0.0
His
0.878HisAla: 0.878 ± 0.282
1.053HisCys: 1.053 ± 0.489
1.229HisAsp: 1.229 ± 0.527
1.229HisGlu: 1.229 ± 0.374
1.404HisPhe: 1.404 ± 0.256
1.053HisGly: 1.053 ± 0.421
0.0HisHis: 0.0 ± 0.0
1.404HisIle: 1.404 ± 0.338
0.702HisLys: 0.702 ± 0.352
1.053HisLeu: 1.053 ± 0.49
0.527HisMet: 0.527 ± 0.21
1.053HisAsn: 1.053 ± 0.304
0.351HisPro: 0.351 ± 0.261
1.053HisGln: 1.053 ± 0.316
0.878HisArg: 0.878 ± 0.529
2.106HisSer: 2.106 ± 0.47
2.282HisThr: 2.282 ± 0.671
0.702HisVal: 0.702 ± 0.363
0.0HisTrp: 0.0 ± 0.0
0.878HisTyr: 0.878 ± 0.527
0.0HisXaa: 0.0 ± 0.0
Ile
5.09IleAla: 5.09 ± 0.724
1.053IleCys: 1.053 ± 0.333
7.372IleAsp: 7.372 ± 1.16
4.739IleGlu: 4.739 ± 1.073
2.808IlePhe: 2.808 ± 1.004
4.213IleGly: 4.213 ± 0.821
2.633IleHis: 2.633 ± 0.578
6.319IleIle: 6.319 ± 1.446
4.739IleLys: 4.739 ± 0.789
6.495IleLeu: 6.495 ± 0.772
1.755IleMet: 1.755 ± 0.596
6.144IleAsn: 6.144 ± 1.082
4.564IlePro: 4.564 ± 0.793
3.16IleGln: 3.16 ± 0.873
4.564IleArg: 4.564 ± 0.688
7.372IleSer: 7.372 ± 1.116
4.564IleThr: 4.564 ± 0.93
3.511IleVal: 3.511 ± 0.802
0.176IleTrp: 0.176 ± 0.184
2.984IleTyr: 2.984 ± 0.892
0.0IleXaa: 0.0 ± 0.0
Lys
3.686LysAla: 3.686 ± 0.726
1.229LysCys: 1.229 ± 0.671
3.862LysAsp: 3.862 ± 0.794
2.984LysGlu: 2.984 ± 0.532
3.511LysPhe: 3.511 ± 0.664
2.808LysGly: 2.808 ± 0.811
0.527LysHis: 0.527 ± 0.36
5.441LysIle: 5.441 ± 0.775
5.266LysLys: 5.266 ± 1.195
7.548LysLeu: 7.548 ± 1.861
2.282LysMet: 2.282 ± 0.782
6.495LysAsn: 6.495 ± 0.945
1.58LysPro: 1.58 ± 0.401
2.633LysGln: 2.633 ± 0.827
1.931LysArg: 1.931 ± 0.508
4.213LysSer: 4.213 ± 1.011
3.511LysThr: 3.511 ± 0.762
2.808LysVal: 2.808 ± 0.712
1.404LysTrp: 1.404 ± 0.44
3.335LysTyr: 3.335 ± 0.749
0.0LysXaa: 0.0 ± 0.0
Leu
5.09LeuAla: 5.09 ± 0.905
0.878LeuCys: 0.878 ± 0.404
3.335LeuAsp: 3.335 ± 0.636
5.793LeuGlu: 5.793 ± 0.96
4.739LeuPhe: 4.739 ± 1.055
2.633LeuGly: 2.633 ± 0.507
2.106LeuHis: 2.106 ± 0.696
5.968LeuIle: 5.968 ± 1.165
5.968LeuLys: 5.968 ± 1.177
8.25LeuLeu: 8.25 ± 0.811
3.16LeuMet: 3.16 ± 0.612
5.617LeuAsn: 5.617 ± 1.669
3.511LeuPro: 3.511 ± 0.736
3.686LeuGln: 3.686 ± 1.133
3.862LeuArg: 3.862 ± 0.91
8.074LeuSer: 8.074 ± 1.326
5.793LeuThr: 5.793 ± 0.889
4.037LeuVal: 4.037 ± 0.929
0.176LeuTrp: 0.176 ± 0.141
2.633LeuTyr: 2.633 ± 0.75
0.0LeuXaa: 0.0 ± 0.0
Met
1.755MetAla: 1.755 ± 0.722
0.527MetCys: 0.527 ± 0.278
1.58MetAsp: 1.58 ± 0.418
0.878MetGlu: 0.878 ± 0.424
1.58MetPhe: 1.58 ± 0.566
0.702MetGly: 0.702 ± 0.275
0.702MetHis: 0.702 ± 0.452
2.808MetIle: 2.808 ± 0.793
1.404MetLys: 1.404 ± 0.546
2.808MetLeu: 2.808 ± 0.394
0.527MetMet: 0.527 ± 0.204
1.931MetAsn: 1.931 ± 0.426
0.878MetPro: 0.878 ± 0.275
1.229MetGln: 1.229 ± 0.468
1.931MetArg: 1.931 ± 0.591
2.808MetSer: 2.808 ± 0.914
1.755MetThr: 1.755 ± 0.556
1.404MetVal: 1.404 ± 0.255
0.176MetTrp: 0.176 ± 0.21
1.229MetTyr: 1.229 ± 0.535
0.0MetXaa: 0.0 ± 0.0
Asn
4.037AsnAla: 4.037 ± 1.131
0.351AsnCys: 0.351 ± 0.283
4.037AsnAsp: 4.037 ± 0.566
4.037AsnGlu: 4.037 ± 0.98
2.808AsnPhe: 2.808 ± 0.685
3.16AsnGly: 3.16 ± 0.544
0.878AsnHis: 0.878 ± 0.348
6.144AsnIle: 6.144 ± 0.85
3.686AsnLys: 3.686 ± 1.163
5.09AsnLeu: 5.09 ± 1.043
1.931AsnMet: 1.931 ± 0.329
3.686AsnAsn: 3.686 ± 0.792
3.335AsnPro: 3.335 ± 0.856
2.633AsnGln: 2.633 ± 0.672
2.808AsnArg: 2.808 ± 0.394
5.793AsnSer: 5.793 ± 0.85
4.564AsnThr: 4.564 ± 1.092
6.67AsnVal: 6.67 ± 0.882
0.702AsnTrp: 0.702 ± 0.407
2.457AsnTyr: 2.457 ± 0.737
0.0AsnXaa: 0.0 ± 0.0
Pro
1.755ProAla: 1.755 ± 0.358
0.527ProCys: 0.527 ± 0.289
1.053ProAsp: 1.053 ± 0.486
2.457ProGlu: 2.457 ± 0.668
1.58ProPhe: 1.58 ± 0.333
2.106ProGly: 2.106 ± 0.607
0.878ProHis: 0.878 ± 0.459
3.335ProIle: 3.335 ± 0.628
1.404ProLys: 1.404 ± 0.642
2.633ProLeu: 2.633 ± 0.852
0.702ProMet: 0.702 ± 0.446
2.457ProAsn: 2.457 ± 1.169
0.527ProPro: 0.527 ± 0.316
1.58ProGln: 1.58 ± 0.374
0.878ProArg: 0.878 ± 0.304
3.862ProSer: 3.862 ± 1.052
3.862ProThr: 3.862 ± 1.348
2.457ProVal: 2.457 ± 0.558
1.229ProTrp: 1.229 ± 0.36
1.755ProTyr: 1.755 ± 0.624
0.0ProXaa: 0.0 ± 0.0
Gln
2.106GlnAla: 2.106 ± 0.625
0.176GlnCys: 0.176 ± 0.219
1.58GlnAsp: 1.58 ± 0.558
2.457GlnGlu: 2.457 ± 0.7
1.755GlnPhe: 1.755 ± 0.422
0.878GlnGly: 0.878 ± 0.379
1.755GlnHis: 1.755 ± 0.387
3.511GlnIle: 3.511 ± 0.628
3.335GlnLys: 3.335 ± 0.984
4.388GlnLeu: 4.388 ± 0.982
0.878GlnMet: 0.878 ± 0.353
2.457GlnAsn: 2.457 ± 0.71
2.106GlnPro: 2.106 ± 0.47
1.58GlnGln: 1.58 ± 0.581
2.984GlnArg: 2.984 ± 0.556
2.106GlnSer: 2.106 ± 0.444
2.106GlnThr: 2.106 ± 0.402
1.053GlnVal: 1.053 ± 0.409
0.702GlnTrp: 0.702 ± 0.355
1.053GlnTyr: 1.053 ± 0.283
0.0GlnXaa: 0.0 ± 0.0
Arg
1.931ArgAla: 1.931 ± 0.827
0.0ArgCys: 0.0 ± 0.0
3.862ArgAsp: 3.862 ± 0.797
3.511ArgGlu: 3.511 ± 0.812
1.229ArgPhe: 1.229 ± 0.408
3.16ArgGly: 3.16 ± 0.684
0.176ArgHis: 0.176 ± 0.146
3.862ArgIle: 3.862 ± 0.935
4.388ArgLys: 4.388 ± 0.784
3.511ArgLeu: 3.511 ± 0.73
1.58ArgMet: 1.58 ± 0.772
3.335ArgAsn: 3.335 ± 0.824
1.229ArgPro: 1.229 ± 0.376
1.931ArgGln: 1.931 ± 0.604
2.808ArgArg: 2.808 ± 1.162
2.984ArgSer: 2.984 ± 0.542
3.16ArgThr: 3.16 ± 0.74
2.808ArgVal: 2.808 ± 0.81
0.176ArgTrp: 0.176 ± 0.16
1.931ArgTyr: 1.931 ± 0.45
0.0ArgXaa: 0.0 ± 0.0
Ser
4.915SerAla: 4.915 ± 0.867
0.527SerCys: 0.527 ± 0.318
5.09SerAsp: 5.09 ± 0.855
3.862SerGlu: 3.862 ± 0.918
3.335SerPhe: 3.335 ± 0.61
5.266SerGly: 5.266 ± 1.139
1.404SerHis: 1.404 ± 0.63
4.564SerIle: 4.564 ± 0.866
4.739SerLys: 4.739 ± 0.873
8.074SerLeu: 8.074 ± 1.333
2.106SerMet: 2.106 ± 0.527
5.968SerAsn: 5.968 ± 0.965
2.808SerPro: 2.808 ± 0.874
2.633SerGln: 2.633 ± 0.64
3.511SerArg: 3.511 ± 0.687
6.846SerSer: 6.846 ± 1.453
4.915SerThr: 4.915 ± 1.215
4.213SerVal: 4.213 ± 0.662
0.176SerTrp: 0.176 ± 0.16
1.931SerTyr: 1.931 ± 0.399
0.0SerXaa: 0.0 ± 0.0
Thr
2.808ThrAla: 2.808 ± 0.651
0.878ThrCys: 0.878 ± 0.424
4.915ThrAsp: 4.915 ± 0.488
2.808ThrGlu: 2.808 ± 0.572
4.213ThrPhe: 4.213 ± 0.673
2.457ThrGly: 2.457 ± 0.807
1.229ThrHis: 1.229 ± 0.492
6.319ThrIle: 6.319 ± 1.224
4.037ThrLys: 4.037 ± 0.963
6.144ThrLeu: 6.144 ± 0.86
2.282ThrMet: 2.282 ± 0.66
3.335ThrAsn: 3.335 ± 0.679
2.106ThrPro: 2.106 ± 0.654
1.755ThrGln: 1.755 ± 0.635
2.808ThrArg: 2.808 ± 0.931
4.915ThrSer: 4.915 ± 0.685
5.09ThrThr: 5.09 ± 0.987
4.915ThrVal: 4.915 ± 0.806
0.527ThrTrp: 0.527 ± 0.469
3.686ThrTyr: 3.686 ± 0.927
0.0ThrXaa: 0.0 ± 0.0
Val
3.511ValAla: 3.511 ± 0.77
1.053ValCys: 1.053 ± 0.561
3.511ValAsp: 3.511 ± 0.854
3.862ValGlu: 3.862 ± 0.583
2.457ValPhe: 2.457 ± 0.692
1.229ValGly: 1.229 ± 0.338
1.404ValHis: 1.404 ± 0.6
3.686ValIle: 3.686 ± 0.471
4.213ValLys: 4.213 ± 0.782
5.09ValLeu: 5.09 ± 0.961
0.878ValMet: 0.878 ± 0.296
2.633ValAsn: 2.633 ± 0.473
2.457ValPro: 2.457 ± 0.59
2.106ValGln: 2.106 ± 0.572
3.335ValArg: 3.335 ± 0.681
5.266ValSer: 5.266 ± 0.809
3.511ValThr: 3.511 ± 0.879
2.106ValVal: 2.106 ± 0.533
0.176ValTrp: 0.176 ± 0.21
2.282ValTyr: 2.282 ± 0.63
0.0ValXaa: 0.0 ± 0.0
Trp
0.351TrpAla: 0.351 ± 0.292
0.176TrpCys: 0.176 ± 0.184
0.527TrpAsp: 0.527 ± 0.352
0.0TrpGlu: 0.0 ± 0.0
0.878TrpPhe: 0.878 ± 0.494
0.351TrpGly: 0.351 ± 0.419
0.176TrpHis: 0.176 ± 0.16
0.351TrpIle: 0.351 ± 0.217
1.053TrpLys: 1.053 ± 0.277
1.229TrpLeu: 1.229 ± 0.262
0.527TrpMet: 0.527 ± 0.379
0.702TrpAsn: 0.702 ± 0.319
0.176TrpPro: 0.176 ± 0.21
0.351TrpGln: 0.351 ± 0.217
0.527TrpArg: 0.527 ± 0.333
0.351TrpSer: 0.351 ± 0.222
0.702TrpThr: 0.702 ± 0.386
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.176TrpTyr: 0.176 ± 0.184
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.931TyrAla: 1.931 ± 0.686
0.176TyrCys: 0.176 ± 0.146
2.808TyrAsp: 2.808 ± 0.8
1.931TyrGlu: 1.931 ± 0.75
1.229TyrPhe: 1.229 ± 0.38
2.282TyrGly: 2.282 ± 0.541
1.229TyrHis: 1.229 ± 0.541
3.862TyrIle: 3.862 ± 1.175
1.404TyrLys: 1.404 ± 0.445
2.633TyrLeu: 2.633 ± 0.719
1.931TyrMet: 1.931 ± 0.48
2.633TyrAsn: 2.633 ± 0.454
1.755TyrPro: 1.755 ± 0.714
0.702TyrGln: 0.702 ± 0.453
1.755TyrArg: 1.755 ± 0.607
2.457TyrSer: 2.457 ± 0.641
3.16TyrThr: 3.16 ± 0.942
2.106TyrVal: 2.106 ± 0.658
0.176TyrTrp: 0.176 ± 0.148
1.755TyrTyr: 1.755 ± 0.493
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (5698 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski