Amino acid dipepetide frequency for Sena Madureira virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.815AlaAla: 0.815 ± 0.359
0.543AlaCys: 0.543 ± 0.305
0.815AlaAsp: 0.815 ± 0.477
1.358AlaGlu: 1.358 ± 0.665
0.815AlaPhe: 0.815 ± 0.419
2.173AlaGly: 2.173 ± 0.498
0.815AlaHis: 0.815 ± 0.311
2.716AlaIle: 2.716 ± 1.334
1.358AlaLys: 1.358 ± 0.591
2.716AlaLeu: 2.716 ± 1.175
0.815AlaMet: 0.815 ± 0.311
2.988AlaAsn: 2.988 ± 2.344
0.543AlaPro: 0.543 ± 0.51
1.358AlaGln: 1.358 ± 0.681
2.173AlaArg: 2.173 ± 0.593
2.173AlaSer: 2.173 ± 0.606
1.358AlaThr: 1.358 ± 0.497
0.815AlaVal: 0.815 ± 0.75
0.272AlaTrp: 0.272 ± 0.391
1.63AlaTyr: 1.63 ± 0.966
0.0AlaXaa: 0.0 ± 0.0
Cys
0.815CysAla: 0.815 ± 0.477
0.543CysCys: 0.543 ± 0.318
1.358CysAsp: 1.358 ± 0.773
1.086CysGlu: 1.086 ± 0.44
1.086CysPhe: 1.086 ± 0.535
0.815CysGly: 0.815 ± 0.477
0.272CysHis: 0.272 ± 0.375
1.901CysIle: 1.901 ± 0.785
2.716CysLys: 2.716 ± 1.19
0.543CysLeu: 0.543 ± 0.338
0.272CysMet: 0.272 ± 0.159
1.63CysAsn: 1.63 ± 0.703
0.543CysPro: 0.543 ± 0.749
1.358CysGln: 1.358 ± 0.497
0.815CysArg: 0.815 ± 0.477
0.815CysSer: 0.815 ± 0.477
0.543CysThr: 0.543 ± 0.305
1.086CysVal: 1.086 ± 0.388
0.815CysTrp: 0.815 ± 0.371
1.086CysTyr: 1.086 ± 0.342
0.0CysXaa: 0.0 ± 0.0
Asp
1.086AspAla: 1.086 ± 0.503
1.086AspCys: 1.086 ± 0.342
3.531AspAsp: 3.531 ± 1.053
3.259AspGlu: 3.259 ± 0.926
3.802AspPhe: 3.802 ± 1.046
1.63AspGly: 1.63 ± 0.552
2.173AspHis: 2.173 ± 0.384
4.345AspIle: 4.345 ± 0.658
4.889AspLys: 4.889 ± 1.583
8.963AspLeu: 8.963 ± 1.788
2.444AspMet: 2.444 ± 0.935
2.716AspAsn: 2.716 ± 1.335
3.802AspPro: 3.802 ± 1.015
2.173AspGln: 2.173 ± 0.449
1.358AspArg: 1.358 ± 0.591
3.259AspSer: 3.259 ± 2.427
2.173AspThr: 2.173 ± 0.682
4.074AspVal: 4.074 ± 0.919
1.901AspTrp: 1.901 ± 0.666
2.444AspTyr: 2.444 ± 1.112
0.0AspXaa: 0.0 ± 0.0
Glu
0.815GluAla: 0.815 ± 0.836
1.358GluCys: 1.358 ± 0.594
5.703GluAsp: 5.703 ± 1.933
6.79GluGlu: 6.79 ± 1.405
3.531GluPhe: 3.531 ± 1.744
4.345GluGly: 4.345 ± 1.619
0.815GluHis: 0.815 ± 0.419
8.691GluIle: 8.691 ± 1.172
4.617GluLys: 4.617 ± 0.891
4.617GluLeu: 4.617 ± 0.554
1.901GluMet: 1.901 ± 0.785
6.247GluAsn: 6.247 ± 2.166
1.086GluPro: 1.086 ± 0.68
1.63GluGln: 1.63 ± 0.577
2.988GluArg: 2.988 ± 0.426
4.345GluSer: 4.345 ± 0.535
0.815GluThr: 0.815 ± 0.365
3.802GluVal: 3.802 ± 1.154
0.815GluTrp: 0.815 ± 0.543
2.444GluTyr: 2.444 ± 0.728
0.0GluXaa: 0.0 ± 0.0
Phe
1.086PheAla: 1.086 ± 0.738
0.815PheCys: 0.815 ± 0.477
2.988PheAsp: 2.988 ± 1.573
4.074PheGlu: 4.074 ± 1.674
3.531PhePhe: 3.531 ± 1.194
3.259PheGly: 3.259 ± 0.965
0.543PheHis: 0.543 ± 0.318
5.975PheIle: 5.975 ± 1.473
2.716PheLys: 2.716 ± 0.456
4.345PheLeu: 4.345 ± 1.226
0.543PheMet: 0.543 ± 0.404
2.716PheAsn: 2.716 ± 0.423
2.444PhePro: 2.444 ± 0.477
0.815PheGln: 0.815 ± 0.365
2.173PheArg: 2.173 ± 0.671
1.63PheSer: 1.63 ± 1.1
2.173PheThr: 2.173 ± 1.121
1.63PheVal: 1.63 ± 0.552
1.086PheTrp: 1.086 ± 0.611
1.901PheTyr: 1.901 ± 0.578
0.0PheXaa: 0.0 ± 0.0
Gly
1.901GlyAla: 1.901 ± 0.959
1.358GlyCys: 1.358 ± 0.505
3.259GlyAsp: 3.259 ± 0.631
4.074GlyGlu: 4.074 ± 0.617
4.345GlyPhe: 4.345 ± 0.87
3.259GlyGly: 3.259 ± 0.825
0.815GlyHis: 0.815 ± 0.359
3.259GlyIle: 3.259 ± 1.078
4.074GlyLys: 4.074 ± 1.303
6.518GlyLeu: 6.518 ± 0.621
1.086GlyMet: 1.086 ± 0.637
3.259GlyAsn: 3.259 ± 0.408
0.0GlyPro: 0.0 ± 0.0
3.259GlyGln: 3.259 ± 1.755
2.716GlyArg: 2.716 ± 0.939
3.531GlySer: 3.531 ± 1.676
4.074GlyThr: 4.074 ± 2.363
3.531GlyVal: 3.531 ± 1.854
1.358GlyTrp: 1.358 ± 0.329
2.173GlyTyr: 2.173 ± 0.591
0.0GlyXaa: 0.0 ± 0.0
His
0.543HisAla: 0.543 ± 0.305
0.0HisCys: 0.0 ± 0.0
0.272HisAsp: 0.272 ± 0.391
0.815HisGlu: 0.815 ± 0.311
1.63HisPhe: 1.63 ± 0.688
0.272HisGly: 0.272 ± 0.159
0.543HisHis: 0.543 ± 0.318
2.716HisIle: 2.716 ± 0.72
2.444HisLys: 2.444 ± 0.405
2.716HisLeu: 2.716 ± 0.78
0.272HisMet: 0.272 ± 0.159
1.63HisAsn: 1.63 ± 0.537
1.63HisPro: 1.63 ± 0.622
0.272HisGln: 0.272 ± 0.375
0.543HisArg: 0.543 ± 0.318
1.086HisSer: 1.086 ± 0.911
1.086HisThr: 1.086 ± 0.388
0.272HisVal: 0.272 ± 0.159
0.272HisTrp: 0.272 ± 0.375
0.815HisTyr: 0.815 ± 0.456
0.0HisXaa: 0.0 ± 0.0
Ile
2.173IleAla: 2.173 ± 0.823
1.358IleCys: 1.358 ± 0.505
6.247IleAsp: 6.247 ± 1.044
5.16IleGlu: 5.16 ± 1.397
2.444IlePhe: 2.444 ± 0.907
5.975IleGly: 5.975 ± 0.854
1.086IleHis: 1.086 ± 0.85
7.605IleIle: 7.605 ± 1.503
6.518IleLys: 6.518 ± 1.365
10.32IleLeu: 10.32 ± 1.23
2.444IleMet: 2.444 ± 0.887
6.518IleAsn: 6.518 ± 1.914
4.074IlePro: 4.074 ± 0.498
2.988IleGln: 2.988 ± 0.807
5.16IleArg: 5.16 ± 1.013
8.148IleSer: 8.148 ± 1.636
4.345IleThr: 4.345 ± 1.406
5.16IleVal: 5.16 ± 0.743
3.531IleTrp: 3.531 ± 0.604
3.259IleTyr: 3.259 ± 1.591
0.0IleXaa: 0.0 ± 0.0
Lys
1.901LysAla: 1.901 ± 0.868
1.358LysCys: 1.358 ± 0.555
4.345LysAsp: 4.345 ± 0.778
5.703LysGlu: 5.703 ± 0.897
3.259LysPhe: 3.259 ± 0.416
5.703LysGly: 5.703 ± 1.952
0.815LysHis: 0.815 ± 0.311
8.963LysIle: 8.963 ± 2.887
4.074LysLys: 4.074 ± 0.847
5.975LysLeu: 5.975 ± 1.13
2.988LysMet: 2.988 ± 0.636
3.802LysAsn: 3.802 ± 0.753
2.444LysPro: 2.444 ± 0.655
1.63LysGln: 1.63 ± 0.508
3.259LysArg: 3.259 ± 1.399
5.16LysSer: 5.16 ± 0.734
6.518LysThr: 6.518 ± 0.939
2.173LysVal: 2.173 ± 0.618
1.63LysTrp: 1.63 ± 0.838
2.716LysTyr: 2.716 ± 0.631
0.0LysXaa: 0.0 ± 0.0
Leu
3.259LeuAla: 3.259 ± 1.226
1.63LeuCys: 1.63 ± 0.727
4.074LeuAsp: 4.074 ± 0.498
4.889LeuGlu: 4.889 ± 0.519
4.617LeuPhe: 4.617 ± 0.756
6.79LeuGly: 6.79 ± 1.788
2.716LeuHis: 2.716 ± 0.939
7.333LeuIle: 7.333 ± 1.554
9.234LeuLys: 9.234 ± 2.502
6.518LeuLeu: 6.518 ± 1.447
3.802LeuMet: 3.802 ± 0.886
6.79LeuAsn: 6.79 ± 1.12
4.074LeuPro: 4.074 ± 1.149
1.901LeuGln: 1.901 ± 0.816
5.432LeuArg: 5.432 ± 1.133
8.419LeuSer: 8.419 ± 1.45
5.703LeuThr: 5.703 ± 1.626
2.988LeuVal: 2.988 ± 1.063
0.272LeuTrp: 0.272 ± 0.375
2.988LeuTyr: 2.988 ± 1.438
0.0LeuXaa: 0.0 ± 0.0
Met
1.358MetAla: 1.358 ± 0.447
0.272MetCys: 0.272 ± 0.159
1.358MetAsp: 1.358 ± 0.788
1.358MetGlu: 1.358 ± 0.762
1.63MetPhe: 1.63 ± 0.703
1.086MetGly: 1.086 ± 0.44
0.0MetHis: 0.0 ± 0.0
2.716MetIle: 2.716 ± 1.591
1.901MetLys: 1.901 ± 0.685
1.901MetLeu: 1.901 ± 0.846
1.358MetMet: 1.358 ± 0.621
1.901MetAsn: 1.901 ± 0.785
0.543MetPro: 0.543 ± 0.34
1.086MetGln: 1.086 ± 0.611
0.815MetArg: 0.815 ± 0.665
2.716MetSer: 2.716 ± 0.683
0.815MetThr: 0.815 ± 0.477
1.358MetVal: 1.358 ± 0.48
0.0MetTrp: 0.0 ± 0.0
1.086MetTyr: 1.086 ± 0.44
0.0MetXaa: 0.0 ± 0.0
Asn
2.173AsnAla: 2.173 ± 0.778
1.358AsnCys: 1.358 ± 0.796
5.16AsnAsp: 5.16 ± 1.815
3.802AsnGlu: 3.802 ± 0.805
2.444AsnPhe: 2.444 ± 0.641
2.173AsnGly: 2.173 ± 1.45
2.444AsnHis: 2.444 ± 0.919
5.703AsnIle: 5.703 ± 1.816
4.889AsnLys: 4.889 ± 1.595
4.345AsnLeu: 4.345 ± 1.259
0.543AsnMet: 0.543 ± 0.436
4.617AsnAsn: 4.617 ± 1.444
2.988AsnPro: 2.988 ± 0.427
2.173AsnGln: 2.173 ± 0.528
3.531AsnArg: 3.531 ± 1.5
5.16AsnSer: 5.16 ± 0.725
4.074AsnThr: 4.074 ± 0.706
2.716AsnVal: 2.716 ± 0.79
1.358AsnTrp: 1.358 ± 0.505
1.63AsnTyr: 1.63 ± 0.717
0.0AsnXaa: 0.0 ± 0.0
Pro
0.272ProAla: 0.272 ± 0.159
0.0ProCys: 0.0 ± 0.0
2.716ProAsp: 2.716 ± 0.641
2.716ProGlu: 2.716 ± 1.309
0.543ProPhe: 0.543 ± 0.642
1.086ProGly: 1.086 ± 0.719
0.815ProHis: 0.815 ± 0.477
4.617ProIle: 4.617 ± 0.409
3.259ProLys: 3.259 ± 0.879
3.259ProLeu: 3.259 ± 0.684
0.272ProMet: 0.272 ± 0.159
0.543ProAsn: 0.543 ± 0.318
1.901ProPro: 1.901 ± 0.7
1.358ProGln: 1.358 ± 1.07
1.086ProArg: 1.086 ± 0.637
4.617ProSer: 4.617 ± 1.66
3.259ProThr: 3.259 ± 1.099
3.259ProVal: 3.259 ± 1.778
0.543ProTrp: 0.543 ± 0.318
1.086ProTyr: 1.086 ± 1.068
0.0ProXaa: 0.0 ± 0.0
Gln
0.272GlnAla: 0.272 ± 0.159
0.543GlnCys: 0.543 ± 0.749
2.716GlnAsp: 2.716 ± 0.756
1.086GlnGlu: 1.086 ± 0.548
1.358GlnPhe: 1.358 ± 0.429
1.63GlnGly: 1.63 ± 0.719
0.543GlnHis: 0.543 ± 0.318
3.259GlnIle: 3.259 ± 1.191
1.901GlnLys: 1.901 ± 0.487
3.259GlnLeu: 3.259 ± 1.788
0.815GlnMet: 0.815 ± 0.709
1.63GlnAsn: 1.63 ± 0.537
0.543GlnPro: 0.543 ± 0.305
0.543GlnGln: 0.543 ± 0.72
0.815GlnArg: 0.815 ± 0.477
4.617GlnSer: 4.617 ± 0.559
1.086GlnThr: 1.086 ± 0.68
1.086GlnVal: 1.086 ± 0.45
0.543GlnTrp: 0.543 ± 0.642
1.63GlnTyr: 1.63 ± 0.967
0.0GlnXaa: 0.0 ± 0.0
Arg
0.815ArgAla: 0.815 ± 0.477
1.901ArgCys: 1.901 ± 0.646
3.259ArgAsp: 3.259 ± 0.756
3.259ArgGlu: 3.259 ± 1.224
1.086ArgPhe: 1.086 ± 0.478
4.074ArgGly: 4.074 ± 1.133
1.086ArgHis: 1.086 ± 0.5
2.988ArgIle: 2.988 ± 0.396
2.716ArgLys: 2.716 ± 1.223
2.444ArgLeu: 2.444 ± 0.816
1.63ArgMet: 1.63 ± 0.539
2.716ArgAsn: 2.716 ± 0.483
1.358ArgPro: 1.358 ± 0.555
1.358ArgGln: 1.358 ± 0.535
2.444ArgArg: 2.444 ± 0.953
2.444ArgSer: 2.444 ± 0.672
3.802ArgThr: 3.802 ± 1.058
2.173ArgVal: 2.173 ± 0.543
1.086ArgTrp: 1.086 ± 0.68
2.444ArgTyr: 2.444 ± 0.816
0.0ArgXaa: 0.0 ± 0.0
Ser
4.617SerAla: 4.617 ± 0.798
2.444SerCys: 2.444 ± 0.69
5.432SerAsp: 5.432 ± 1.084
6.247SerGlu: 6.247 ± 1.066
3.531SerPhe: 3.531 ± 0.949
3.802SerGly: 3.802 ± 1.179
2.173SerHis: 2.173 ± 0.974
6.79SerIle: 6.79 ± 1.474
4.345SerLys: 4.345 ± 1.316
8.419SerLeu: 8.419 ± 1.573
1.086SerMet: 1.086 ± 0.472
3.259SerAsn: 3.259 ± 0.921
2.444SerPro: 2.444 ± 0.536
2.173SerGln: 2.173 ± 0.972
2.716SerArg: 2.716 ± 1.073
6.247SerSer: 6.247 ± 1.956
3.259SerThr: 3.259 ± 1.17
2.988SerVal: 2.988 ± 1.634
1.086SerTrp: 1.086 ± 0.5
3.531SerTyr: 3.531 ± 1.351
0.0SerXaa: 0.0 ± 0.0
Thr
1.901ThrAla: 1.901 ± 1.179
1.086ThrCys: 1.086 ± 0.45
3.259ThrAsp: 3.259 ± 0.647
4.889ThrGlu: 4.889 ± 1.219
2.173ThrPhe: 2.173 ± 0.791
3.802ThrGly: 3.802 ± 0.972
0.543ThrHis: 0.543 ± 0.318
5.432ThrIle: 5.432 ± 1.294
3.802ThrLys: 3.802 ± 0.863
4.617ThrLeu: 4.617 ± 1.146
1.358ThrMet: 1.358 ± 0.555
2.716ThrAsn: 2.716 ± 0.607
1.358ThrPro: 1.358 ± 1.116
1.086ThrGln: 1.086 ± 0.638
1.086ThrArg: 1.086 ± 0.44
3.802ThrSer: 3.802 ± 0.863
3.531ThrThr: 3.531 ± 0.793
3.531ThrVal: 3.531 ± 0.623
0.543ThrTrp: 0.543 ± 0.34
2.988ThrTyr: 2.988 ± 0.809
0.0ThrXaa: 0.0 ± 0.0
Val
1.901ValAla: 1.901 ± 0.548
1.086ValCys: 1.086 ± 0.388
2.444ValAsp: 2.444 ± 1.621
1.63ValGlu: 1.63 ± 0.508
1.358ValPhe: 1.358 ± 0.738
1.63ValGly: 1.63 ± 0.831
0.543ValHis: 0.543 ± 0.318
4.617ValIle: 4.617 ± 1.125
3.802ValLys: 3.802 ± 0.892
5.703ValLeu: 5.703 ± 1.192
0.272ValMet: 0.272 ± 0.159
5.432ValAsn: 5.432 ± 1.18
2.173ValPro: 2.173 ± 0.569
1.086ValGln: 1.086 ± 0.61
3.259ValArg: 3.259 ± 1.327
3.802ValSer: 3.802 ± 0.893
2.173ValThr: 2.173 ± 0.747
1.63ValVal: 1.63 ± 0.388
0.543ValTrp: 0.543 ± 0.34
2.173ValTyr: 2.173 ± 0.375
0.0ValXaa: 0.0 ± 0.0
Trp
0.272TrpAla: 0.272 ± 0.375
0.0TrpCys: 0.0 ± 0.0
0.543TrpAsp: 0.543 ± 0.662
1.358TrpGlu: 1.358 ± 0.488
0.543TrpPhe: 0.543 ± 0.305
1.358TrpGly: 1.358 ± 0.595
0.543TrpHis: 0.543 ± 0.305
2.173TrpIle: 2.173 ± 0.891
1.63TrpLys: 1.63 ± 0.388
2.444TrpLeu: 2.444 ± 1.009
0.0TrpMet: 0.0 ± 0.0
0.543TrpAsn: 0.543 ± 0.305
0.815TrpPro: 0.815 ± 0.365
0.272TrpGln: 0.272 ± 0.159
0.543TrpArg: 0.543 ± 0.305
0.543TrpSer: 0.543 ± 0.305
1.358TrpThr: 1.358 ± 0.621
1.63TrpVal: 1.63 ± 0.838
0.0TrpTrp: 0.0 ± 0.0
1.358TrpTyr: 1.358 ± 0.329
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.272TyrAla: 0.272 ± 0.375
1.358TyrCys: 1.358 ± 0.569
1.63TyrAsp: 1.63 ± 0.703
3.802TyrGlu: 3.802 ± 1.602
2.716TyrPhe: 2.716 ± 0.499
2.988TyrGly: 2.988 ± 0.812
0.543TyrHis: 0.543 ± 0.318
2.716TyrIle: 2.716 ± 0.79
3.531TyrLys: 3.531 ± 0.956
4.074TyrLeu: 4.074 ± 1.12
1.086TyrMet: 1.086 ± 0.637
1.63TyrAsn: 1.63 ± 0.727
2.444TyrPro: 2.444 ± 0.477
1.358TyrGln: 1.358 ± 0.478
2.173TyrArg: 2.173 ± 0.936
4.074TyrSer: 4.074 ± 1.909
1.358TyrThr: 1.358 ± 0.842
1.358TyrVal: 1.358 ± 0.685
0.272TyrTrp: 0.272 ± 0.375
1.358TyrTyr: 1.358 ± 1.071
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3683 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski