Amino acid dipepetide frequency for Beihai sesarmid crab virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.598AlaAla: 4.598 ± 2.145
1.533AlaCys: 1.533 ± 0.779
1.916AlaAsp: 1.916 ± 0.707
3.448AlaGlu: 3.448 ± 0.072
1.916AlaPhe: 1.916 ± 0.414
3.448AlaGly: 3.448 ± 0.633
2.682AlaHis: 2.682 ± 0.243
3.831AlaIle: 3.831 ± 1.414
4.598AlaLys: 4.598 ± 0.464
4.981AlaLeu: 4.981 ± 0.269
2.682AlaMet: 2.682 ± 0.804
2.299AlaAsn: 2.299 ± 1.072
7.28AlaPro: 7.28 ± 3.022
2.682AlaGln: 2.682 ± 0.317
3.448AlaArg: 3.448 ± 0.633
5.747AlaSer: 5.747 ± 3.802
4.981AlaThr: 4.981 ± 0.291
5.747AlaVal: 5.747 ± 1.802
1.533AlaTrp: 1.533 ± 0.341
1.533AlaTyr: 1.533 ± 0.341
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.149CysCys: 1.149 ± 0.024
0.766CysAsp: 0.766 ± 0.39
2.299CysGlu: 2.299 ± 1.169
0.766CysPhe: 0.766 ± 0.39
1.916CysGly: 1.916 ± 0.707
0.0CysHis: 0.0 ± 0.0
1.149CysIle: 1.149 ± 0.585
0.766CysLys: 0.766 ± 0.39
0.383CysLeu: 0.383 ± 0.195
0.383CysMet: 0.383 ± 0.195
1.149CysAsn: 1.149 ± 0.024
1.149CysPro: 1.149 ± 0.585
0.0CysGln: 0.0 ± 0.0
0.383CysArg: 0.383 ± 0.366
1.149CysSer: 1.149 ± 0.024
1.533CysThr: 1.533 ± 0.341
1.916CysVal: 1.916 ± 0.147
0.766CysTrp: 0.766 ± 0.171
0.383CysTyr: 0.383 ± 0.195
0.0CysXaa: 0.0 ± 0.0
Asp
3.448AspAla: 3.448 ± 0.072
1.533AspCys: 1.533 ± 0.219
8.046AspAsp: 8.046 ± 1.29
5.364AspGlu: 5.364 ± 1.607
5.747AspPhe: 5.747 ± 0.44
3.448AspGly: 3.448 ± 0.488
0.0AspHis: 0.0 ± 0.0
2.682AspIle: 2.682 ± 0.804
1.533AspLys: 1.533 ± 0.779
3.831AspLeu: 3.831 ± 0.267
1.916AspMet: 1.916 ± 0.414
2.299AspAsn: 2.299 ± 1.072
2.682AspPro: 2.682 ± 1.438
2.299AspGln: 2.299 ± 0.609
1.533AspArg: 1.533 ± 0.779
4.981AspSer: 4.981 ± 1.39
3.065AspThr: 3.065 ± 1.243
4.981AspVal: 4.981 ± 0.269
0.383AspTrp: 0.383 ± 0.195
3.448AspTyr: 3.448 ± 0.633
0.0AspXaa: 0.0 ± 0.0
Glu
3.065GluAla: 3.065 ± 0.438
1.149GluCys: 1.149 ± 0.585
2.299GluAsp: 2.299 ± 0.512
2.682GluGlu: 2.682 ± 0.878
1.149GluPhe: 1.149 ± 0.024
1.533GluGly: 1.533 ± 0.219
0.383GluHis: 0.383 ± 0.195
3.448GluIle: 3.448 ± 1.754
2.682GluLys: 2.682 ± 0.804
5.747GluLeu: 5.747 ± 1.0
4.215GluMet: 4.215 ± 1.492
2.682GluAsn: 2.682 ± 0.243
4.215GluPro: 4.215 ± 0.462
1.916GluGln: 1.916 ± 0.974
2.299GluArg: 2.299 ± 1.169
3.831GluSer: 3.831 ± 0.293
3.448GluThr: 3.448 ± 1.193
4.215GluVal: 4.215 ± 1.219
1.149GluTrp: 1.149 ± 0.585
1.533GluTyr: 1.533 ± 0.219
0.0GluXaa: 0.0 ± 0.0
Phe
3.448PheAla: 3.448 ± 1.048
1.149PheCys: 1.149 ± 0.024
3.065PheAsp: 3.065 ± 0.438
3.065PheGlu: 3.065 ± 0.683
3.065PhePhe: 3.065 ± 0.438
2.682PheGly: 2.682 ± 0.317
1.533PheHis: 1.533 ± 0.219
2.299PheIle: 2.299 ± 1.169
4.215PheLys: 4.215 ± 0.462
3.831PheLeu: 3.831 ± 0.293
0.383PheMet: 0.383 ± 0.195
3.065PheAsn: 3.065 ± 0.683
1.533PhePro: 1.533 ± 0.779
2.682PheGln: 2.682 ± 0.317
1.533PheArg: 1.533 ± 0.219
3.065PheSer: 3.065 ± 0.122
2.682PheThr: 2.682 ± 0.878
3.065PheVal: 3.065 ± 0.683
0.766PheTrp: 0.766 ± 0.171
1.916PheTyr: 1.916 ± 1.267
0.0PheXaa: 0.0 ± 0.0
Gly
4.598GlyAla: 4.598 ± 1.585
1.149GlyCys: 1.149 ± 0.024
2.299GlyAsp: 2.299 ± 1.072
2.299GlyGlu: 2.299 ± 0.048
3.448GlyPhe: 3.448 ± 0.633
2.682GlyGly: 2.682 ± 0.317
1.149GlyHis: 1.149 ± 0.585
2.299GlyIle: 2.299 ± 0.609
3.831GlyLys: 3.831 ± 0.828
2.299GlyLeu: 2.299 ± 0.609
1.149GlyMet: 1.149 ± 0.024
3.448GlyAsn: 3.448 ± 0.072
2.682GlyPro: 2.682 ± 1.438
0.383GlyGln: 0.383 ± 0.366
4.215GlyArg: 4.215 ± 0.462
5.364GlySer: 5.364 ± 0.486
4.981GlyThr: 4.981 ± 1.95
7.663GlyVal: 7.663 ± 1.655
1.533GlyTrp: 1.533 ± 0.341
2.682GlyTyr: 2.682 ± 0.878
0.0GlyXaa: 0.0 ± 0.0
His
2.682HisAla: 2.682 ± 0.243
0.766HisCys: 0.766 ± 0.39
1.149HisAsp: 1.149 ± 0.024
1.916HisGlu: 1.916 ± 0.147
1.149HisPhe: 1.149 ± 0.024
1.533HisGly: 1.533 ± 0.779
0.766HisHis: 0.766 ± 0.39
1.533HisIle: 1.533 ± 0.779
0.383HisLys: 0.383 ± 0.195
3.065HisLeu: 3.065 ± 0.683
0.0HisMet: 0.0 ± 0.0
0.766HisAsn: 0.766 ± 0.171
2.299HisPro: 2.299 ± 0.609
0.383HisGln: 0.383 ± 0.195
1.149HisArg: 1.149 ± 0.024
1.533HisSer: 1.533 ± 0.219
1.149HisThr: 1.149 ± 0.585
1.916HisVal: 1.916 ± 0.147
0.766HisTrp: 0.766 ± 0.39
0.383HisTyr: 0.383 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
4.598IleAla: 4.598 ± 1.024
0.766IleCys: 0.766 ± 0.731
5.747IleAsp: 5.747 ± 0.681
2.299IleGlu: 2.299 ± 0.048
1.916IlePhe: 1.916 ± 0.414
4.981IleGly: 4.981 ± 0.852
2.299IleHis: 2.299 ± 0.048
1.149IleIle: 1.149 ± 0.585
1.533IleLys: 1.533 ± 0.779
4.598IleLeu: 4.598 ± 1.217
1.149IleMet: 1.149 ± 0.585
2.682IleAsn: 2.682 ± 1.438
2.682IlePro: 2.682 ± 0.243
0.383IleGln: 0.383 ± 0.195
2.299IleArg: 2.299 ± 0.512
4.215IleSer: 4.215 ± 0.462
2.299IleThr: 2.299 ± 0.609
3.448IleVal: 3.448 ± 0.072
0.383IleTrp: 0.383 ± 0.195
0.766IleTyr: 0.766 ± 0.39
0.0IleXaa: 0.0 ± 0.0
Lys
4.981LysAla: 4.981 ± 0.852
0.383LysCys: 0.383 ± 0.195
3.448LysAsp: 3.448 ± 1.193
2.299LysGlu: 2.299 ± 0.609
1.916LysPhe: 1.916 ± 0.707
3.065LysGly: 3.065 ± 0.122
1.149LysHis: 1.149 ± 0.585
1.533LysIle: 1.533 ± 0.779
4.598LysLys: 4.598 ± 2.338
5.364LysLeu: 5.364 ± 0.486
1.533LysMet: 1.533 ± 0.219
1.533LysAsn: 1.533 ± 0.219
3.065LysPro: 3.065 ± 0.438
2.682LysGln: 2.682 ± 0.804
3.831LysArg: 3.831 ± 1.388
5.364LysSer: 5.364 ± 1.607
3.065LysThr: 3.065 ± 0.998
3.831LysVal: 3.831 ± 0.828
0.766LysTrp: 0.766 ± 0.39
3.831LysTyr: 3.831 ± 0.293
0.0LysXaa: 0.0 ± 0.0
Leu
6.513LeuAla: 6.513 ± 0.51
1.149LeuCys: 1.149 ± 0.024
4.598LeuAsp: 4.598 ± 0.657
2.682LeuGlu: 2.682 ± 0.804
2.299LeuPhe: 2.299 ± 1.633
2.682LeuGly: 2.682 ± 0.804
1.916LeuHis: 1.916 ± 0.147
3.448LeuIle: 3.448 ± 0.488
4.981LeuLys: 4.981 ± 1.412
6.13LeuLeu: 6.13 ± 0.876
1.916LeuMet: 1.916 ± 0.414
2.299LeuAsn: 2.299 ± 0.512
3.448LeuPro: 3.448 ± 0.488
1.533LeuGln: 1.533 ± 0.219
4.598LeuArg: 4.598 ± 0.464
4.981LeuSer: 4.981 ± 1.39
9.962LeuThr: 9.962 ± 2.779
6.513LeuVal: 6.513 ± 1.071
0.766LeuTrp: 0.766 ± 0.171
2.682LeuTyr: 2.682 ± 0.317
0.0LeuXaa: 0.0 ± 0.0
Met
1.916MetAla: 1.916 ± 0.147
0.0MetCys: 0.0 ± 0.0
2.682MetAsp: 2.682 ± 0.317
1.916MetGlu: 1.916 ± 0.974
1.916MetPhe: 1.916 ± 0.414
3.448MetGly: 3.448 ± 0.633
0.766MetHis: 0.766 ± 0.171
0.766MetIle: 0.766 ± 0.39
2.299MetLys: 2.299 ± 1.169
2.299MetLeu: 2.299 ± 0.609
1.916MetMet: 1.916 ± 0.414
1.916MetAsn: 1.916 ± 0.414
1.149MetPro: 1.149 ± 0.024
1.533MetGln: 1.533 ± 0.341
1.533MetArg: 1.533 ± 0.219
3.065MetSer: 3.065 ± 0.998
3.065MetThr: 3.065 ± 0.998
1.149MetVal: 1.149 ± 0.024
0.0MetTrp: 0.0 ± 0.0
0.766MetTyr: 0.766 ± 0.171
0.0MetXaa: 0.0 ± 0.0
Asn
1.533AsnAla: 1.533 ± 0.902
1.149AsnCys: 1.149 ± 0.585
1.533AsnAsp: 1.533 ± 0.902
3.065AsnGlu: 3.065 ± 0.122
1.916AsnPhe: 1.916 ± 0.707
4.598AsnGly: 4.598 ± 1.024
0.0AsnHis: 0.0 ± 0.0
2.299AsnIle: 2.299 ± 0.048
2.299AsnLys: 2.299 ± 0.048
2.299AsnLeu: 2.299 ± 0.048
2.682AsnMet: 2.682 ± 0.243
1.533AsnAsn: 1.533 ± 0.219
3.831AsnPro: 3.831 ± 1.414
1.149AsnGln: 1.149 ± 0.585
0.383AsnArg: 0.383 ± 0.366
5.747AsnSer: 5.747 ± 0.44
3.448AsnThr: 3.448 ± 0.633
3.448AsnVal: 3.448 ± 1.609
0.766AsnTrp: 0.766 ± 0.171
0.766AsnTyr: 0.766 ± 0.171
0.0AsnXaa: 0.0 ± 0.0
Pro
3.831ProAla: 3.831 ± 1.974
1.149ProCys: 1.149 ± 0.536
4.215ProAsp: 4.215 ± 0.098
3.448ProGlu: 3.448 ± 0.072
3.448ProPhe: 3.448 ± 2.169
3.831ProGly: 3.831 ± 0.853
2.682ProHis: 2.682 ± 0.804
3.448ProIle: 3.448 ± 0.633
2.299ProLys: 2.299 ± 0.048
3.448ProLeu: 3.448 ± 0.488
0.766ProMet: 0.766 ± 0.38
3.831ProAsn: 3.831 ± 0.828
3.065ProPro: 3.065 ± 0.683
1.533ProGln: 1.533 ± 0.902
1.916ProArg: 1.916 ± 0.414
3.448ProSer: 3.448 ± 0.488
4.215ProThr: 4.215 ± 0.659
1.916ProVal: 1.916 ± 0.707
0.383ProTrp: 0.383 ± 0.366
3.065ProTyr: 3.065 ± 1.803
0.0ProXaa: 0.0 ± 0.0
Gln
4.981GlnAla: 4.981 ± 0.829
0.766GlnCys: 0.766 ± 0.171
1.916GlnAsp: 1.916 ± 0.147
2.299GlnGlu: 2.299 ± 0.048
1.533GlnPhe: 1.533 ± 0.219
0.766GlnGly: 0.766 ± 0.171
1.533GlnHis: 1.533 ± 0.341
1.533GlnIle: 1.533 ± 0.341
3.065GlnLys: 3.065 ± 0.998
0.766GlnLeu: 0.766 ± 0.39
1.533GlnMet: 1.533 ± 0.219
1.149GlnAsn: 1.149 ± 0.536
0.766GlnPro: 0.766 ± 0.171
1.533GlnGln: 1.533 ± 0.219
2.682GlnArg: 2.682 ± 0.804
1.149GlnSer: 1.149 ± 0.024
0.766GlnThr: 0.766 ± 0.171
0.383GlnVal: 0.383 ± 0.195
0.0GlnTrp: 0.0 ± 0.0
0.383GlnTyr: 0.383 ± 0.195
0.0GlnXaa: 0.0 ± 0.0
Arg
2.682ArgAla: 2.682 ± 0.317
0.383ArgCys: 0.383 ± 0.195
2.299ArgAsp: 2.299 ± 1.169
2.682ArgGlu: 2.682 ± 0.804
4.598ArgPhe: 4.598 ± 0.464
1.916ArgGly: 1.916 ± 0.707
1.916ArgHis: 1.916 ± 0.414
2.682ArgIle: 2.682 ± 0.243
3.065ArgLys: 3.065 ± 1.559
4.981ArgLeu: 4.981 ± 0.269
1.916ArgMet: 1.916 ± 0.414
1.533ArgAsn: 1.533 ± 0.779
4.215ArgPro: 4.215 ± 1.219
1.149ArgGln: 1.149 ± 0.585
2.299ArgArg: 2.299 ± 1.169
4.981ArgSer: 4.981 ± 0.291
1.149ArgThr: 1.149 ± 0.024
2.682ArgVal: 2.682 ± 0.243
0.383ArgTrp: 0.383 ± 0.366
1.916ArgTyr: 1.916 ± 0.707
0.0ArgXaa: 0.0 ± 0.0
Ser
4.981SerAla: 4.981 ± 0.291
1.916SerCys: 1.916 ± 0.414
4.981SerAsp: 4.981 ± 0.269
2.299SerGlu: 2.299 ± 0.048
2.682SerPhe: 2.682 ± 1.438
3.831SerGly: 3.831 ± 0.828
1.533SerHis: 1.533 ± 0.219
6.13SerIle: 6.13 ± 0.805
5.364SerLys: 5.364 ± 0.074
6.897SerLeu: 6.897 ± 2.097
2.682SerMet: 2.682 ± 0.243
4.598SerAsn: 4.598 ± 0.464
4.981SerPro: 4.981 ± 0.269
2.299SerGln: 2.299 ± 0.512
3.448SerArg: 3.448 ± 1.048
4.598SerSer: 4.598 ± 1.778
4.981SerThr: 4.981 ± 0.829
5.747SerVal: 5.747 ± 0.44
1.533SerTrp: 1.533 ± 0.779
2.682SerTyr: 2.682 ± 0.243
0.0SerXaa: 0.0 ± 0.0
Thr
4.215ThrAla: 4.215 ± 0.098
0.0ThrCys: 0.0 ± 0.0
2.682ThrAsp: 2.682 ± 0.317
3.831ThrGlu: 3.831 ± 0.267
3.065ThrPhe: 3.065 ± 0.122
4.215ThrGly: 4.215 ± 0.462
2.299ThrHis: 2.299 ± 0.048
5.364ThrIle: 5.364 ± 1.755
4.215ThrLys: 4.215 ± 1.583
6.13ThrLeu: 6.13 ± 0.316
1.916ThrMet: 1.916 ± 0.414
2.682ThrAsn: 2.682 ± 0.878
4.215ThrPro: 4.215 ± 1.779
1.916ThrGln: 1.916 ± 0.147
3.448ThrArg: 3.448 ± 0.072
6.13ThrSer: 6.13 ± 0.245
2.299ThrThr: 2.299 ± 1.072
2.299ThrVal: 2.299 ± 0.512
0.383ThrTrp: 0.383 ± 0.366
1.916ThrTyr: 1.916 ± 0.147
0.0ThrXaa: 0.0 ± 0.0
Val
4.215ValAla: 4.215 ± 0.659
1.533ValCys: 1.533 ± 0.779
5.747ValAsp: 5.747 ± 0.121
3.448ValGlu: 3.448 ± 1.193
3.448ValPhe: 3.448 ± 0.633
5.364ValGly: 5.364 ± 0.634
2.299ValHis: 2.299 ± 0.609
2.299ValIle: 2.299 ± 0.048
3.831ValLys: 3.831 ± 0.293
4.598ValLeu: 4.598 ± 0.464
2.299ValMet: 2.299 ± 0.609
1.916ValAsn: 1.916 ± 0.707
1.916ValPro: 1.916 ± 1.267
2.682ValGln: 2.682 ± 0.317
4.215ValArg: 4.215 ± 1.023
4.598ValSer: 4.598 ± 1.024
3.831ValThr: 3.831 ± 0.267
6.897ValVal: 6.897 ± 0.145
1.149ValTrp: 1.149 ± 0.585
4.981ValTyr: 4.981 ± 0.829
0.0ValXaa: 0.0 ± 0.0
Trp
1.916TrpAla: 1.916 ± 0.414
0.0TrpCys: 0.0 ± 0.0
0.766TrpAsp: 0.766 ± 0.171
0.766TrpGlu: 0.766 ± 0.171
1.149TrpPhe: 1.149 ± 0.585
1.149TrpGly: 1.149 ± 0.024
0.0TrpHis: 0.0 ± 0.0
0.383TrpIle: 0.383 ± 0.195
2.299TrpLys: 2.299 ± 0.048
0.766TrpLeu: 0.766 ± 0.171
0.383TrpMet: 0.383 ± 0.195
0.766TrpAsn: 0.766 ± 0.171
0.0TrpPro: 0.0 ± 0.0
0.383TrpGln: 0.383 ± 0.366
1.916TrpArg: 1.916 ± 0.707
0.766TrpSer: 0.766 ± 0.39
0.383TrpThr: 0.383 ± 0.195
0.383TrpVal: 0.383 ± 0.195
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.682TyrAla: 2.682 ± 0.317
0.383TyrCys: 0.383 ± 0.366
3.065TyrAsp: 3.065 ± 0.122
1.533TyrGlu: 1.533 ± 0.779
1.916TyrPhe: 1.916 ± 0.974
3.065TyrGly: 3.065 ± 1.803
0.383TyrHis: 0.383 ± 0.195
1.916TyrIle: 1.916 ± 0.147
0.383TyrLys: 0.383 ± 0.195
2.682TyrLeu: 2.682 ± 0.878
2.299TyrMet: 2.299 ± 0.512
2.299TyrAsn: 2.299 ± 0.512
1.149TyrPro: 1.149 ± 0.585
0.383TyrGln: 0.383 ± 0.366
2.299TyrArg: 2.299 ± 1.072
3.448TyrSer: 3.448 ± 0.488
1.916TyrThr: 1.916 ± 0.147
3.065TyrVal: 3.065 ± 0.683
0.766TyrTrp: 0.766 ± 0.171
0.383TyrTyr: 0.383 ± 0.195
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2611 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski