Amino acid dipeptide frequency for Fusobacterium mortiferum ATCC 9817

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and underrepresented ones are marked in blue in the table.
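The comparison against the 0.25% baseline can be illustrated with a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries) and that "expected" means the uniform 1/400 baseline described above; the database's actual procedure may differ. The names `dipeptide_permille`, `representation`, and the toy sequences are hypothetical and serve only as illustration.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids (one-letter codes)
UNIFORM_PERMILLE = 1000 / 400           # 0.25% expressed in per mille = 2.5

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and any non-standard residue is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}

def representation(freq_permille, expected=UNIFORM_PERMILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

# Toy input (hypothetical sequences, not taken from the proteome)
toy_proteins = ["MKKLEE", "MEKL"]
for pair, freq in sorted(dipeptide_permille(toy_proteins).items()):
    print(pair, round(freq, 3), representation(freq))
```

Applied to the values in the table, `representation(3.392)` (AlaAla) would classify as "over" and `representation(0.642)` (AlaCys) as "under" relative to 2.5‰.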

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.392 ± 0.09 | 0.642 ± 0.033 | 2.49 ± 0.065 | 3.662 ± 0.077 | 2.523 ± 0.063 | 4.301 ± 0.096 | 0.867 ± 0.03 | 5.438 ± 0.092 | 4.807 ± 0.096 | 5.462 ± 0.098 | 1.717 ± 0.047 | 2.386 ± 0.049 | 1.507 ± 0.042 | 1.617 ± 0.049 | 1.891 ± 0.046 | 2.785 ± 0.061 | 2.904 ± 0.064 | 3.716 ± 0.089 | 0.395 ± 0.021 | 2.056 ± 0.059 | 0.0 ± 0.0
Cys | 0.483 ± 0.028 | 0.125 ± 0.013 | 0.502 ± 0.026 | 0.698 ± 0.036 | 0.372 ± 0.02 | 1.077 ± 0.038 | 0.175 ± 0.016 | 0.792 ± 0.033 | 0.737 ± 0.034 | 0.716 ± 0.032 | 0.209 ± 0.016 | 0.474 ± 0.026 | 0.43 ± 0.027 | 0.211 ± 0.016 | 0.331 ± 0.022 | 0.603 ± 0.029 | 0.506 ± 0.026 | 0.565 ± 0.027 | 0.058 ± 0.008 | 0.343 ± 0.021 | 0.0 ± 0.0
Asp | 2.126 ± 0.061 | 0.437 ± 0.023 | 2.25 ± 0.066 | 4.68 ± 0.083 | 2.809 ± 0.071 | 3.43 ± 0.077 | 0.43 ± 0.025 | 6.157 ± 0.09 | 5.132 ± 0.085 | 4.323 ± 0.085 | 1.56 ± 0.04 | 2.785 ± 0.055 | 1.091 ± 0.041 | 0.507 ± 0.026 | 2.197 ± 0.061 | 2.892 ± 0.065 | 2.468 ± 0.055 | 2.997 ± 0.067 | 0.435 ± 0.022 | 2.565 ± 0.059 | 0.0 ± 0.0
Glu | 4.205 ± 0.081 | 0.591 ± 0.029 | 4.084 ± 0.079 | 8.39 ± 0.161 | 3.5 ± 0.079 | 4.364 ± 0.077 | 0.914 ± 0.033 | 8.773 ± 0.12 | 10.472 ± 0.161 | 7.805 ± 0.119 | 2.147 ± 0.051 | 6.194 ± 0.103 | 1.261 ± 0.038 | 1.672 ± 0.054 | 3.419 ± 0.08 | 3.311 ± 0.069 | 3.151 ± 0.06 | 5.245 ± 0.084 | 0.565 ± 0.028 | 3.799 ± 0.076 | 0.0 ± 0.0
Phe | 2.303 ± 0.057 | 0.422 ± 0.024 | 2.58 ± 0.063 | 3.265 ± 0.075 | 2.562 ± 0.066 | 3.326 ± 0.064 | 0.529 ± 0.025 | 4.838 ± 0.103 | 3.865 ± 0.073 | 4.92 ± 0.093 | 1.238 ± 0.045 | 2.536 ± 0.056 | 1.37 ± 0.048 | 1.207 ± 0.04 | 1.447 ± 0.04 | 3.662 ± 0.069 | 2.398 ± 0.059 | 2.573 ± 0.069 | 0.347 ± 0.021 | 2.112 ± 0.056 | 0.0 ± 0.0
Gly | 4.522 ± 0.094 | 0.749 ± 0.032 | 3.338 ± 0.071 | 5.2 ± 0.096 | 3.001 ± 0.076 | 4.601 ± 0.098 | 1.037 ± 0.036 | 7.341 ± 0.127 | 6.518 ± 0.114 | 5.542 ± 0.105 | 1.991 ± 0.048 | 3.494 ± 0.082 | 1.084 ± 0.04 | 1.358 ± 0.045 | 2.339 ± 0.061 | 3.323 ± 0.074 | 3.558 ± 0.076 | 5.617 ± 0.103 | 0.488 ± 0.028 | 3.195 ± 0.071 | 0.0 ± 0.0
His | 0.579 ± 0.028 | 0.173 ± 0.015 | 0.507 ± 0.028 | 0.677 ± 0.032 | 0.609 ± 0.03 | 0.931 ± 0.039 | 0.298 ± 0.019 | 1.247 ± 0.045 | 0.915 ± 0.039 | 1.165 ± 0.041 | 0.34 ± 0.021 | 0.655 ± 0.032 | 0.606 ± 0.027 | 0.347 ± 0.023 | 0.512 ± 0.025 | 0.844 ± 0.032 | 0.684 ± 0.032 | 0.607 ± 0.031 | 0.098 ± 0.01 | 0.544 ± 0.026 | 0.0 ± 0.0
Ile | 5.82 ± 0.095 | 0.968 ± 0.037 | 5.506 ± 0.086 | 8.063 ± 0.127 | 5.09 ± 0.109 | 6.514 ± 0.113 | 1.03 ± 0.039 | 8.682 ± 0.13 | 8.477 ± 0.119 | 9.869 ± 0.168 | 2.201 ± 0.055 | 5.158 ± 0.091 | 3.459 ± 0.07 | 2.017 ± 0.046 | 3.02 ± 0.062 | 6.274 ± 0.088 | 4.929 ± 0.082 | 6.162 ± 0.091 | 0.588 ± 0.03 | 3.765 ± 0.075 | 0.0 ± 0.0
Lys | 4.699 ± 0.093 | 0.651 ± 0.032 | 5.502 ± 0.107 | 10.019 ± 0.133 | 3.848 ± 0.071 | 5.203 ± 0.09 | 0.9 ± 0.034 | 9.631 ± 0.134 | 9.774 ± 0.124 | 8.137 ± 0.12 | 2.594 ± 0.058 | 7.127 ± 0.117 | 1.845 ± 0.051 | 1.737 ± 0.051 | 3.453 ± 0.077 | 4.372 ± 0.079 | 3.962 ± 0.069 | 6.078 ± 0.093 | 0.55 ± 0.023 | 4.771 ± 0.08 | 0.0 ± 0.0
Leu | 5.512 ± 0.111 | 0.813 ± 0.033 | 5.336 ± 0.082 | 8.401 ± 0.125 | 4.066 ± 0.085 | 7.201 ± 0.117 | 1.028 ± 0.036 | 7.561 ± 0.114 | 9.752 ± 0.131 | 8.124 ± 0.135 | 2.19 ± 0.054 | 5.286 ± 0.09 | 2.954 ± 0.058 | 2.19 ± 0.054 | 3.118 ± 0.069 | 5.908 ± 0.086 | 4.721 ± 0.081 | 5.502 ± 0.099 | 0.541 ± 0.028 | 3.296 ± 0.067 | 0.0 ± 0.0
Met | 1.742 ± 0.048 | 0.212 ± 0.019 | 1.172 ± 0.041 | 2.243 ± 0.061 | 0.975 ± 0.041 | 2.018 ± 0.048 | 0.25 ± 0.017 | 2.185 ± 0.051 | 2.901 ± 0.067 | 2.339 ± 0.05 | 0.652 ± 0.03 | 1.441 ± 0.042 | 0.759 ± 0.033 | 0.485 ± 0.025 | 1.032 ± 0.04 | 1.465 ± 0.051 | 1.266 ± 0.038 | 1.599 ± 0.048 | 0.16 ± 0.015 | 0.824 ± 0.034 | 0.0 ± 0.0
Asn | 2.066 ± 0.062 | 0.598 ± 0.029 | 2.328 ± 0.059 | 3.666 ± 0.073 | 3.076 ± 0.073 | 3.835 ± 0.082 | 0.683 ± 0.031 | 6.758 ± 0.107 | 5.101 ± 0.097 | 5.941 ± 0.095 | 1.54 ± 0.047 | 3.563 ± 0.091 | 2.309 ± 0.057 | 1.333 ± 0.05 | 2.318 ± 0.061 | 3.829 ± 0.079 | 2.467 ± 0.056 | 2.761 ± 0.071 | 0.422 ± 0.023 | 2.818 ± 0.076 | 0.0 ± 0.0
Pro | 1.663 ± 0.05 | 0.295 ± 0.02 | 1.303 ± 0.044 | 2.523 ± 0.059 | 1.48 ± 0.047 | 1.552 ± 0.053 | 0.507 ± 0.027 | 2.58 ± 0.066 | 2.294 ± 0.057 | 2.614 ± 0.056 | 0.756 ± 0.033 | 1.614 ± 0.05 | 0.543 ± 0.028 | 0.799 ± 0.027 | 0.831 ± 0.029 | 1.453 ± 0.048 | 1.507 ± 0.048 | 2.066 ± 0.056 | 0.216 ± 0.017 | 1.19 ± 0.041 | 0.0 ± 0.0
Gln | 1.378 ± 0.05 | 0.208 ± 0.019 | 1.071 ± 0.035 | 2.137 ± 0.061 | 0.813 ± 0.034 | 1.614 ± 0.052 | 0.294 ± 0.018 | 2.004 ± 0.054 | 2.044 ± 0.055 | 2.103 ± 0.053 | 0.577 ± 0.03 | 1.42 ± 0.053 | 0.489 ± 0.024 | 0.503 ± 0.031 | 0.901 ± 0.036 | 1.063 ± 0.039 | 0.977 ± 0.035 | 1.387 ± 0.043 | 0.195 ± 0.014 | 0.905 ± 0.036 | 0.0 ± 0.0
Arg | 2.111 ± 0.054 | 0.32 ± 0.025 | 2.104 ± 0.05 | 4.0 ± 0.083 | 1.548 ± 0.042 | 2.341 ± 0.057 | 0.386 ± 0.022 | 3.082 ± 0.059 | 3.559 ± 0.077 | 3.003 ± 0.059 | 0.973 ± 0.033 | 1.84 ± 0.051 | 0.815 ± 0.032 | 0.746 ± 0.03 | 1.45 ± 0.047 | 1.27 ± 0.044 | 1.505 ± 0.043 | 2.872 ± 0.061 | 0.252 ± 0.019 | 1.672 ± 0.051 | 0.0 ± 0.0
Ser | 2.93 ± 0.056 | 0.591 ± 0.027 | 2.592 ± 0.058 | 3.775 ± 0.072 | 3.284 ± 0.078 | 4.164 ± 0.079 | 0.799 ± 0.04 | 5.385 ± 0.085 | 5.085 ± 0.084 | 5.79 ± 0.098 | 1.337 ± 0.038 | 3.006 ± 0.076 | 1.584 ± 0.051 | 1.542 ± 0.047 | 1.966 ± 0.052 | 3.69 ± 0.097 | 2.854 ± 0.062 | 3.296 ± 0.067 | 0.457 ± 0.026 | 2.398 ± 0.059 | 0.0 ± 0.0
Thr | 2.861 ± 0.07 | 0.452 ± 0.025 | 2.336 ± 0.053 | 3.161 ± 0.065 | 2.286 ± 0.056 | 3.765 ± 0.081 | 0.767 ± 0.029 | 4.467 ± 0.084 | 3.754 ± 0.063 | 5.1 ± 0.084 | 1.109 ± 0.041 | 2.347 ± 0.058 | 2.108 ± 0.052 | 1.21 ± 0.037 | 1.581 ± 0.043 | 2.853 ± 0.066 | 2.804 ± 0.067 | 3.046 ± 0.064 | 0.327 ± 0.021 | 1.93 ± 0.055 | 0.0 ± 0.0
Val | 4.048 ± 0.081 | 0.646 ± 0.028 | 3.552 ± 0.073 | 5.59 ± 0.094 | 2.85 ± 0.061 | 4.554 ± 0.09 | 0.751 ± 0.027 | 6.024 ± 0.107 | 5.565 ± 0.091 | 5.653 ± 0.104 | 1.534 ± 0.043 | 3.002 ± 0.058 | 1.982 ± 0.051 | 1.279 ± 0.043 | 2.157 ± 0.051 | 3.545 ± 0.078 | 3.209 ± 0.06 | 4.785 ± 0.098 | 0.408 ± 0.023 | 2.273 ± 0.06 | 0.0 ± 0.0
Trp | 0.356 ± 0.023 | 0.073 ± 0.009 | 0.399 ± 0.024 | 0.625 ± 0.03 | 0.309 ± 0.02 | 0.533 ± 0.027 | 0.105 ± 0.013 | 0.603 ± 0.027 | 0.615 ± 0.029 | 0.683 ± 0.031 | 0.145 ± 0.013 | 0.433 ± 0.027 | 0.13 ± 0.015 | 0.185 ± 0.015 | 0.258 ± 0.019 | 0.366 ± 0.025 | 0.306 ± 0.021 | 0.383 ± 0.025 | 0.095 ± 0.012 | 0.312 ± 0.018 | 0.0 ± 0.0
Tyr | 1.76 ± 0.049 | 0.443 ± 0.025 | 2.319 ± 0.065 | 3.088 ± 0.076 | 2.497 ± 0.062 | 2.972 ± 0.066 | 0.587 ± 0.027 | 4.042 ± 0.082 | 3.486 ± 0.079 | 4.211 ± 0.084 | 0.91 ± 0.036 | 2.651 ± 0.067 | 1.46 ± 0.051 | 1.139 ± 0.043 | 1.628 ± 0.055 | 3.019 ± 0.066 | 2.085 ± 0.054 | 2.153 ± 0.051 | 0.317 ± 0.019 | 2.086 ± 0.065 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 2532 proteins (778824 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
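A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the standard deviation of those replicate estimates is taken as the error. Whether the database reports the standard deviation or another spread measure is not stated in the source, and the names `pair_counts` and `bootstrap_error` as well as the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate a dipeptide frequency (per mille) and its bootstrap error.

    Assumption: the resampling unit is the whole protein, and the error is
    the standard deviation of the replicate estimates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # count each protein once
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter yields 0 if absent
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from the proteome)
toy_proteins = ["MKKLEEKK", "MEKLKK", "MAAKLE", "MKEEL"]
mean_permille, error_permille = bootstrap_error(toy_proteins, "KK")
print(f"KK: {mean_permille:.3f} ± {error_permille:.3f} ‰")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.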

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski