Amino acid dipepetide frequency for Firmicutes bacterium CAG:65

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.757AlaAla: 7.757 ± 0.135
1.124AlaCys: 1.124 ± 0.043
4.892AlaAsp: 4.892 ± 0.088
6.075AlaGlu: 6.075 ± 0.107
2.964AlaPhe: 2.964 ± 0.06
6.412AlaGly: 6.412 ± 0.093
1.086AlaHis: 1.086 ± 0.043
4.959AlaIle: 4.959 ± 0.094
4.569AlaLys: 4.569 ± 0.087
7.137AlaLeu: 7.137 ± 0.103
2.551AlaMet: 2.551 ± 0.059
2.456AlaAsn: 2.456 ± 0.058
2.203AlaPro: 2.203 ± 0.053
2.766AlaGln: 2.766 ± 0.068
2.927AlaArg: 2.927 ± 0.068
3.927AlaSer: 3.927 ± 0.072
3.692AlaThr: 3.692 ± 0.077
6.573AlaVal: 6.573 ± 0.101
0.618AlaTrp: 0.618 ± 0.032
2.962AlaTyr: 2.962 ± 0.067
0.003AlaXaa: 0.003 ± 0.002
Cys
1.048CysAla: 1.048 ± 0.038
0.281CysCys: 0.281 ± 0.021
0.827CysAsp: 0.827 ± 0.029
0.964CysGlu: 0.964 ± 0.031
0.655CysPhe: 0.655 ± 0.029
1.629CysGly: 1.629 ± 0.051
0.346CysHis: 0.346 ± 0.021
1.088CysIle: 1.088 ± 0.036
0.761CysLys: 0.761 ± 0.03
1.246CysLeu: 1.246 ± 0.044
0.511CysMet: 0.511 ± 0.028
0.591CysAsn: 0.591 ± 0.028
0.593CysPro: 0.593 ± 0.031
0.37CysGln: 0.37 ± 0.021
0.711CysArg: 0.711 ± 0.03
0.788CysSer: 0.788 ± 0.032
0.678CysThr: 0.678 ± 0.029
1.138CysVal: 1.138 ± 0.038
0.114CysTrp: 0.114 ± 0.013
0.64CysTyr: 0.64 ± 0.032
0.0CysXaa: 0.0 ± 0.0
Asp
4.594AspAla: 4.594 ± 0.077
0.791AspCys: 0.791 ± 0.033
2.789AspAsp: 2.789 ± 0.074
4.086AspGlu: 4.086 ± 0.077
2.55AspPhe: 2.55 ± 0.057
4.348AspGly: 4.348 ± 0.086
1.013AspHis: 1.013 ± 0.041
4.219AspIle: 4.219 ± 0.084
3.411AspLys: 3.411 ± 0.07
4.608AspLeu: 4.608 ± 0.073
1.916AspMet: 1.916 ± 0.047
2.224AspAsn: 2.224 ± 0.057
1.969AspPro: 1.969 ± 0.051
1.43AspGln: 1.43 ± 0.047
2.677AspArg: 2.677 ± 0.074
3.091AspSer: 3.091 ± 0.063
3.315AspThr: 3.315 ± 0.065
3.796AspVal: 3.796 ± 0.068
0.546AspTrp: 0.546 ± 0.03
2.946AspTyr: 2.946 ± 0.075
0.001AspXaa: 0.001 ± 0.001
Glu
5.749GluAla: 5.749 ± 0.099
0.813GluCys: 0.813 ± 0.034
4.485GluAsp: 4.485 ± 0.076
7.751GluGlu: 7.751 ± 0.122
2.41GluPhe: 2.41 ± 0.055
4.62GluGly: 4.62 ± 0.071
1.459GluHis: 1.459 ± 0.044
5.332GluIle: 5.332 ± 0.091
6.286GluLys: 6.286 ± 0.118
6.897GluLeu: 6.897 ± 0.107
2.347GluMet: 2.347 ± 0.067
4.289GluAsn: 4.289 ± 0.087
1.862GluPro: 1.862 ± 0.053
3.43GluGln: 3.43 ± 0.075
3.192GluArg: 3.192 ± 0.083
3.303GluSer: 3.303 ± 0.061
3.982GluThr: 3.982 ± 0.079
4.497GluVal: 4.497 ± 0.083
0.664GluTrp: 0.664 ± 0.034
3.258GluTyr: 3.258 ± 0.073
0.0GluXaa: 0.0 ± 0.0
Phe
3.014PheAla: 3.014 ± 0.064
0.788PheCys: 0.788 ± 0.031
2.37PheAsp: 2.37 ± 0.051
2.286PheGlu: 2.286 ± 0.055
1.818PhePhe: 1.818 ± 0.058
2.931PheGly: 2.931 ± 0.06
0.94PheHis: 0.94 ± 0.034
2.496PheIle: 2.496 ± 0.057
1.627PheLys: 1.627 ± 0.052
4.244PheLeu: 4.244 ± 0.081
1.096PheMet: 1.096 ± 0.036
1.279PheAsn: 1.279 ± 0.032
1.451PhePro: 1.451 ± 0.036
1.276PheGln: 1.276 ± 0.038
1.909PheArg: 1.909 ± 0.05
2.608PheSer: 2.608 ± 0.066
2.452PheThr: 2.452 ± 0.065
2.821PheVal: 2.821 ± 0.065
0.455PheTrp: 0.455 ± 0.026
1.793PheTyr: 1.793 ± 0.047
0.001PheXaa: 0.001 ± 0.001
Gly
4.947GlyAla: 4.947 ± 0.095
1.299GlyCys: 1.299 ± 0.045
3.714GlyAsp: 3.714 ± 0.079
5.073GlyGlu: 5.073 ± 0.086
2.985GlyPhe: 2.985 ± 0.065
4.746GlyGly: 4.746 ± 0.097
1.239GlyHis: 1.239 ± 0.048
6.322GlyIle: 6.322 ± 0.091
5.187GlyLys: 5.187 ± 0.09
5.771GlyLeu: 5.771 ± 0.098
2.667GlyMet: 2.667 ± 0.074
3.188GlyAsn: 3.188 ± 0.066
1.237GlyPro: 1.237 ± 0.046
2.366GlyGln: 2.366 ± 0.063
3.312GlyArg: 3.312 ± 0.061
4.162GlySer: 4.162 ± 0.081
4.267GlyThr: 4.267 ± 0.082
5.165GlyVal: 5.165 ± 0.081
0.733GlyTrp: 0.733 ± 0.033
3.325GlyTyr: 3.325 ± 0.07
0.0GlyXaa: 0.0 ± 0.0
His
1.172HisAla: 1.172 ± 0.036
0.292HisCys: 0.292 ± 0.02
0.91HisAsp: 0.91 ± 0.041
0.961HisGlu: 0.961 ± 0.033
0.875HisPhe: 0.875 ± 0.035
1.289HisGly: 1.289 ± 0.044
0.414HisHis: 0.414 ± 0.034
1.332HisIle: 1.332 ± 0.042
1.064HisLys: 1.064 ± 0.039
1.603HisLeu: 1.603 ± 0.047
0.595HisMet: 0.595 ± 0.029
0.753HisAsn: 0.753 ± 0.033
0.987HisPro: 0.987 ± 0.035
0.47HisGln: 0.47 ± 0.028
0.86HisArg: 0.86 ± 0.03
0.995HisSer: 0.995 ± 0.036
0.985HisThr: 0.985 ± 0.03
1.144HisVal: 1.144 ± 0.037
0.192HisTrp: 0.192 ± 0.016
0.827HisTyr: 0.827 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.818IleAla: 5.818 ± 0.096
1.426IleCys: 1.426 ± 0.045
3.85IleAsp: 3.85 ± 0.076
4.152IleGlu: 4.152 ± 0.092
2.804IlePhe: 2.804 ± 0.061
4.992IleGly: 4.992 ± 0.083
1.39IleHis: 1.39 ± 0.042
4.584IleIle: 4.584 ± 0.093
3.38IleLys: 3.38 ± 0.076
7.343IleLeu: 7.343 ± 0.118
1.94IleMet: 1.94 ± 0.05
2.6IleAsn: 2.6 ± 0.052
3.175IlePro: 3.175 ± 0.067
2.268IleGln: 2.268 ± 0.051
3.929IleArg: 3.929 ± 0.083
4.46IleSer: 4.46 ± 0.079
4.393IleThr: 4.393 ± 0.089
5.023IleVal: 5.023 ± 0.08
0.664IleTrp: 0.664 ± 0.033
2.871IleTyr: 2.871 ± 0.058
0.001IleXaa: 0.001 ± 0.001
Lys
4.741LysAla: 4.741 ± 0.09
0.734LysCys: 0.734 ± 0.035
3.472LysAsp: 3.472 ± 0.071
5.84LysGlu: 5.84 ± 0.101
1.764LysPhe: 1.764 ± 0.052
3.748LysGly: 3.748 ± 0.073
0.951LysHis: 0.951 ± 0.04
4.514LysIle: 4.514 ± 0.085
5.301LysLys: 5.301 ± 0.093
5.127LysLeu: 5.127 ± 0.078
2.103LysMet: 2.103 ± 0.052
3.247LysAsn: 3.247 ± 0.068
1.748LysPro: 1.748 ± 0.044
2.162LysGln: 2.162 ± 0.059
2.687LysArg: 2.687 ± 0.062
2.956LysSer: 2.956 ± 0.066
3.447LysThr: 3.447 ± 0.073
4.304LysVal: 4.304 ± 0.075
0.593LysTrp: 0.593 ± 0.031
2.799LysTyr: 2.799 ± 0.061
0.0LysXaa: 0.0 ± 0.0
Leu
7.021LeuAla: 7.021 ± 0.108
1.692LeuCys: 1.692 ± 0.048
5.054LeuAsp: 5.054 ± 0.087
6.441LeuGlu: 6.441 ± 0.091
3.865LeuPhe: 3.865 ± 0.085
6.137LeuGly: 6.137 ± 0.097
1.734LeuHis: 1.734 ± 0.05
5.832LeuIle: 5.832 ± 0.105
5.529LeuLys: 5.529 ± 0.086
9.563LeuLeu: 9.563 ± 0.202
2.545LeuMet: 2.545 ± 0.061
3.69LeuAsn: 3.69 ± 0.061
3.737LeuPro: 3.737 ± 0.077
3.71LeuGln: 3.71 ± 0.072
3.892LeuArg: 3.892 ± 0.076
5.867LeuSer: 5.867 ± 0.101
5.54LeuThr: 5.54 ± 0.079
5.659LeuVal: 5.659 ± 0.095
0.94LeuTrp: 0.94 ± 0.035
3.449LeuTyr: 3.449 ± 0.073
0.001LeuXaa: 0.001 ± 0.001
Met
2.492MetAla: 2.492 ± 0.056
0.342MetCys: 0.342 ± 0.021
1.915MetAsp: 1.915 ± 0.051
2.606MetGlu: 2.606 ± 0.059
1.002MetPhe: 1.002 ± 0.039
2.184MetGly: 2.184 ± 0.057
0.487MetHis: 0.487 ± 0.024
2.279MetIle: 2.279 ± 0.052
2.356MetLys: 2.356 ± 0.053
2.704MetLeu: 2.704 ± 0.06
0.935MetMet: 0.935 ± 0.036
1.57MetAsn: 1.57 ± 0.046
1.148MetPro: 1.148 ± 0.04
1.318MetGln: 1.318 ± 0.036
1.271MetArg: 1.271 ± 0.043
1.789MetSer: 1.789 ± 0.048
1.935MetThr: 1.935 ± 0.051
1.952MetVal: 1.952 ± 0.051
0.194MetTrp: 0.194 ± 0.016
1.03MetTyr: 1.03 ± 0.035
0.0MetXaa: 0.0 ± 0.0
Asn
3.317AsnAla: 3.317 ± 0.076
0.627AsnCys: 0.627 ± 0.033
2.122AsnAsp: 2.122 ± 0.054
2.574AsnGlu: 2.574 ± 0.057
1.559AsnPhe: 1.559 ± 0.043
3.489AsnGly: 3.489 ± 0.072
0.767AsnHis: 0.767 ± 0.032
3.279AsnIle: 3.279 ± 0.069
2.237AsnLys: 2.237 ± 0.059
3.635AsnLeu: 3.635 ± 0.074
1.451AsnMet: 1.451 ± 0.043
1.823AsnAsn: 1.823 ± 0.057
1.811AsnPro: 1.811 ± 0.053
1.237AsnGln: 1.237 ± 0.046
2.066AsnArg: 2.066 ± 0.056
2.335AsnSer: 2.335 ± 0.06
2.53AsnThr: 2.53 ± 0.063
3.11AsnVal: 3.11 ± 0.063
0.39AsnTrp: 0.39 ± 0.024
1.831AsnTyr: 1.831 ± 0.05
0.0AsnXaa: 0.0 ± 0.0
Pro
2.726ProAla: 2.726 ± 0.06
0.424ProCys: 0.424 ± 0.024
2.148ProAsp: 2.148 ± 0.061
3.702ProGlu: 3.702 ± 0.077
1.53ProPhe: 1.53 ± 0.048
2.5ProGly: 2.5 ± 0.059
0.573ProHis: 0.573 ± 0.027
2.031ProIle: 2.031 ± 0.054
1.786ProLys: 1.786 ± 0.052
2.786ProLeu: 2.786 ± 0.061
0.944ProMet: 0.944 ± 0.035
1.132ProAsn: 1.132 ± 0.037
0.658ProPro: 0.658 ± 0.026
1.075ProGln: 1.075 ± 0.042
1.001ProArg: 1.001 ± 0.035
1.622ProSer: 1.622 ± 0.042
1.629ProThr: 1.629 ± 0.057
3.079ProVal: 3.079 ± 0.075
0.37ProTrp: 0.37 ± 0.026
1.501ProTyr: 1.501 ± 0.042
0.0ProXaa: 0.0 ± 0.0
Gln
2.573GlnAla: 2.573 ± 0.064
0.392GlnCys: 0.392 ± 0.023
1.739GlnAsp: 1.739 ± 0.048
3.015GlnGlu: 3.015 ± 0.068
1.226GlnPhe: 1.226 ± 0.04
2.244GlnGly: 2.244 ± 0.062
0.51GlnHis: 0.51 ± 0.024
2.759GlnIle: 2.759 ± 0.059
2.662GlnLys: 2.662 ± 0.057
3.048GlnLeu: 3.048 ± 0.065
1.253GlnMet: 1.253 ± 0.039
1.794GlnAsn: 1.794 ± 0.05
1.063GlnPro: 1.063 ± 0.04
1.357GlnGln: 1.357 ± 0.047
1.496GlnArg: 1.496 ± 0.05
1.652GlnSer: 1.652 ± 0.052
1.94GlnThr: 1.94 ± 0.054
2.283GlnVal: 2.283 ± 0.053
0.322GlnTrp: 0.322 ± 0.022
1.326GlnTyr: 1.326 ± 0.046
0.0GlnXaa: 0.0 ± 0.0
Arg
2.709ArgAla: 2.709 ± 0.055
0.553ArgCys: 0.553 ± 0.03
2.409ArgAsp: 2.409 ± 0.059
4.123ArgGlu: 4.123 ± 0.082
1.749ArgPhe: 1.749 ± 0.045
2.573ArgGly: 2.573 ± 0.067
0.758ArgHis: 0.758 ± 0.033
3.606ArgIle: 3.606 ± 0.065
3.337ArgLys: 3.337 ± 0.066
3.861ArgLeu: 3.861 ± 0.075
1.673ArgMet: 1.673 ± 0.04
2.016ArgAsn: 2.016 ± 0.052
1.368ArgPro: 1.368 ± 0.047
1.88ArgGln: 1.88 ± 0.052
2.436ArgArg: 2.436 ± 0.067
2.279ArgSer: 2.279 ± 0.045
2.313ArgThr: 2.313 ± 0.05
2.682ArgVal: 2.682 ± 0.059
0.321ArgTrp: 0.321 ± 0.02
1.885ArgTyr: 1.885 ± 0.053
0.001ArgXaa: 0.001 ± 0.001
Ser
4.249SerAla: 4.249 ± 0.077
0.816SerCys: 0.816 ± 0.035
3.046SerAsp: 3.046 ± 0.068
3.727SerGlu: 3.727 ± 0.074
2.553SerPhe: 2.553 ± 0.055
4.776SerGly: 4.776 ± 0.087
0.98SerHis: 0.98 ± 0.035
3.712SerIle: 3.712 ± 0.07
2.9SerLys: 2.9 ± 0.063
5.155SerLeu: 5.155 ± 0.095
1.83SerMet: 1.83 ± 0.048
2.113SerAsn: 2.113 ± 0.054
1.638SerPro: 1.638 ± 0.05
1.699SerGln: 1.699 ± 0.044
2.685SerArg: 2.685 ± 0.064
3.139SerSer: 3.139 ± 0.08
2.924SerThr: 2.924 ± 0.064
4.244SerVal: 4.244 ± 0.075
0.472SerTrp: 0.472 ± 0.027
2.419SerTyr: 2.419 ± 0.064
0.001SerXaa: 0.001 ± 0.001
Thr
4.896ThrAla: 4.896 ± 0.088
0.625ThrCys: 0.625 ± 0.027
3.43ThrAsp: 3.43 ± 0.068
4.492ThrGlu: 4.492 ± 0.087
2.231ThrPhe: 2.231 ± 0.06
4.936ThrGly: 4.936 ± 0.092
0.897ThrHis: 0.897 ± 0.03
3.92ThrIle: 3.92 ± 0.074
2.917ThrLys: 2.917 ± 0.065
5.12ThrLeu: 5.12 ± 0.076
1.504ThrMet: 1.504 ± 0.041
2.039ThrAsn: 2.039 ± 0.053
2.237ThrPro: 2.237 ± 0.061
1.764ThrGln: 1.764 ± 0.05
1.915ThrArg: 1.915 ± 0.048
2.917ThrSer: 2.917 ± 0.064
3.149ThrThr: 3.149 ± 0.076
4.816ThrVal: 4.816 ± 0.097
0.479ThrTrp: 0.479 ± 0.027
2.257ThrTyr: 2.257 ± 0.063
0.0ThrXaa: 0.0 ± 0.0
Val
5.322ValAla: 5.322 ± 0.088
1.148ValCys: 1.148 ± 0.037
4.019ValAsp: 4.019 ± 0.074
4.952ValGlu: 4.952 ± 0.091
2.821ValPhe: 2.821 ± 0.072
4.47ValGly: 4.47 ± 0.086
1.09ValHis: 1.09 ± 0.034
5.335ValIle: 5.335 ± 0.075
4.216ValLys: 4.216 ± 0.07
6.972ValLeu: 6.972 ± 0.103
2.134ValMet: 2.134 ± 0.053
2.979ValAsn: 2.979 ± 0.061
2.581ValPro: 2.581 ± 0.058
2.11ValGln: 2.11 ± 0.057
2.905ValArg: 2.905 ± 0.063
4.371ValSer: 4.371 ± 0.079
4.689ValThr: 4.689 ± 0.093
5.098ValVal: 5.098 ± 0.089
0.716ValTrp: 0.716 ± 0.036
2.811ValTyr: 2.811 ± 0.058
0.0ValXaa: 0.0 ± 0.0
Trp
0.506TrpAla: 0.506 ± 0.025
0.157TrpCys: 0.157 ± 0.014
0.523TrpAsp: 0.523 ± 0.025
0.669TrpGlu: 0.669 ± 0.033
0.404TrpPhe: 0.404 ± 0.023
0.639TrpGly: 0.639 ± 0.035
0.179TrpHis: 0.179 ± 0.015
0.721TrpIle: 0.721 ± 0.031
0.757TrpLys: 0.757 ± 0.032
0.91TrpLeu: 0.91 ± 0.035
0.345TrpMet: 0.345 ± 0.021
0.531TrpAsn: 0.531 ± 0.031
0.228TrpPro: 0.228 ± 0.019
0.444TrpGln: 0.444 ± 0.025
0.351TrpArg: 0.351 ± 0.021
0.465TrpSer: 0.465 ± 0.025
0.404TrpThr: 0.404 ± 0.025
0.487TrpVal: 0.487 ± 0.028
0.13TrpTrp: 0.13 ± 0.012
0.411TrpTyr: 0.411 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.001TyrAla: 3.001 ± 0.067
0.654TyrCys: 0.654 ± 0.03
2.608TyrAsp: 2.608 ± 0.066
3.139TyrGlu: 3.139 ± 0.062
1.89TyrPhe: 1.89 ± 0.052
3.038TyrGly: 3.038 ± 0.068
0.906TyrHis: 0.906 ± 0.033
2.743TyrIle: 2.743 ± 0.062
1.96TyrLys: 1.96 ± 0.062
4.167TyrLeu: 4.167 ± 0.08
1.172TyrMet: 1.172 ± 0.042
1.872TyrAsn: 1.872 ± 0.048
1.519TyrPro: 1.519 ± 0.045
1.523TyrGln: 1.523 ± 0.042
2.311TyrArg: 2.311 ± 0.053
2.351TyrSer: 2.351 ± 0.059
2.276TyrThr: 2.276 ± 0.066
2.866TyrVal: 2.866 ± 0.066
0.347TyrTrp: 0.347 ± 0.023
2.071TyrTyr: 2.071 ± 0.054
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.001XaaPhe: 0.001 ± 0.001
0.003XaaGly: 0.003 ± 0.002
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.003XaaLeu: 0.003 ± 0.002
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.001
0.0XaaSer: 0.0 ± 0.0
0.001XaaThr: 0.001 ± 0.001
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.009XaaXaa: 0.009 ± 0.004
Statistics based on 2512 proteins (798008 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski