Amino acid dipeptide frequency for Xanthobacter tagetidis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
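As a rough illustration of what such a calculation could look like, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice to count overlapping pairs that never span two proteins are assumptions made for this example; the page does not describe the exact counting procedure.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: every overlapping pair inside a protein is counted,
    and pairs never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
```

With the toy input there are five pairs in total (MA, AA, AL, AA, AG), so `freqs["AA"]` is 400 per mille.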

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, or 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
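The comparison against the uniform expectation can be sketched as follows. The three observed values are copied from the table on this page; the function name `classify` and the plain greater-than/less-than rule are assumptions, since the page does not state the exact threshold used for the red and blue marking.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a simple comparison with no tolerance band.
    """
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


# Values taken from the table (per mille).
observed = {"AlaAla": 23.449, "CysCys": 0.119, "AspAsp": 2.762}
labels = {pair: classify(value) for pair, value in observed.items()}
```

Here AlaAla and AspAsp come out as overrepresented and CysCys as underrepresented.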

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
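A small worked conversion, using the AlaAla value from the table; the helper names are chosen only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_ala = 23.449  # per mille, from the table
fraction = per_mille_to_fraction(ala_ala)  # about 0.023449
percent = per_mille_to_percent(ala_ala)    # about 2.3449
```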

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 23.449 ± 0.233
AlaCys: 1.327 ± 0.03
AlaAsp: 7.76 ± 0.102
AlaGlu: 8.353 ± 0.096
AlaPhe: 5.453 ± 0.076
AlaGly: 12.964 ± 0.123
AlaHis: 2.642 ± 0.047
AlaIle: 6.137 ± 0.068
AlaLys: 4.067 ± 0.064
AlaLeu: 16.378 ± 0.168
AlaMet: 3.69 ± 0.057
AlaAsn: 2.554 ± 0.047
AlaPro: 8.117 ± 0.111
AlaGln: 4.38 ± 0.059
AlaArg: 11.456 ± 0.107
AlaSer: 6.354 ± 0.077
AlaThr: 6.074 ± 0.074
AlaVal: 10.488 ± 0.103
AlaTrp: 1.551 ± 0.038
AlaTyr: 2.603 ± 0.047
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.236 ± 0.032
CysCys: 0.119 ± 0.013
CysAsp: 0.51 ± 0.018
CysGlu: 0.422 ± 0.018
CysPhe: 0.305 ± 0.014
CysGly: 1.042 ± 0.032
CysHis: 0.226 ± 0.013
CysIle: 0.323 ± 0.015
CysLys: 0.145 ± 0.01
CysLeu: 0.823 ± 0.023
CysMet: 0.165 ± 0.011
CysAsn: 0.164 ± 0.011
CysPro: 0.514 ± 0.028
CysGln: 0.208 ± 0.014
CysArg: 0.661 ± 0.024
CysSer: 0.385 ± 0.02
CysThr: 0.418 ± 0.02
CysVal: 0.671 ± 0.021
CysTrp: 0.091 ± 0.008
CysTyr: 0.187 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.688 ± 0.086
AspCys: 0.455 ± 0.018
AspAsp: 2.762 ± 0.049
AspGlu: 2.961 ± 0.05
AspPhe: 2.085 ± 0.043
AspGly: 5.273 ± 0.072
AspHis: 1.153 ± 0.034
AspIle: 2.671 ± 0.049
AspLys: 1.416 ± 0.036
AspLeu: 6.176 ± 0.083
AspMet: 1.247 ± 0.033
AspAsn: 0.894 ± 0.028
AspPro: 3.586 ± 0.048
AspGln: 1.41 ± 0.032
AspArg: 3.943 ± 0.06
AspSer: 1.516 ± 0.031
AspThr: 2.336 ± 0.05
AspVal: 4.116 ± 0.055
AspTrp: 0.776 ± 0.024
AspTyr: 1.294 ± 0.03
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.575 ± 0.097
GluCys: 0.333 ± 0.016
GluAsp: 2.654 ± 0.053
GluGlu: 3.108 ± 0.056
GluPhe: 1.464 ± 0.037
GluGly: 4.58 ± 0.06
GluHis: 1.039 ± 0.025
GluIle: 3.084 ± 0.059
GluLys: 2.046 ± 0.047
GluLeu: 4.67 ± 0.068
GluMet: 1.488 ± 0.033
GluAsn: 1.213 ± 0.03
GluPro: 2.719 ± 0.044
GluGln: 1.637 ± 0.033
GluArg: 4.937 ± 0.072
GluSer: 1.972 ± 0.038
GluThr: 2.975 ± 0.051
GluVal: 4.288 ± 0.063
GluTrp: 0.571 ± 0.02
GluTyr: 0.78 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.258 ± 0.073
PheCys: 0.417 ± 0.019
PheAsp: 2.322 ± 0.04
PheGlu: 2.017 ± 0.037
PhePhe: 1.366 ± 0.035
PheGly: 3.871 ± 0.06
PheHis: 0.719 ± 0.024
PheIle: 1.5 ± 0.034
PheLys: 0.936 ± 0.028
PheLeu: 3.552 ± 0.059
PheMet: 0.747 ± 0.025
PheAsn: 0.871 ± 0.026
PhePro: 1.794 ± 0.038
PheGln: 0.957 ± 0.026
PheArg: 2.275 ± 0.049
PheSer: 2.039 ± 0.042
PheThr: 1.943 ± 0.041
PheVal: 3.04 ± 0.046
PheTrp: 0.511 ± 0.018
PheTyr: 0.81 ± 0.031
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.587 ± 0.129
GlyCys: 0.891 ± 0.027
GlyAsp: 4.151 ± 0.067
GlyGlu: 4.586 ± 0.057
GlyPhe: 3.766 ± 0.054
GlyGly: 8.379 ± 0.262
GlyHis: 1.912 ± 0.046
GlyIle: 4.55 ± 0.06
GlyLys: 2.876 ± 0.055
GlyLeu: 9.719 ± 0.097
GlyMet: 2.324 ± 0.03
GlyAsn: 1.757 ± 0.05
GlyPro: 4.487 ± 0.07
GlyGln: 2.57 ± 0.045
GlyArg: 6.946 ± 0.075
GlySer: 4.179 ± 0.072
GlyThr: 5.097 ± 0.095
GlyVal: 6.62 ± 0.081
GlyTrp: 1.326 ± 0.035
GlyTyr: 2.186 ± 0.041
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.423 ± 0.043
HisCys: 0.208 ± 0.012
HisAsp: 1.076 ± 0.031
HisGlu: 0.907 ± 0.026
HisPhe: 0.781 ± 0.023
HisGly: 1.895 ± 0.037
HisHis: 0.504 ± 0.023
HisIle: 0.773 ± 0.02
HisLys: 0.42 ± 0.018
HisLeu: 2.149 ± 0.043
HisMet: 0.509 ± 0.017
HisAsn: 0.36 ± 0.018
HisPro: 1.387 ± 0.033
HisGln: 0.531 ± 0.019
HisArg: 1.382 ± 0.032
HisSer: 0.711 ± 0.024
HisThr: 0.756 ± 0.023
HisVal: 1.668 ± 0.032
HisTrp: 0.285 ± 0.014
HisTyr: 0.462 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.552 ± 0.075
IleCys: 0.478 ± 0.019
IleAsp: 2.866 ± 0.041
IleGlu: 2.953 ± 0.045
IlePhe: 1.431 ± 0.037
IleGly: 4.65 ± 0.069
IleHis: 0.791 ± 0.024
IleIle: 1.759 ± 0.046
IleLys: 1.149 ± 0.035
IleLeu: 4.172 ± 0.063
IleMet: 0.816 ± 0.025
IleAsn: 1.05 ± 0.026
IlePro: 2.237 ± 0.043
IleGln: 0.94 ± 0.027
IleArg: 2.759 ± 0.048
IleSer: 2.355 ± 0.05
IleThr: 2.325 ± 0.041
IleVal: 3.917 ± 0.054
IleTrp: 0.447 ± 0.016
IleTyr: 1.004 ± 0.028
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.464 ± 0.068
LysCys: 0.143 ± 0.009
LysAsp: 1.631 ± 0.035
LysGlu: 1.562 ± 0.038
LysPhe: 0.75 ± 0.024
LysGly: 2.674 ± 0.046
LysHis: 0.453 ± 0.019
LysIle: 1.35 ± 0.035
LysLys: 1.053 ± 0.037
LysLeu: 2.757 ± 0.055
LysMet: 0.661 ± 0.023
LysAsn: 0.664 ± 0.025
LysPro: 1.871 ± 0.042
LysGln: 0.71 ± 0.025
LysArg: 2.0 ± 0.043
LysSer: 1.568 ± 0.033
LysThr: 1.572 ± 0.039
LysVal: 2.698 ± 0.052
LysTrp: 0.32 ± 0.015
LysTyr: 0.538 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.823 ± 0.161
LeuCys: 0.904 ± 0.024
LeuAsp: 6.164 ± 0.081
LeuGlu: 5.079 ± 0.063
LeuPhe: 3.522 ± 0.059
LeuGly: 9.169 ± 0.081
LeuHis: 1.715 ± 0.039
LeuIle: 4.372 ± 0.062
LeuLys: 3.733 ± 0.059
LeuLeu: 9.739 ± 0.126
LeuMet: 2.397 ± 0.043
LeuAsn: 2.169 ± 0.038
LeuPro: 6.089 ± 0.073
LeuGln: 2.154 ± 0.041
LeuArg: 6.515 ± 0.072
LeuSer: 6.044 ± 0.068
LeuThr: 5.207 ± 0.068
LeuVal: 8.747 ± 0.1
LeuTrp: 1.091 ± 0.035
LeuTyr: 1.946 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.329 ± 0.052
MetCys: 0.163 ± 0.012
MetAsp: 1.138 ± 0.03
MetGlu: 1.193 ± 0.033
MetPhe: 0.709 ± 0.025
MetGly: 2.012 ± 0.038
MetHis: 0.372 ± 0.018
MetIle: 1.11 ± 0.029
MetLys: 0.814 ± 0.022
MetLeu: 2.396 ± 0.04
MetMet: 0.556 ± 0.018
MetAsn: 0.594 ± 0.021
MetPro: 1.507 ± 0.033
MetGln: 0.6 ± 0.021
MetArg: 1.824 ± 0.035
MetSer: 1.587 ± 0.034
MetThr: 1.525 ± 0.032
MetVal: 1.795 ± 0.037
MetTrp: 0.223 ± 0.013
MetTyr: 0.28 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.808 ± 0.043
AsnCys: 0.198 ± 0.014
AsnAsp: 1.052 ± 0.032
AsnGlu: 0.907 ± 0.026
AsnPhe: 0.793 ± 0.023
AsnGly: 1.937 ± 0.056
AsnHis: 0.369 ± 0.016
AsnIle: 1.008 ± 0.029
AsnLys: 0.549 ± 0.022
AsnLeu: 2.14 ± 0.045
AsnMet: 0.516 ± 0.021
AsnAsn: 0.5 ± 0.021
AsnPro: 1.641 ± 0.036
AsnGln: 0.505 ± 0.019
AsnArg: 1.491 ± 0.034
AsnSer: 0.864 ± 0.023
AsnThr: 0.975 ± 0.029
AsnVal: 1.79 ± 0.036
AsnTrp: 0.299 ± 0.014
AsnTyr: 0.544 ± 0.018
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.551 ± 0.113
ProCys: 0.337 ± 0.016
ProAsp: 3.708 ± 0.058
ProGlu: 3.806 ± 0.063
ProPhe: 2.301 ± 0.042
ProGly: 5.399 ± 0.085
ProHis: 1.101 ± 0.028
ProIle: 2.112 ± 0.039
ProLys: 1.768 ± 0.039
ProLeu: 5.411 ± 0.067
ProMet: 1.265 ± 0.032
ProAsn: 1.24 ± 0.037
ProPro: 3.514 ± 0.08
ProGln: 1.844 ± 0.036
ProArg: 3.637 ± 0.052
ProSer: 2.906 ± 0.047
ProThr: 2.497 ± 0.043
ProVal: 4.662 ± 0.063
ProTrp: 0.69 ± 0.023
ProTyr: 1.227 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.792 ± 0.058
GlnCys: 0.191 ± 0.014
GlnAsp: 1.301 ± 0.031
GlnGlu: 1.332 ± 0.034
GlnPhe: 0.909 ± 0.026
GlnGly: 2.296 ± 0.042
GlnHis: 0.512 ± 0.022
GlnIle: 1.448 ± 0.028
GlnLys: 0.928 ± 0.03
GlnLeu: 2.379 ± 0.044
GlnMet: 0.819 ± 0.027
GlnAsn: 0.648 ± 0.023
GlnPro: 1.497 ± 0.036
GlnGln: 1.003 ± 0.063
GlnArg: 2.06 ± 0.049
GlnSer: 1.407 ± 0.036
GlnThr: 1.308 ± 0.033
GlnVal: 2.408 ± 0.042
GlnTrp: 0.341 ± 0.016
GlnTyr: 0.5 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.031 ± 0.113
ArgCys: 0.526 ± 0.018
ArgAsp: 3.869 ± 0.059
ArgGlu: 3.997 ± 0.064
ArgPhe: 2.914 ± 0.045
ArgGly: 5.206 ± 0.067
ArgHis: 1.695 ± 0.037
ArgIle: 3.834 ± 0.062
ArgLys: 1.837 ± 0.038
ArgLeu: 8.634 ± 0.092
ArgMet: 1.848 ± 0.039
ArgAsn: 1.511 ± 0.04
ArgPro: 4.318 ± 0.082
ArgGln: 2.236 ± 0.044
ArgArg: 6.491 ± 0.083
ArgSer: 3.243 ± 0.054
ArgThr: 3.645 ± 0.049
ArgVal: 4.988 ± 0.061
ArgTrp: 0.935 ± 0.025
ArgTyr: 1.516 ± 0.034
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.348 ± 0.084
SerCys: 0.417 ± 0.02
SerAsp: 2.393 ± 0.042
SerGlu: 2.337 ± 0.043
SerPhe: 2.043 ± 0.037
SerGly: 5.396 ± 0.099
SerHis: 0.925 ± 0.026
SerIle: 2.202 ± 0.045
SerLys: 1.188 ± 0.03
SerLeu: 4.924 ± 0.063
SerMet: 1.094 ± 0.029
SerAsn: 1.008 ± 0.027
SerPro: 2.812 ± 0.046
SerGln: 1.337 ± 0.027
SerArg: 3.192 ± 0.052
SerSer: 2.393 ± 0.059
SerThr: 2.329 ± 0.047
SerVal: 3.791 ± 0.057
SerTrp: 0.622 ± 0.023
SerTyr: 1.151 ± 0.034
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.136 ± 0.071
ThrCys: 0.45 ± 0.02
ThrAsp: 2.395 ± 0.043
ThrGlu: 2.246 ± 0.042
ThrPhe: 2.079 ± 0.042
ThrGly: 4.992 ± 0.088
ThrHis: 0.949 ± 0.024
ThrIle: 2.376 ± 0.046
ThrLys: 1.224 ± 0.034
ThrLeu: 5.848 ± 0.079
ThrMet: 1.014 ± 0.028
ThrAsn: 0.997 ± 0.026
ThrPro: 3.409 ± 0.049
ThrGln: 1.219 ± 0.034
ThrArg: 3.298 ± 0.049
ThrSer: 2.482 ± 0.046
ThrThr: 2.54 ± 0.053
ThrVal: 4.168 ± 0.064
ThrTrp: 0.566 ± 0.02
ThrTyr: 1.154 ± 0.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.987 ± 0.095
ValCys: 0.722 ± 0.024
ValAsp: 4.104 ± 0.053
ValGlu: 4.758 ± 0.067
ValPhe: 2.956 ± 0.045
ValGly: 6.145 ± 0.079
ValHis: 1.429 ± 0.03
ValIle: 3.771 ± 0.059
ValLys: 2.415 ± 0.052
ValLeu: 8.231 ± 0.089
ValMet: 1.871 ± 0.035
ValAsn: 1.852 ± 0.04
ValPro: 4.704 ± 0.062
ValGln: 1.783 ± 0.035
ValArg: 5.674 ± 0.067
ValSer: 4.168 ± 0.064
ValThr: 4.406 ± 0.07
ValVal: 6.902 ± 0.083
ValTrp: 0.852 ± 0.028
ValTyr: 1.457 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.195 ± 0.03
TrpCys: 0.135 ± 0.008
TrpAsp: 0.573 ± 0.021
TrpGlu: 0.519 ± 0.017
TrpPhe: 0.465 ± 0.018
TrpGly: 0.912 ± 0.03
TrpHis: 0.315 ± 0.014
TrpIle: 0.566 ± 0.02
TrpLys: 0.344 ± 0.014
TrpLeu: 1.365 ± 0.034
TrpMet: 0.299 ± 0.017
TrpAsn: 0.377 ± 0.016
TrpPro: 0.747 ± 0.024
TrpGln: 0.451 ± 0.017
TrpArg: 1.169 ± 0.032
TrpSer: 0.713 ± 0.023
TrpThr: 0.678 ± 0.019
TrpVal: 0.748 ± 0.022
TrpTrp: 0.22 ± 0.013
TrpTyr: 0.24 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.568 ± 0.044
TyrCys: 0.205 ± 0.012
TyrAsp: 1.327 ± 0.031
TyrGlu: 1.089 ± 0.025
TyrPhe: 0.852 ± 0.027
TyrGly: 2.117 ± 0.037
TyrHis: 0.368 ± 0.016
TyrIle: 0.706 ± 0.023
TyrLys: 0.533 ± 0.022
TyrLeu: 2.118 ± 0.045
TyrMet: 0.401 ± 0.017
TyrAsn: 0.462 ± 0.017
TyrPro: 1.023 ± 0.025
TyrGln: 0.597 ± 0.021
TyrArg: 1.564 ± 0.036
TyrSer: 1.017 ± 0.028
TyrThr: 0.935 ± 0.026
TyrVal: 1.683 ± 0.03
TyrTrp: 0.313 ± 0.014
TyrTyr: 0.509 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4442 proteins (1423151 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
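One way such a protein-level bootstrap could be set up is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the use of the sample standard deviation as the error, and the fixed random seed are assumptions for this illustration; the page states only that bootstrapping (×100) at the protein level was used.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: the error is the standard deviation of the frequency
    over n_rounds resamples of whole proteins, drawn with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample proteins, not individual residues or pairs.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
toy = ["MAAL", "AAG", "GGAA", "MKLV"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than single dipeptides keeps residues from the same protein together, so the estimate reflects variation between proteins.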

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski