Amino acid dipeptide frequency for Rubrobacter radiotolerans

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
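The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that sequences use one-letter codes, and that any residue outside the 20 standard ones is mapped to X (Xaa). The names `dipeptide_permille`, `UNIFORM_PERMILLE` and `toy_proteome` are hypothetical, and the sequences are made up.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Each protein is scanned separately, so no pair spans two proteins.
        for first, second in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard residues becomes X (Xaa).
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform baseline from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / len(AMINO_ACIDS) ** 2

toy_proteome = ["MAALG", "MLLA"]  # made-up sequences, for illustration only
freqs = dipeptide_permille(toy_proteome)
over_represented = sorted(p for p, f in freqs.items() if f > UNIFORM_PERMILLE)
```

On a toy input this small every observed pair lies far above the uniform baseline; the comparison only becomes meaningful on a full proteome.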

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
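As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 1/400 mentioned earlier. The function names are hypothetical; the two input values (AlaAla and CysCys) are taken from the table.

```python
def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


def fold_vs_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n_dipeptides."""
    uniform_permille = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    return value_permille / uniform_permille


ala_ala = 14.102  # AlaAla, per mille, from the table
cys_cys = 0.099   # CysCys, per mille, from the table

print(permille_to_fraction(ala_ala))  # roughly 0.0141
print(fold_vs_uniform(ala_ala))       # above 1: more frequent than uniform
print(fold_vs_uniform(cys_cys))       # below 1: less frequent than uniform
```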

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (first residue, then second residue): value ± error
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.102 ± 0.172
AlaCys: 0.85 ± 0.032
AlaAsp: 4.975 ± 0.076
AlaGlu: 8.291 ± 0.108
AlaPhe: 3.929 ± 0.074
AlaGly: 11.584 ± 0.123
AlaHis: 1.672 ± 0.047
AlaIle: 3.979 ± 0.085
AlaLys: 2.733 ± 0.064
AlaLeu: 13.174 ± 0.155
AlaMet: 2.259 ± 0.044
AlaAsn: 2.2 ± 0.05
AlaPro: 4.729 ± 0.069
AlaGln: 2.402 ± 0.066
AlaArg: 9.367 ± 0.116
AlaSer: 6.413 ± 0.086
AlaThr: 5.248 ± 0.081
AlaVal: 9.863 ± 0.115
AlaTrp: 1.093 ± 0.034
AlaTyr: 2.489 ± 0.055
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.807 ± 0.033
CysCys: 0.099 ± 0.01
CysAsp: 0.362 ± 0.02
CysGlu: 0.534 ± 0.026
CysPhe: 0.229 ± 0.015
CysGly: 0.951 ± 0.032
CysHis: 0.148 ± 0.011
CysIle: 0.235 ± 0.016
CysLys: 0.182 ± 0.015
CysLeu: 0.624 ± 0.027
CysMet: 0.098 ± 0.01
CysAsn: 0.15 ± 0.011
CysPro: 0.424 ± 0.025
CysGln: 0.13 ± 0.012
CysArg: 0.544 ± 0.025
CysSer: 0.409 ± 0.02
CysThr: 0.312 ± 0.018
CysVal: 0.632 ± 0.025
CysTrp: 0.068 ± 0.009
CysTyr: 0.165 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.747 ± 0.077
AspCys: 0.276 ± 0.016
AspAsp: 2.431 ± 0.06
AspGlu: 4.35 ± 0.079
AspPhe: 2.062 ± 0.041
AspGly: 5.191 ± 0.081
AspHis: 0.921 ± 0.032
AspIle: 1.766 ± 0.048
AspLys: 0.929 ± 0.033
AspLeu: 6.723 ± 0.072
AspMet: 0.716 ± 0.03
AspAsn: 0.89 ± 0.03
AspPro: 3.488 ± 0.066
AspGln: 0.878 ± 0.037
AspArg: 4.348 ± 0.072
AspSer: 2.287 ± 0.049
AspThr: 2.15 ± 0.052
AspVal: 4.321 ± 0.071
AspTrp: 0.65 ± 0.028
AspTyr: 1.495 ± 0.035
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.819 ± 0.129
GluCys: 0.407 ± 0.024
GluAsp: 4.112 ± 0.065
GluGlu: 7.598 ± 0.11
GluPhe: 2.329 ± 0.049
GluGly: 7.2 ± 0.092
GluHis: 1.457 ± 0.042
GluIle: 3.469 ± 0.068
GluLys: 2.819 ± 0.066
GluLeu: 6.983 ± 0.091
GluMet: 1.521 ± 0.046
GluAsn: 2.107 ± 0.051
GluPro: 3.974 ± 0.07
GluGln: 1.963 ± 0.054
GluArg: 9.004 ± 0.122
GluSer: 4.368 ± 0.073
GluThr: 3.943 ± 0.075
GluVal: 7.485 ± 0.098
GluTrp: 0.86 ± 0.03
GluTyr: 1.706 ± 0.044
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.003 ± 0.071
PheCys: 0.339 ± 0.021
PheAsp: 2.06 ± 0.053
PheGlu: 2.655 ± 0.054
PhePhe: 1.62 ± 0.048
PheGly: 4.195 ± 0.077
PheHis: 0.587 ± 0.024
PheIle: 1.294 ± 0.04
PheLys: 0.713 ± 0.027
PheLeu: 3.603 ± 0.081
PheMet: 0.637 ± 0.028
PheAsn: 0.757 ± 0.032
PhePro: 1.626 ± 0.04
PheGln: 0.716 ± 0.03
PheArg: 2.453 ± 0.062
PheSer: 2.313 ± 0.052
PheThr: 1.85 ± 0.052
PheVal: 3.312 ± 0.071
PheTrp: 0.511 ± 0.025
PheTyr: 1.071 ± 0.035
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.343 ± 0.107
GlyCys: 0.789 ± 0.033
GlyAsp: 4.674 ± 0.076
GlyGlu: 8.481 ± 0.121
GlyPhe: 3.787 ± 0.067
GlyGly: 9.57 ± 0.14
GlyHis: 1.575 ± 0.043
GlyIle: 4.077 ± 0.071
GlyLys: 2.767 ± 0.059
GlyLeu: 9.63 ± 0.122
GlyMet: 2.289 ± 0.046
GlyAsn: 1.952 ± 0.053
GlyPro: 4.077 ± 0.063
GlyGln: 2.03 ± 0.064
GlyArg: 7.991 ± 0.098
GlySer: 5.839 ± 0.089
GlyThr: 5.26 ± 0.076
GlyVal: 8.187 ± 0.103
GlyTrp: 1.332 ± 0.043
GlyTyr: 2.743 ± 0.059
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.824 ± 0.049
HisCys: 0.156 ± 0.011
HisAsp: 0.964 ± 0.034
HisGlu: 1.291 ± 0.038
HisPhe: 0.607 ± 0.022
HisGly: 1.49 ± 0.038
HisHis: 0.448 ± 0.023
HisIle: 0.614 ± 0.025
HisLys: 0.348 ± 0.021
HisLeu: 1.849 ± 0.051
HisMet: 0.254 ± 0.016
HisAsn: 0.373 ± 0.019
HisPro: 1.217 ± 0.036
HisGln: 0.319 ± 0.019
HisArg: 1.355 ± 0.038
HisSer: 0.827 ± 0.031
HisThr: 0.77 ± 0.026
HisVal: 1.323 ± 0.037
HisTrp: 0.181 ± 0.014
HisTyr: 0.513 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.948 ± 0.077
IleCys: 0.295 ± 0.019
IleAsp: 1.801 ± 0.048
IleGlu: 3.191 ± 0.061
IlePhe: 1.392 ± 0.041
IleGly: 3.631 ± 0.079
IleHis: 0.674 ± 0.025
IleIle: 1.388 ± 0.048
IleLys: 1.059 ± 0.033
IleLeu: 3.772 ± 0.065
IleMet: 0.602 ± 0.025
IleAsn: 0.914 ± 0.036
IlePro: 2.11 ± 0.049
IleGln: 0.827 ± 0.031
IleArg: 2.585 ± 0.05
IleSer: 2.817 ± 0.056
IleThr: 2.084 ± 0.056
IleVal: 3.558 ± 0.076
IleTrp: 0.287 ± 0.018
IleTyr: 0.84 ± 0.029
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.479 ± 0.061
LysCys: 0.113 ± 0.011
LysAsp: 1.554 ± 0.038
LysGlu: 2.013 ± 0.046
LysPhe: 0.624 ± 0.024
LysGly: 2.144 ± 0.05
LysHis: 0.461 ± 0.024
LysIle: 1.044 ± 0.038
LysLys: 1.165 ± 0.042
LysLeu: 2.704 ± 0.052
LysMet: 0.577 ± 0.028
LysAsn: 0.873 ± 0.032
LysPro: 1.351 ± 0.041
LysGln: 0.619 ± 0.032
LysArg: 2.323 ± 0.055
LysSer: 1.567 ± 0.04
LysThr: 1.67 ± 0.047
LysVal: 2.502 ± 0.053
LysTrp: 0.214 ± 0.014
LysTyr: 0.567 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.727 ± 0.156
LeuCys: 0.819 ± 0.03
LeuAsp: 6.033 ± 0.095
LeuGlu: 6.942 ± 0.085
LeuPhe: 3.821 ± 0.079
LeuGly: 10.141 ± 0.131
LeuHis: 1.568 ± 0.043
LeuIle: 3.726 ± 0.074
LeuLys: 2.824 ± 0.051
LeuLeu: 11.008 ± 0.159
LeuMet: 1.757 ± 0.041
LeuAsn: 1.919 ± 0.041
LeuPro: 5.635 ± 0.08
LeuGln: 2.077 ± 0.046
LeuArg: 8.148 ± 0.105
LeuSer: 6.605 ± 0.081
LeuThr: 4.874 ± 0.067
LeuVal: 9.036 ± 0.111
LeuTrp: 1.132 ± 0.035
LeuTyr: 2.622 ± 0.05
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.803 ± 0.041
MetCys: 0.148 ± 0.013
MetAsp: 0.978 ± 0.035
MetGlu: 1.36 ± 0.039
MetPhe: 0.562 ± 0.025
MetGly: 1.524 ± 0.045
MetHis: 0.347 ± 0.02
MetIle: 0.898 ± 0.029
MetLys: 0.784 ± 0.03
MetLeu: 1.932 ± 0.04
MetMet: 0.399 ± 0.026
MetAsn: 0.589 ± 0.025
MetPro: 0.986 ± 0.033
MetGln: 0.591 ± 0.028
MetArg: 1.637 ± 0.046
MetSer: 1.504 ± 0.043
MetThr: 1.168 ± 0.039
MetVal: 1.393 ± 0.039
MetTrp: 0.139 ± 0.013
MetTyr: 0.378 ± 0.022
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.407 ± 0.05
AsnCys: 0.191 ± 0.015
AsnAsp: 1.088 ± 0.04
AsnGlu: 1.359 ± 0.042
AsnPhe: 0.764 ± 0.028
AsnGly: 2.034 ± 0.05
AsnHis: 0.387 ± 0.02
AsnIle: 0.963 ± 0.028
AsnLys: 0.429 ± 0.02
AsnLeu: 2.532 ± 0.051
AsnMet: 0.356 ± 0.02
AsnAsn: 0.546 ± 0.028
AsnPro: 1.656 ± 0.043
AsnGln: 0.491 ± 0.022
AsnArg: 1.62 ± 0.041
AsnSer: 1.087 ± 0.036
AsnThr: 1.175 ± 0.034
AsnVal: 2.098 ± 0.046
AsnTrp: 0.289 ± 0.018
AsnTyr: 0.586 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.08 ± 0.074
ProCys: 0.267 ± 0.019
ProAsp: 3.622 ± 0.074
ProGlu: 6.572 ± 0.095
ProPhe: 1.791 ± 0.04
ProGly: 5.24 ± 0.08
ProHis: 0.865 ± 0.032
ProIle: 1.462 ± 0.045
ProLys: 1.462 ± 0.042
ProLeu: 4.721 ± 0.064
ProMet: 0.87 ± 0.028
ProAsn: 1.259 ± 0.037
ProPro: 2.634 ± 0.062
ProGln: 1.273 ± 0.036
ProArg: 2.903 ± 0.053
ProSer: 2.475 ± 0.053
ProThr: 2.39 ± 0.048
ProVal: 4.67 ± 0.075
ProTrp: 0.565 ± 0.025
ProTyr: 1.289 ± 0.038
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.604 ± 0.085
GlnCys: 0.099 ± 0.009
GlnAsp: 1.109 ± 0.039
GlnGlu: 1.773 ± 0.048
GlnPhe: 0.663 ± 0.026
GlnGly: 1.807 ± 0.041
GlnHis: 0.321 ± 0.015
GlnIle: 0.977 ± 0.036
GlnLys: 0.803 ± 0.032
GlnLeu: 1.848 ± 0.049
GlnMet: 0.561 ± 0.026
GlnAsn: 0.71 ± 0.032
GlnPro: 0.931 ± 0.028
GlnGln: 0.867 ± 0.055
GlnArg: 1.849 ± 0.04
GlnSer: 1.265 ± 0.035
GlnThr: 1.29 ± 0.043
GlnVal: 1.779 ± 0.048
GlnTrp: 0.225 ± 0.016
GlnTyr: 0.495 ± 0.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.146 ± 0.118
ArgCys: 0.464 ± 0.026
ArgAsp: 4.501 ± 0.073
ArgGlu: 8.833 ± 0.123
ArgPhe: 2.957 ± 0.055
ArgGly: 6.173 ± 0.101
ArgHis: 1.363 ± 0.035
ArgIle: 3.361 ± 0.063
ArgLys: 2.422 ± 0.049
ArgLeu: 8.514 ± 0.126
ArgMet: 1.892 ± 0.044
ArgAsn: 1.762 ± 0.042
ArgPro: 3.885 ± 0.067
ArgGln: 1.932 ± 0.045
ArgArg: 7.985 ± 0.127
ArgSer: 4.812 ± 0.077
ArgThr: 3.988 ± 0.067
ArgVal: 6.303 ± 0.078
ArgTrp: 1.023 ± 0.038
ArgTyr: 2.28 ± 0.045
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.767 ± 0.08
SerCys: 0.417 ± 0.022
SerAsp: 2.88 ± 0.057
SerGlu: 4.706 ± 0.073
SerPhe: 2.28 ± 0.048
SerGly: 7.631 ± 0.106
SerHis: 0.897 ± 0.034
SerIle: 1.936 ± 0.047
SerLys: 1.214 ± 0.038
SerLeu: 5.764 ± 0.09
SerMet: 1.146 ± 0.035
SerAsn: 1.1 ± 0.031
SerPro: 3.38 ± 0.062
SerGln: 1.246 ± 0.04
SerArg: 4.594 ± 0.072
SerSer: 3.095 ± 0.078
SerThr: 2.415 ± 0.057
SerVal: 5.093 ± 0.083
SerTrp: 0.684 ± 0.026
SerTyr: 1.447 ± 0.037
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.875 ± 0.077
ThrCys: 0.3 ± 0.018
ThrAsp: 2.684 ± 0.051
ThrGlu: 3.433 ± 0.066
ThrPhe: 1.926 ± 0.048
ThrGly: 5.31 ± 0.076
ThrHis: 0.909 ± 0.032
ThrIle: 1.822 ± 0.051
ThrLys: 1.16 ± 0.04
ThrLeu: 5.764 ± 0.088
ThrMet: 0.909 ± 0.028
ThrAsn: 1.156 ± 0.035
ThrPro: 2.802 ± 0.063
ThrGln: 1.075 ± 0.029
ThrArg: 2.973 ± 0.053
ThrSer: 2.738 ± 0.052
ThrThr: 2.604 ± 0.066
ThrVal: 4.934 ± 0.079
ThrTrp: 0.468 ± 0.023
ThrTyr: 1.259 ± 0.033
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.103 ± 0.113
ValCys: 0.812 ± 0.032
ValAsp: 3.905 ± 0.059
ValGlu: 7.26 ± 0.084
ValPhe: 3.612 ± 0.073
ValGly: 8.109 ± 0.11
ValHis: 1.474 ± 0.045
ValIle: 3.382 ± 0.066
ValLys: 1.941 ± 0.049
ValLeu: 9.236 ± 0.116
ValMet: 1.605 ± 0.042
ValAsn: 1.857 ± 0.05
ValPro: 4.685 ± 0.071
ValGln: 1.734 ± 0.043
ValArg: 7.333 ± 0.091
ValSer: 5.135 ± 0.071
ValThr: 3.903 ± 0.068
ValVal: 8.132 ± 0.124
ValTrp: 0.967 ± 0.037
ValTyr: 2.209 ± 0.048
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.037 ± 0.034
TrpCys: 0.083 ± 0.011
TrpAsp: 0.546 ± 0.028
TrpGlu: 0.712 ± 0.032
TrpPhe: 0.412 ± 0.023
TrpGly: 0.893 ± 0.029
TrpHis: 0.211 ± 0.015
TrpIle: 0.521 ± 0.022
TrpLys: 0.306 ± 0.021
TrpLeu: 1.213 ± 0.034
TrpMet: 0.291 ± 0.017
TrpAsn: 0.325 ± 0.019
TrpPro: 0.528 ± 0.024
TrpGln: 0.321 ± 0.02
TrpArg: 1.104 ± 0.033
TrpSer: 0.749 ± 0.033
TrpThr: 0.678 ± 0.027
TrpVal: 0.774 ± 0.032
TrpTrp: 0.203 ± 0.015
TrpTyr: 0.292 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.733 ± 0.048
TyrCys: 0.178 ± 0.013
TyrAsp: 1.34 ± 0.039
TyrGlu: 1.771 ± 0.039
TyrPhe: 0.947 ± 0.035
TyrGly: 2.574 ± 0.053
TyrHis: 0.439 ± 0.023
TyrIle: 0.86 ± 0.034
TyrLys: 0.504 ± 0.025
TyrLeu: 2.821 ± 0.066
TyrMet: 0.38 ± 0.02
TyrAsn: 0.62 ± 0.033
TyrPro: 1.289 ± 0.036
TyrGln: 0.488 ± 0.022
TyrArg: 2.48 ± 0.057
TyrSer: 1.39 ± 0.036
TyrThr: 1.329 ± 0.043
TyrVal: 1.999 ± 0.039
TyrTrp: 0.305 ± 0.018
TyrTyr: 0.624 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3148 proteins (976227 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
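One way to read this note is sketched below in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed on each resample, and the spread of those 100 estimates is reported as the error. This is an assumption about the procedure, not the database's actual code; in particular, using the sample standard deviation as the error measure, the function names, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread (in per mille) of one dipeptide's frequency under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


toy_proteome = ["MAALG", "MLLA", "AAAA"]  # made-up sequences
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so the error reflects variation between proteins.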

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski