Amino acid dipepetide frequency for Pseudoalteromonas amylolytica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.999AlaAla: 6.999 ± 0.09
1.05AlaCys: 1.05 ± 0.028
4.442AlaAsp: 4.442 ± 0.065
4.453AlaGlu: 4.453 ± 0.068
3.431AlaPhe: 3.431 ± 0.053
5.488AlaGly: 5.488 ± 0.073
2.002AlaHis: 2.002 ± 0.044
5.765AlaIle: 5.765 ± 0.077
5.417AlaLys: 5.417 ± 0.074
10.219AlaLeu: 10.219 ± 0.091
2.418AlaMet: 2.418 ± 0.042
3.911AlaAsn: 3.911 ± 0.058
2.928AlaPro: 2.928 ± 0.054
5.387AlaGln: 5.387 ± 0.071
3.505AlaArg: 3.505 ± 0.055
5.46AlaSer: 5.46 ± 0.065
4.461AlaThr: 4.461 ± 0.075
5.467AlaVal: 5.467 ± 0.067
0.921AlaTrp: 0.921 ± 0.028
2.58AlaTyr: 2.58 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.897CysAla: 0.897 ± 0.026
0.158CysCys: 0.158 ± 0.011
0.706CysAsp: 0.706 ± 0.024
0.716CysGlu: 0.716 ± 0.026
0.506CysPhe: 0.506 ± 0.02
0.824CysGly: 0.824 ± 0.027
0.348CysHis: 0.348 ± 0.016
0.665CysIle: 0.665 ± 0.022
0.523CysLys: 0.523 ± 0.021
1.03CysLeu: 1.03 ± 0.03
0.229CysMet: 0.229 ± 0.013
0.441CysAsn: 0.441 ± 0.017
0.41CysPro: 0.41 ± 0.02
0.569CysGln: 0.569 ± 0.025
0.454CysArg: 0.454 ± 0.019
0.725CysSer: 0.725 ± 0.023
0.562CysThr: 0.562 ± 0.02
0.73CysVal: 0.73 ± 0.025
0.133CysTrp: 0.133 ± 0.01
0.397CysTyr: 0.397 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
4.564AspAla: 4.564 ± 0.061
0.549AspCys: 0.549 ± 0.022
3.261AspAsp: 3.261 ± 0.059
4.078AspGlu: 4.078 ± 0.061
2.506AspPhe: 2.506 ± 0.049
3.79AspGly: 3.79 ± 0.076
1.056AspHis: 1.056 ± 0.03
4.122AspIle: 4.122 ± 0.058
3.419AspLys: 3.419 ± 0.055
5.118AspLeu: 5.118 ± 0.063
1.391AspMet: 1.391 ± 0.034
2.759AspAsn: 2.759 ± 0.051
1.885AspPro: 1.885 ± 0.039
1.741AspGln: 1.741 ± 0.034
1.832AspArg: 1.832 ± 0.038
3.493AspSer: 3.493 ± 0.065
2.895AspThr: 2.895 ± 0.063
4.061AspVal: 4.061 ± 0.054
0.819AspTrp: 0.819 ± 0.024
2.173AspTyr: 2.173 ± 0.043
0.0AspXaa: 0.0 ± 0.0
Glu
4.768GluAla: 4.768 ± 0.061
0.559GluCys: 0.559 ± 0.023
2.709GluAsp: 2.709 ± 0.052
3.082GluGlu: 3.082 ± 0.061
2.618GluPhe: 2.618 ± 0.047
3.184GluGly: 3.184 ± 0.052
1.991GluHis: 1.991 ± 0.044
3.457GluIle: 3.457 ± 0.05
3.423GluLys: 3.423 ± 0.061
7.355GluLeu: 7.355 ± 0.083
1.35GluMet: 1.35 ± 0.03
2.453GluAsn: 2.453 ± 0.045
2.001GluPro: 2.001 ± 0.049
5.408GluGln: 5.408 ± 0.074
3.188GluArg: 3.188 ± 0.053
3.433GluSer: 3.433 ± 0.053
2.58GluThr: 2.58 ± 0.04
4.29GluVal: 4.29 ± 0.064
0.534GluTrp: 0.534 ± 0.021
1.91GluTyr: 1.91 ± 0.036
0.0GluXaa: 0.0 ± 0.0
Phe
3.976PheAla: 3.976 ± 0.061
0.51PheCys: 0.51 ± 0.02
2.956PheAsp: 2.956 ± 0.05
2.741PheGlu: 2.741 ± 0.048
1.632PhePhe: 1.632 ± 0.036
2.917PheGly: 2.917 ± 0.05
0.788PheHis: 0.788 ± 0.025
2.774PheIle: 2.774 ± 0.045
2.221PheLys: 2.221 ± 0.04
3.215PheLeu: 3.215 ± 0.054
0.988PheMet: 0.988 ± 0.028
2.274PheAsn: 2.274 ± 0.036
1.272PhePro: 1.272 ± 0.031
1.128PheGln: 1.128 ± 0.029
1.39PheArg: 1.39 ± 0.027
3.573PheSer: 3.573 ± 0.058
2.385PheThr: 2.385 ± 0.048
2.901PheVal: 2.901 ± 0.052
0.497PheTrp: 0.497 ± 0.019
1.481PheTyr: 1.481 ± 0.036
0.0PheXaa: 0.0 ± 0.0
Gly
5.041GlyAla: 5.041 ± 0.073
0.837GlyCys: 0.837 ± 0.026
3.578GlyAsp: 3.578 ± 0.086
4.075GlyGlu: 4.075 ± 0.054
3.119GlyPhe: 3.119 ± 0.05
4.293GlyGly: 4.293 ± 0.087
1.58GlyHis: 1.58 ± 0.039
4.112GlyIle: 4.112 ± 0.063
3.707GlyLys: 3.707 ± 0.062
6.432GlyLeu: 6.432 ± 0.079
1.683GlyMet: 1.683 ± 0.038
2.632GlyAsn: 2.632 ± 0.069
1.597GlyPro: 1.597 ± 0.035
2.942GlyGln: 2.942 ± 0.048
2.681GlyArg: 2.681 ± 0.044
3.912GlySer: 3.912 ± 0.065
3.369GlyThr: 3.369 ± 0.076
4.957GlyVal: 4.957 ± 0.071
0.841GlyTrp: 0.841 ± 0.026
2.486GlyTyr: 2.486 ± 0.044
0.0GlyXaa: 0.0 ± 0.0
His
1.773HisAla: 1.773 ± 0.039
0.37HisCys: 0.37 ± 0.019
1.223HisAsp: 1.223 ± 0.04
1.292HisGlu: 1.292 ± 0.034
1.194HisPhe: 1.194 ± 0.03
1.634HisGly: 1.634 ± 0.035
0.83HisHis: 0.83 ± 0.03
1.652HisIle: 1.652 ± 0.031
1.233HisLys: 1.233 ± 0.032
2.414HisLeu: 2.414 ± 0.041
0.52HisMet: 0.52 ± 0.016
1.121HisAsn: 1.121 ± 0.026
1.148HisPro: 1.148 ± 0.03
1.391HisGln: 1.391 ± 0.034
0.983HisArg: 0.983 ± 0.028
1.708HisSer: 1.708 ± 0.043
1.328HisThr: 1.328 ± 0.031
1.461HisVal: 1.461 ± 0.035
0.405HisTrp: 0.405 ± 0.017
0.989HisTyr: 0.989 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.114IleAla: 6.114 ± 0.065
0.698IleCys: 0.698 ± 0.02
4.169IleAsp: 4.169 ± 0.059
4.604IleGlu: 4.604 ± 0.061
2.138IlePhe: 2.138 ± 0.041
3.997IleGly: 3.997 ± 0.063
1.297IleHis: 1.297 ± 0.032
3.125IleIle: 3.125 ± 0.051
3.594IleLys: 3.594 ± 0.058
4.931IleLeu: 4.931 ± 0.076
1.204IleMet: 1.204 ± 0.034
3.271IleAsn: 3.271 ± 0.049
2.203IlePro: 2.203 ± 0.04
2.129IleGln: 2.129 ± 0.036
2.505IleArg: 2.505 ± 0.045
4.474IleSer: 4.474 ± 0.052
3.663IleThr: 3.663 ± 0.066
3.833IleVal: 3.833 ± 0.057
0.665IleTrp: 0.665 ± 0.025
1.837IleTyr: 1.837 ± 0.038
0.0IleXaa: 0.0 ± 0.0
Lys
5.009LysAla: 5.009 ± 0.07
0.368LysCys: 0.368 ± 0.019
2.749LysAsp: 2.749 ± 0.053
3.237LysGlu: 3.237 ± 0.059
1.656LysPhe: 1.656 ± 0.036
3.212LysGly: 3.212 ± 0.052
1.578LysHis: 1.578 ± 0.035
2.829LysIle: 2.829 ± 0.045
2.922LysLys: 2.922 ± 0.056
5.847LysLeu: 5.847 ± 0.067
1.228LysMet: 1.228 ± 0.031
2.207LysAsn: 2.207 ± 0.039
2.299LysPro: 2.299 ± 0.048
3.684LysGln: 3.684 ± 0.051
2.832LysArg: 2.832 ± 0.055
3.396LysSer: 3.396 ± 0.058
2.869LysThr: 2.869 ± 0.047
4.063LysVal: 4.063 ± 0.056
0.568LysTrp: 0.568 ± 0.023
1.536LysTyr: 1.536 ± 0.035
0.0LysXaa: 0.0 ± 0.0
Leu
10.01LeuAla: 10.01 ± 0.097
1.284LeuCys: 1.284 ± 0.03
6.009LeuAsp: 6.009 ± 0.076
6.285LeuGlu: 6.285 ± 0.075
4.251LeuPhe: 4.251 ± 0.069
6.535LeuGly: 6.535 ± 0.078
2.22LeuHis: 2.22 ± 0.043
5.845LeuIle: 5.845 ± 0.08
5.738LeuLys: 5.738 ± 0.076
11.009LeuLeu: 11.009 ± 0.131
2.438LeuMet: 2.438 ± 0.043
5.088LeuAsn: 5.088 ± 0.062
4.616LeuPro: 4.616 ± 0.059
4.377LeuGln: 4.377 ± 0.061
4.193LeuArg: 4.193 ± 0.062
8.651LeuSer: 8.651 ± 0.085
6.113LeuThr: 6.113 ± 0.066
6.852LeuVal: 6.852 ± 0.074
1.021LeuTrp: 1.021 ± 0.034
3.015LeuTyr: 3.015 ± 0.055
0.0LeuXaa: 0.0 ± 0.0
Met
2.299MetAla: 2.299 ± 0.044
0.217MetCys: 0.217 ± 0.014
1.081MetAsp: 1.081 ± 0.028
1.04MetGlu: 1.04 ± 0.028
0.874MetPhe: 0.874 ± 0.025
1.458MetGly: 1.458 ± 0.03
0.508MetHis: 0.508 ± 0.019
1.259MetIle: 1.259 ± 0.029
1.21MetLys: 1.21 ± 0.029
2.706MetLeu: 2.706 ± 0.046
0.646MetMet: 0.646 ± 0.024
0.991MetAsn: 0.991 ± 0.025
1.114MetPro: 1.114 ± 0.035
1.241MetGln: 1.241 ± 0.031
1.149MetArg: 1.149 ± 0.033
1.893MetSer: 1.893 ± 0.04
1.394MetThr: 1.394 ± 0.03
1.55MetVal: 1.55 ± 0.032
0.226MetTrp: 0.226 ± 0.014
0.577MetTyr: 0.577 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
3.816AsnAla: 3.816 ± 0.052
0.455AsnCys: 0.455 ± 0.018
2.572AsnAsp: 2.572 ± 0.062
2.743AsnGlu: 2.743 ± 0.048
1.632AsnPhe: 1.632 ± 0.029
3.036AsnGly: 3.036 ± 0.058
1.077AsnHis: 1.077 ± 0.03
2.889AsnIle: 2.889 ± 0.043
2.485AsnLys: 2.485 ± 0.049
4.161AsnLeu: 4.161 ± 0.05
1.067AsnMet: 1.067 ± 0.027
2.29AsnAsn: 2.29 ± 0.041
1.947AsnPro: 1.947 ± 0.037
2.244AsnGln: 2.244 ± 0.042
1.949AsnArg: 1.949 ± 0.038
2.868AsnSer: 2.868 ± 0.047
2.731AsnThr: 2.731 ± 0.049
2.82AsnVal: 2.82 ± 0.044
0.615AsnTrp: 0.615 ± 0.022
1.544AsnTyr: 1.544 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
2.792ProAla: 2.792 ± 0.053
0.351ProCys: 0.351 ± 0.016
2.318ProAsp: 2.318 ± 0.045
2.78ProGlu: 2.78 ± 0.052
1.671ProPhe: 1.671 ± 0.036
1.96ProGly: 1.96 ± 0.042
0.947ProHis: 0.947 ± 0.028
2.37ProIle: 2.37 ± 0.041
2.093ProLys: 2.093 ± 0.04
3.885ProLeu: 3.885 ± 0.063
0.891ProMet: 0.891 ± 0.025
1.798ProAsn: 1.798 ± 0.037
1.035ProPro: 1.035 ± 0.033
1.808ProGln: 1.808 ± 0.036
1.21ProArg: 1.21 ± 0.032
2.606ProSer: 2.606 ± 0.038
2.011ProThr: 2.011 ± 0.042
2.772ProVal: 2.772 ± 0.049
0.49ProTrp: 0.49 ± 0.021
1.334ProTyr: 1.334 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
5.063GlnAla: 5.063 ± 0.077
0.511GlnCys: 0.511 ± 0.019
2.301GlnAsp: 2.301 ± 0.039
2.455GlnGlu: 2.455 ± 0.046
2.083GlnPhe: 2.083 ± 0.039
3.481GlnGly: 3.481 ± 0.052
1.497GlnHis: 1.497 ± 0.039
2.775GlnIle: 2.775 ± 0.046
2.293GlnLys: 2.293 ± 0.042
6.503GlnLeu: 6.503 ± 0.082
1.075GlnMet: 1.075 ± 0.03
1.892GlnAsn: 1.892 ± 0.036
1.862GlnPro: 1.862 ± 0.047
4.196GlnGln: 4.196 ± 0.09
2.389GlnArg: 2.389 ± 0.043
3.557GlnSer: 3.557 ± 0.06
2.702GlnThr: 2.702 ± 0.051
3.687GlnVal: 3.687 ± 0.057
0.791GlnTrp: 0.791 ± 0.029
1.76GlnTyr: 1.76 ± 0.04
0.0GlnXaa: 0.0 ± 0.0
Arg
3.457ArgAla: 3.457 ± 0.051
0.433ArgCys: 0.433 ± 0.018
2.247ArgAsp: 2.247 ± 0.041
2.538ArgGlu: 2.538 ± 0.044
2.233ArgPhe: 2.233 ± 0.045
2.357ArgGly: 2.357 ± 0.041
1.086ArgHis: 1.086 ± 0.028
2.74ArgIle: 2.74 ± 0.055
2.115ArgLys: 2.115 ± 0.045
4.733ArgLeu: 4.733 ± 0.064
1.013ArgMet: 1.013 ± 0.027
1.768ArgAsn: 1.768 ± 0.035
1.448ArgPro: 1.448 ± 0.03
1.994ArgGln: 1.994 ± 0.039
1.953ArgArg: 1.953 ± 0.038
2.458ArgSer: 2.458 ± 0.044
2.086ArgThr: 2.086 ± 0.04
3.086ArgVal: 3.086 ± 0.053
0.542ArgTrp: 0.542 ± 0.017
1.747ArgTyr: 1.747 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
5.947SerAla: 5.947 ± 0.065
0.734SerCys: 0.734 ± 0.021
4.026SerAsp: 4.026 ± 0.065
4.442SerGlu: 4.442 ± 0.062
2.963SerPhe: 2.963 ± 0.049
4.901SerGly: 4.901 ± 0.07
1.74SerHis: 1.74 ± 0.04
4.055SerIle: 4.055 ± 0.063
3.603SerLys: 3.603 ± 0.055
7.08SerLeu: 7.08 ± 0.087
1.518SerMet: 1.518 ± 0.035
3.016SerAsn: 3.016 ± 0.048
2.31SerPro: 2.31 ± 0.04
3.553SerGln: 3.553 ± 0.055
2.625SerArg: 2.625 ± 0.042
4.693SerSer: 4.693 ± 0.077
3.467SerThr: 3.467 ± 0.064
4.744SerVal: 4.744 ± 0.066
0.826SerTrp: 0.826 ± 0.027
2.429SerTyr: 2.429 ± 0.046
0.0SerXaa: 0.0 ± 0.0
Thr
4.267ThrAla: 4.267 ± 0.076
0.55ThrCys: 0.55 ± 0.02
2.805ThrAsp: 2.805 ± 0.059
2.96ThrGlu: 2.96 ± 0.05
2.157ThrPhe: 2.157 ± 0.041
3.768ThrGly: 3.768 ± 0.066
1.485ThrHis: 1.485 ± 0.036
3.206ThrIle: 3.206 ± 0.064
2.417ThrLys: 2.417 ± 0.047
6.694ThrLeu: 6.694 ± 0.078
1.069ThrMet: 1.069 ± 0.029
2.135ThrAsn: 2.135 ± 0.041
2.7ThrPro: 2.7 ± 0.044
3.059ThrGln: 3.059 ± 0.044
2.111ThrArg: 2.111 ± 0.039
3.595ThrSer: 3.595 ± 0.066
2.956ThrThr: 2.956 ± 0.059
3.67ThrVal: 3.67 ± 0.068
0.598ThrTrp: 0.598 ± 0.021
1.563ThrTyr: 1.563 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
6.293ValAla: 6.293 ± 0.077
0.804ValCys: 0.804 ± 0.024
4.044ValAsp: 4.044 ± 0.059
4.496ValGlu: 4.496 ± 0.056
2.825ValPhe: 2.825 ± 0.046
4.245ValGly: 4.245 ± 0.059
1.324ValHis: 1.324 ± 0.032
4.391ValIle: 4.391 ± 0.063
3.763ValLys: 3.763 ± 0.062
6.885ValLeu: 6.885 ± 0.083
1.768ValMet: 1.768 ± 0.036
3.171ValAsn: 3.171 ± 0.052
2.488ValPro: 2.488 ± 0.05
2.522ValGln: 2.522 ± 0.04
2.721ValArg: 2.721 ± 0.051
5.133ValSer: 5.133 ± 0.057
4.105ValThr: 4.105 ± 0.069
5.075ValVal: 5.075 ± 0.066
0.723ValTrp: 0.723 ± 0.022
2.066ValTyr: 2.066 ± 0.038
0.0ValXaa: 0.0 ± 0.0
Trp
0.74TrpAla: 0.74 ± 0.022
0.142TrpCys: 0.142 ± 0.01
0.54TrpAsp: 0.54 ± 0.023
0.473TrpGlu: 0.473 ± 0.019
0.598TrpPhe: 0.598 ± 0.022
0.671TrpGly: 0.671 ± 0.028
0.449TrpHis: 0.449 ± 0.017
0.53TrpIle: 0.53 ± 0.021
0.358TrpLys: 0.358 ± 0.016
1.746TrpLeu: 1.746 ± 0.044
0.283TrpMet: 0.283 ± 0.015
0.414TrpAsn: 0.414 ± 0.019
0.472TrpPro: 0.472 ± 0.018
1.26TrpGln: 1.26 ± 0.034
0.67TrpArg: 0.67 ± 0.023
0.756TrpSer: 0.756 ± 0.029
0.429TrpThr: 0.429 ± 0.02
0.749TrpVal: 0.749 ± 0.025
0.188TrpTrp: 0.188 ± 0.012
0.434TrpTyr: 0.434 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.477TyrAla: 2.477 ± 0.041
0.442TyrCys: 0.442 ± 0.022
1.776TyrAsp: 1.776 ± 0.037
1.8TyrGlu: 1.8 ± 0.038
1.531TyrPhe: 1.531 ± 0.033
2.125TyrGly: 2.125 ± 0.041
0.865TyrHis: 0.865 ± 0.031
1.892TyrIle: 1.892 ± 0.037
1.554TyrLys: 1.554 ± 0.038
3.588TyrLeu: 3.588 ± 0.058
0.652TyrMet: 0.652 ± 0.025
1.309TyrAsn: 1.309 ± 0.032
1.387TyrPro: 1.387 ± 0.032
2.238TyrGln: 2.238 ± 0.04
1.696TyrArg: 1.696 ± 0.037
2.289TyrSer: 2.289 ± 0.043
1.661TyrThr: 1.661 ± 0.038
2.074TyrVal: 2.074 ± 0.04
0.501TyrTrp: 0.501 ± 0.018
1.175TyrTyr: 1.175 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4015 proteins (1409126 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski