Amino acid dipeptide frequency for Streptomyces sp. Amel2xC10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
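
As a minimal sketch of what such a calculation could look like, the snippet below counts overlapping residue pairs inside each protein and reports them in per mille. It assumes one-letter amino acid codes and normalisation by the total number of pairs; the function name `dipeptide_per_mille` and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping windows of length 2 and never
    span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical): pairs are MA, AA, AL and AA, AG.
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"])
```

On real data the input would be the full list of protein sequences of the proteome rather than the two toy strings used here.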

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
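
The comparison against the uniform expectation can be sketched as follows. The three observed values are copied from the table; the strict comparison against 1/400 and the helper names `representation` and `fold_change` are assumptions made for illustration, since the exact colouring rule of the page is not stated.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def representation(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


def fold_change(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Ratio of observed to expected frequency."""
    return value_per_mille / expected


# Values taken from the table (per mille).
observed = {"AlaAla": 21.723, "LeuLeu": 11.562, "CysCys": 0.089}
for pair, value in observed.items():
    print(pair, representation(value), round(fold_change(value), 2))
```

With 441 possibilities (Xaa included) the expectation would instead be 1000/441, roughly 2.27‰.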

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.723 ± 0.168
AlaCys: 1.048 ± 0.024
AlaAsp: 8.589 ± 0.081
AlaGlu: 8.89 ± 0.078
AlaPhe: 3.501 ± 0.044
AlaGly: 13.206 ± 0.095
AlaHis: 3.072 ± 0.039
AlaIle: 3.162 ± 0.041
AlaLys: 2.738 ± 0.044
AlaLeu: 15.016 ± 0.116
AlaMet: 2.305 ± 0.036
AlaAsn: 1.835 ± 0.033
AlaPro: 7.508 ± 0.08
AlaGln: 3.58 ± 0.048
AlaArg: 10.93 ± 0.086
AlaSer: 5.815 ± 0.06
AlaThr: 7.029 ± 0.062
AlaVal: 12.771 ± 0.109
AlaTrp: 1.906 ± 0.027
AlaTyr: 2.807 ± 0.042
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.092 ± 0.024
CysCys: 0.089 ± 0.006
CysAsp: 0.458 ± 0.013
CysGlu: 0.384 ± 0.014
CysPhe: 0.204 ± 0.01
CysGly: 0.943 ± 0.021
CysHis: 0.198 ± 0.01
CysIle: 0.138 ± 0.007
CysLys: 0.103 ± 0.007
CysLeu: 0.723 ± 0.02
CysMet: 0.114 ± 0.006
CysAsn: 0.123 ± 0.008
CysPro: 0.505 ± 0.017
CysGln: 0.166 ± 0.009
CysArg: 0.605 ± 0.018
CysSer: 0.401 ± 0.015
CysThr: 0.483 ± 0.014
CysVal: 0.689 ± 0.017
CysTrp: 0.123 ± 0.006
CysTyr: 0.143 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.935 ± 0.067
AspCys: 0.41 ± 0.013
AspAsp: 3.978 ± 0.056
AspGlu: 3.815 ± 0.048
AspPhe: 1.626 ± 0.027
AspGly: 6.72 ± 0.062
AspHis: 1.48 ± 0.028
AspIle: 1.833 ± 0.028
AspLys: 1.172 ± 0.025
AspLeu: 6.384 ± 0.06
AspMet: 0.779 ± 0.017
AspAsn: 0.939 ± 0.021
AspPro: 4.784 ± 0.05
AspGln: 1.455 ± 0.027
AspArg: 5.06 ± 0.044
AspSer: 2.482 ± 0.032
AspThr: 3.566 ± 0.049
AspVal: 4.785 ± 0.049
AspTrp: 1.025 ± 0.022
AspTyr: 1.08 ± 0.023
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.411 ± 0.072
GluCys: 0.365 ± 0.013
GluAsp: 2.944 ± 0.038
GluGlu: 3.63 ± 0.056
GluPhe: 1.478 ± 0.026
GluGly: 4.387 ± 0.044
GluHis: 1.531 ± 0.025
GluIle: 2.269 ± 0.033
GluLys: 1.4 ± 0.032
GluLeu: 6.584 ± 0.056
GluMet: 0.843 ± 0.017
GluAsn: 1.039 ± 0.023
GluPro: 3.468 ± 0.045
GluGln: 2.042 ± 0.029
GluArg: 5.625 ± 0.06
GluSer: 2.532 ± 0.033
GluThr: 3.01 ± 0.039
GluVal: 4.43 ± 0.047
GluTrp: 0.782 ± 0.021
GluTyr: 1.119 ± 0.023
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.625 ± 0.047
PheCys: 0.252 ± 0.01
PheAsp: 1.901 ± 0.029
PheGlu: 1.427 ± 0.024
PhePhe: 0.856 ± 0.022
PheGly: 2.958 ± 0.039
PheHis: 0.602 ± 0.018
PheIle: 0.618 ± 0.019
PheLys: 0.489 ± 0.014
PheLeu: 2.586 ± 0.035
PheMet: 0.392 ± 0.015
PheAsn: 0.533 ± 0.016
PhePro: 1.366 ± 0.025
PheGln: 0.651 ± 0.016
PheArg: 1.862 ± 0.031
PheSer: 1.391 ± 0.026
PheThr: 2.013 ± 0.032
PheVal: 2.268 ± 0.029
PheTrp: 0.396 ± 0.013
PheTyr: 0.56 ± 0.017
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.298 ± 0.103
GlyCys: 0.809 ± 0.018
GlyAsp: 5.337 ± 0.05
GlyGlu: 5.119 ± 0.048
GlyPhe: 2.824 ± 0.039
GlyGly: 9.092 ± 0.102
GlyHis: 2.349 ± 0.032
GlyIle: 3.281 ± 0.041
GlyLys: 2.314 ± 0.038
GlyLeu: 9.494 ± 0.073
GlyMet: 1.889 ± 0.031
GlyAsn: 1.67 ± 0.032
GlyPro: 5.42 ± 0.063
GlyGln: 2.523 ± 0.038
GlyArg: 7.987 ± 0.065
GlySer: 5.248 ± 0.062
GlyThr: 6.736 ± 0.069
GlyVal: 7.749 ± 0.064
GlyTrp: 1.707 ± 0.028
GlyTyr: 2.202 ± 0.031
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.781 ± 0.033
HisCys: 0.214 ± 0.009
HisAsp: 1.387 ± 0.026
HisGlu: 1.191 ± 0.024
HisPhe: 0.614 ± 0.017
HisGly: 2.452 ± 0.037
HisHis: 0.705 ± 0.019
HisIle: 0.664 ± 0.02
HisLys: 0.351 ± 0.012
HisLeu: 2.47 ± 0.039
HisMet: 0.307 ± 0.011
HisAsn: 0.362 ± 0.014
HisPro: 1.853 ± 0.033
HisGln: 0.613 ± 0.019
HisArg: 2.175 ± 0.031
HisSer: 0.979 ± 0.023
HisThr: 1.445 ± 0.028
HisVal: 1.677 ± 0.027
HisTrp: 0.368 ± 0.013
HisTyr: 0.465 ± 0.013
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.517 ± 0.054
IleCys: 0.262 ± 0.011
IleAsp: 2.136 ± 0.032
IleGlu: 1.873 ± 0.032
IlePhe: 0.628 ± 0.02
IleGly: 3.412 ± 0.041
IleHis: 0.577 ± 0.017
IleIle: 0.742 ± 0.021
IleLys: 0.697 ± 0.023
IleLeu: 2.311 ± 0.036
IleMet: 0.411 ± 0.014
IleAsn: 0.609 ± 0.019
IlePro: 1.693 ± 0.03
IleGln: 0.656 ± 0.018
IleArg: 2.2 ± 0.032
IleSer: 1.494 ± 0.026
IleThr: 2.109 ± 0.033
IleVal: 2.699 ± 0.037
IleTrp: 0.351 ± 0.013
IleTyr: 0.482 ± 0.015
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.85 ± 0.049
LysCys: 0.114 ± 0.007
LysAsp: 1.352 ± 0.03
LysGlu: 1.171 ± 0.026
LysPhe: 0.44 ± 0.016
LysGly: 1.81 ± 0.037
LysHis: 0.406 ± 0.014
LysIle: 0.83 ± 0.023
LysLys: 0.851 ± 0.027
LysLeu: 1.94 ± 0.033
LysMet: 0.343 ± 0.013
LysAsn: 0.522 ± 0.017
LysPro: 1.248 ± 0.026
LysGln: 0.637 ± 0.018
LysArg: 1.358 ± 0.028
LysSer: 1.125 ± 0.027
LysThr: 1.248 ± 0.027
LysVal: 1.834 ± 0.035
LysTrp: 0.261 ± 0.012
LysTyr: 0.439 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.302 ± 0.112
LeuCys: 0.835 ± 0.023
LeuAsp: 6.895 ± 0.061
LeuGlu: 4.663 ± 0.058
LeuPhe: 2.557 ± 0.034
LeuGly: 9.326 ± 0.069
LeuHis: 2.308 ± 0.03
LeuIle: 3.181 ± 0.042
LeuLys: 2.027 ± 0.033
LeuLeu: 11.562 ± 0.108
LeuMet: 1.579 ± 0.029
LeuAsn: 1.603 ± 0.027
LeuPro: 6.635 ± 0.059
LeuGln: 2.013 ± 0.033
LeuArg: 8.813 ± 0.066
LeuSer: 5.174 ± 0.047
LeuThr: 7.263 ± 0.066
LeuVal: 8.832 ± 0.073
LeuTrp: 1.317 ± 0.031
LeuTyr: 1.904 ± 0.029
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.137 ± 0.038
MetCys: 0.123 ± 0.007
MetAsp: 0.843 ± 0.019
MetGlu: 0.704 ± 0.019
MetPhe: 0.424 ± 0.014
MetGly: 1.229 ± 0.026
MetHis: 0.338 ± 0.011
MetIle: 0.613 ± 0.016
MetLys: 0.387 ± 0.014
MetLeu: 1.504 ± 0.028
MetMet: 0.262 ± 0.012
MetAsn: 0.419 ± 0.014
MetPro: 1.048 ± 0.022
MetGln: 0.399 ± 0.015
MetArg: 1.377 ± 0.024
MetSer: 1.276 ± 0.025
MetThr: 1.55 ± 0.027
MetVal: 1.187 ± 0.027
MetTrp: 0.209 ± 0.009
MetTyr: 0.298 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.138 ± 0.031
AsnCys: 0.167 ± 0.01
AsnAsp: 0.906 ± 0.022
AsnGlu: 0.751 ± 0.018
AsnPhe: 0.447 ± 0.015
AsnGly: 1.82 ± 0.035
AsnHis: 0.363 ± 0.012
AsnIle: 0.603 ± 0.018
AsnLys: 0.393 ± 0.013
AsnLeu: 1.604 ± 0.028
AsnMet: 0.271 ± 0.011
AsnAsn: 0.411 ± 0.016
AsnPro: 1.305 ± 0.024
AsnGln: 0.463 ± 0.015
AsnArg: 1.216 ± 0.026
AsnSer: 0.888 ± 0.019
AsnThr: 1.071 ± 0.024
AsnVal: 1.323 ± 0.022
AsnTrp: 0.276 ± 0.012
AsnTyr: 0.382 ± 0.014
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.165 ± 0.092
ProCys: 0.35 ± 0.012
ProAsp: 4.701 ± 0.056
ProGlu: 4.514 ± 0.049
ProPhe: 1.515 ± 0.026
ProGly: 6.827 ± 0.075
ProHis: 1.428 ± 0.027
ProIle: 1.159 ± 0.026
ProLys: 1.139 ± 0.021
ProLeu: 5.489 ± 0.057
ProMet: 0.965 ± 0.021
ProAsn: 0.86 ± 0.02
ProPro: 3.804 ± 0.074
ProGln: 1.635 ± 0.039
ProArg: 4.25 ± 0.047
ProSer: 3.234 ± 0.054
ProThr: 3.323 ± 0.041
ProVal: 5.705 ± 0.06
ProTrp: 0.923 ± 0.02
ProTyr: 1.502 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.468 ± 0.044
GlnCys: 0.156 ± 0.008
GlnAsp: 1.393 ± 0.023
GlnGlu: 1.375 ± 0.025
GlnPhe: 0.632 ± 0.018
GlnGly: 2.244 ± 0.031
GlnHis: 0.581 ± 0.016
GlnIle: 1.043 ± 0.021
GlnLys: 0.593 ± 0.018
GlnLeu: 2.743 ± 0.04
GlnMet: 0.451 ± 0.012
GlnAsn: 0.471 ± 0.015
GlnPro: 1.549 ± 0.038
GlnGln: 1.121 ± 0.034
GlnArg: 2.245 ± 0.032
GlnSer: 1.173 ± 0.025
GlnThr: 1.291 ± 0.024
GlnVal: 2.257 ± 0.031
GlnTrp: 0.429 ± 0.015
GlnTyr: 0.565 ± 0.016
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.497 ± 0.086
ArgCys: 0.594 ± 0.015
ArgAsp: 4.405 ± 0.044
ArgGlu: 4.896 ± 0.049
ArgPhe: 2.381 ± 0.036
ArgGly: 5.932 ± 0.053
ArgHis: 2.19 ± 0.033
ArgIle: 3.205 ± 0.041
ArgLys: 1.524 ± 0.027
ArgLeu: 9.233 ± 0.066
ArgMet: 1.659 ± 0.028
ArgAsn: 1.27 ± 0.024
ArgPro: 5.387 ± 0.059
ArgGln: 2.237 ± 0.032
ArgArg: 8.196 ± 0.076
ArgSer: 3.866 ± 0.045
ArgThr: 5.677 ± 0.058
ArgVal: 6.0 ± 0.049
ArgTrp: 1.395 ± 0.025
ArgTyr: 1.793 ± 0.029
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.736 ± 0.06
SerCys: 0.382 ± 0.013
SerAsp: 2.679 ± 0.037
SerGlu: 2.338 ± 0.036
SerPhe: 1.458 ± 0.025
SerGly: 5.821 ± 0.068
SerHis: 0.973 ± 0.021
SerIle: 1.224 ± 0.026
SerLys: 0.961 ± 0.022
SerLeu: 4.694 ± 0.051
SerMet: 0.988 ± 0.021
SerAsn: 0.813 ± 0.021
SerPro: 3.166 ± 0.043
SerGln: 1.163 ± 0.019
SerArg: 3.583 ± 0.038
SerSer: 2.722 ± 0.042
SerThr: 2.971 ± 0.043
SerVal: 4.241 ± 0.051
SerTrp: 0.842 ± 0.019
SerTyr: 1.257 ± 0.025
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.499 ± 0.086
ThrCys: 0.438 ± 0.014
ThrAsp: 3.868 ± 0.046
ThrGlu: 3.389 ± 0.043
ThrPhe: 1.597 ± 0.029
ThrGly: 6.948 ± 0.062
ThrHis: 1.248 ± 0.026
ThrIle: 1.587 ± 0.028
ThrLys: 1.143 ± 0.027
ThrLeu: 5.972 ± 0.054
ThrMet: 0.886 ± 0.02
ThrAsn: 0.934 ± 0.024
ThrPro: 4.355 ± 0.052
ThrGln: 1.249 ± 0.025
ThrArg: 4.109 ± 0.042
ThrSer: 3.05 ± 0.04
ThrThr: 4.068 ± 0.053
ThrVal: 6.351 ± 0.067
ThrTrp: 0.962 ± 0.02
ThrTyr: 1.395 ± 0.032
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.71 ± 0.077
ValCys: 0.763 ± 0.019
ValAsp: 5.067 ± 0.048
ValGlu: 4.742 ± 0.052
ValPhe: 2.453 ± 0.037
ValGly: 6.532 ± 0.062
ValHis: 1.993 ± 0.034
ValIle: 2.69 ± 0.039
ValLys: 1.73 ± 0.034
ValLeu: 9.611 ± 0.072
ValMet: 1.363 ± 0.024
ValAsn: 1.6 ± 0.032
ValPro: 5.511 ± 0.057
ValGln: 2.013 ± 0.033
ValArg: 7.583 ± 0.064
ValSer: 4.328 ± 0.049
ValThr: 5.936 ± 0.058
ValVal: 8.255 ± 0.082
ValTrp: 1.171 ± 0.021
ValTyr: 1.613 ± 0.027
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.7 ± 0.028
TrpCys: 0.152 ± 0.008
TrpAsp: 0.826 ± 0.018
TrpGlu: 0.753 ± 0.018
TrpPhe: 0.499 ± 0.016
TrpGly: 1.067 ± 0.024
TrpHis: 0.364 ± 0.014
TrpIle: 0.554 ± 0.015
TrpLys: 0.364 ± 0.015
TrpLeu: 1.786 ± 0.03
TrpMet: 0.271 ± 0.012
TrpAsn: 0.417 ± 0.014
TrpPro: 0.806 ± 0.018
TrpGln: 0.594 ± 0.017
TrpArg: 1.354 ± 0.028
TrpSer: 0.914 ± 0.019
TrpThr: 1.046 ± 0.02
TrpVal: 0.962 ± 0.023
TrpTrp: 0.344 ± 0.012
TrpTyr: 0.365 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.848 ± 0.037
TyrCys: 0.165 ± 0.009
TyrAsp: 1.643 ± 0.033
TyrGlu: 1.261 ± 0.026
TyrPhe: 0.612 ± 0.016
TyrGly: 2.326 ± 0.036
TyrHis: 0.368 ± 0.012
TyrIle: 0.466 ± 0.015
TyrLys: 0.403 ± 0.014
TyrLeu: 2.08 ± 0.031
TyrMet: 0.249 ± 0.01
TyrAsn: 0.367 ± 0.013
TyrPro: 1.074 ± 0.022
TyrGln: 0.568 ± 0.018
TyrArg: 1.807 ± 0.027
TyrSer: 0.924 ± 0.019
TyrThr: 1.211 ± 0.024
TyrVal: 1.65 ± 0.028
TyrTrp: 0.35 ± 0.013
TyrTyr: 0.435 ± 0.016
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7161 proteins (2358681 amino acids)
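
As a rough illustration of how the per mille values relate to these totals, the sketch below converts a table entry back into an approximate dipeptide count. It assumes that every protein of length n contributes n - 1 overlapping pairs, so the number of dipeptide positions is the residue count minus the protein count; this normalisation is an assumption and is not stated on the page. The function name `estimated_count` is hypothetical.

```python
N_PROTEINS = 7161        # from the statistics line
N_RESIDUES = 2358681     # from the statistics line

# Assumption: each protein of length n yields n - 1 overlapping pairs.
N_DIPEPTIDES = N_RESIDUES - N_PROTEINS


def estimated_count(value_per_mille, n_pairs=N_DIPEPTIDES):
    """Convert a per mille frequency into an approximate number of pairs."""
    return value_per_mille * 1e-3 * n_pairs


# AlaAla is listed as 21.723 per mille in the table.
print(N_DIPEPTIDES)
print(round(estimated_count(21.723)))
```

The result is only an estimate, because the table values are rounded to three decimal places.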

Note: The error has been estimated by bootstrapping (x100) at the protein level.
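
A protein-level bootstrap of this kind could be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values serves as the error. The use of the sample standard deviation, the fixed seed, and the function name `bootstrap_error` are assumptions for illustration; the database's actual implementation is not described here.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    # Count overlapping pairs once per protein, then resample the counters.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy proteome (hypothetical sequences in one-letter code).
toy = ["MAAL", "AAG", "MKV", "AALAA", "GG"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins keeps the dependence between neighbouring dipeptides of the same sequence intact, which resampling individual dipeptides would not.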

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski