Amino acid dipeptide frequency for Streptomyces niveus NCIMB 11891

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (20 × 20), or 441 (21 × 21) if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids. Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
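The calculation described above can be sketched in a few lines of Python. This is only an illustration and not the Proteome-pI implementation: the function names are hypothetical, and it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that any non-standard letter is mapped to X (Xaa), and that frequencies are normalised by the total number of pairs.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            # Assumption: anything outside the 20 standard letters becomes X (Xaa).
            pair = "".join(c if c in STANDARD else "X" for c in pair)
            counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_permille > expected:
        return "over"   # would be marked in red
    if freq_permille < expected:
        return "under"  # would be marked in blue
    return "expected"
```

With the values from the table, classify(20.381) for AlaAla gives "over" and classify(0.084) for CysCys gives "under".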

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
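As a small worked example of the unit conversion, the helpers below (hypothetical names, written only for illustration) turn a per mille value from the table into a fraction or a percentage.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 20.381 per mille, i.e. a fraction of about 0.020381
# (about 2.04%) of all dipeptides.
alaala_fraction = permille_to_fraction(20.381)
alaala_percent = permille_to_percent(20.381)
```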

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
20.381AlaAla: 20.381 ± 0.158
0.951AlaCys: 0.951 ± 0.02
8.651AlaAsp: 8.651 ± 0.076
8.507AlaGlu: 8.507 ± 0.076
3.435AlaPhe: 3.435 ± 0.039
13.122AlaGly: 13.122 ± 0.08
2.733AlaHis: 2.733 ± 0.036
3.509AlaIle: 3.509 ± 0.04
2.904AlaLys: 2.904 ± 0.049
14.359AlaLeu: 14.359 ± 0.108
2.48AlaMet: 2.48 ± 0.03
2.039AlaAsn: 2.039 ± 0.033
6.983AlaPro: 6.983 ± 0.064
3.483AlaGln: 3.483 ± 0.042
9.975AlaArg: 9.975 ± 0.09
6.054AlaSer: 6.054 ± 0.051
7.236AlaThr: 7.236 ± 0.069
12.308AlaVal: 12.308 ± 0.088
1.795AlaTrp: 1.795 ± 0.025
2.737AlaTyr: 2.737 ± 0.032
0.0AlaXaa: 0.0 ± 0.0
Cys
0.995CysAla: 0.995 ± 0.022
0.084CysCys: 0.084 ± 0.006
0.458CysAsp: 0.458 ± 0.013
0.401CysGlu: 0.401 ± 0.012
0.199CysPhe: 0.199 ± 0.009
0.888CysGly: 0.888 ± 0.019
0.195CysHis: 0.195 ± 0.009
0.14CysIle: 0.14 ± 0.007
0.103CysLys: 0.103 ± 0.007
0.684CysLeu: 0.684 ± 0.018
0.111CysMet: 0.111 ± 0.007
0.125CysAsn: 0.125 ± 0.006
0.444CysPro: 0.444 ± 0.015
0.165CysGln: 0.165 ± 0.008
0.571CysArg: 0.571 ± 0.015
0.403CysSer: 0.403 ± 0.014
0.42CysThr: 0.42 ± 0.013
0.669CysVal: 0.669 ± 0.018
0.109CysTrp: 0.109 ± 0.007
0.145CysTyr: 0.145 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
7.863AspAla: 7.863 ± 0.063
0.409AspCys: 0.409 ± 0.014
3.83AspAsp: 3.83 ± 0.047
4.11AspGlu: 4.11 ± 0.051
1.722AspPhe: 1.722 ± 0.024
6.649AspGly: 6.649 ± 0.066
1.465AspHis: 1.465 ± 0.025
2.083AspIle: 2.083 ± 0.031
1.289AspLys: 1.289 ± 0.029
6.102AspLeu: 6.102 ± 0.054
0.845AspMet: 0.845 ± 0.018
1.092AspAsn: 1.092 ± 0.021
4.548AspPro: 4.548 ± 0.045
1.584AspGln: 1.584 ± 0.025
4.824AspArg: 4.824 ± 0.044
2.882AspSer: 2.882 ± 0.032
3.448AspThr: 3.448 ± 0.038
4.73AspVal: 4.73 ± 0.044
1.024AspTrp: 1.024 ± 0.019
1.095AspTyr: 1.095 ± 0.022
0.0AspXaa: 0.0 ± 0.0
Glu
7.121GluAla: 7.121 ± 0.069
0.364GluCys: 0.364 ± 0.012
2.662GluAsp: 2.662 ± 0.035
3.342GluGlu: 3.342 ± 0.042
1.487GluPhe: 1.487 ± 0.023
4.214GluGly: 4.214 ± 0.044
1.455GluHis: 1.455 ± 0.021
2.34GluIle: 2.34 ± 0.034
1.472GluLys: 1.472 ± 0.03
6.905GluLeu: 6.905 ± 0.063
0.918GluMet: 0.918 ± 0.019
1.042GluAsn: 1.042 ± 0.022
3.439GluPro: 3.439 ± 0.041
2.122GluGln: 2.122 ± 0.032
5.607GluArg: 5.607 ± 0.064
2.959GluSer: 2.959 ± 0.037
3.02GluThr: 3.02 ± 0.038
4.191GluVal: 4.191 ± 0.045
0.791GluTrp: 0.791 ± 0.019
1.15GluTyr: 1.15 ± 0.022
0.0GluXaa: 0.0 ± 0.0
Phe
3.684PheAla: 3.684 ± 0.045
0.247PheCys: 0.247 ± 0.01
2.061PheAsp: 2.061 ± 0.029
1.415PheGlu: 1.415 ± 0.028
0.887PhePhe: 0.887 ± 0.02
3.056PheGly: 3.056 ± 0.035
0.599PheHis: 0.599 ± 0.014
0.777PheIle: 0.777 ± 0.018
0.529PheLys: 0.529 ± 0.016
2.537PheLeu: 2.537 ± 0.037
0.408PheMet: 0.408 ± 0.012
0.57PheAsn: 0.57 ± 0.018
1.385PhePro: 1.385 ± 0.024
0.693PheGln: 0.693 ± 0.017
1.753PheArg: 1.753 ± 0.029
1.483PheSer: 1.483 ± 0.027
2.087PheThr: 2.087 ± 0.029
2.209PheVal: 2.209 ± 0.03
0.421PheTrp: 0.421 ± 0.014
0.559PheTyr: 0.559 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
11.393GlyAla: 11.393 ± 0.09
0.781GlyCys: 0.781 ± 0.019
5.461GlyAsp: 5.461 ± 0.05
5.268GlyGlu: 5.268 ± 0.047
2.856GlyPhe: 2.856 ± 0.035
9.182GlyGly: 9.182 ± 0.095
2.269GlyHis: 2.269 ± 0.032
3.546GlyIle: 3.546 ± 0.04
2.506GlyLys: 2.506 ± 0.042
9.421GlyLeu: 9.421 ± 0.069
1.957GlyMet: 1.957 ± 0.03
1.852GlyAsn: 1.852 ± 0.037
5.241GlyPro: 5.241 ± 0.052
2.708GlyGln: 2.708 ± 0.038
7.622GlyArg: 7.622 ± 0.056
5.492GlySer: 5.492 ± 0.064
6.469GlyThr: 6.469 ± 0.063
7.467GlyVal: 7.467 ± 0.06
1.7GlyTrp: 1.7 ± 0.029
2.222GlyTyr: 2.222 ± 0.031
0.0GlyXaa: 0.0 ± 0.0
His
2.6HisAla: 2.6 ± 0.033
0.201HisCys: 0.201 ± 0.009
1.369HisAsp: 1.369 ± 0.024
1.186HisGlu: 1.186 ± 0.021
0.652HisPhe: 0.652 ± 0.018
2.377HisGly: 2.377 ± 0.038
0.695HisHis: 0.695 ± 0.019
0.675HisIle: 0.675 ± 0.019
0.332HisLys: 0.332 ± 0.011
2.332HisLeu: 2.332 ± 0.037
0.325HisMet: 0.325 ± 0.012
0.376HisAsn: 0.376 ± 0.012
1.724HisPro: 1.724 ± 0.026
0.651HisGln: 0.651 ± 0.018
2.012HisArg: 2.012 ± 0.029
1.027HisSer: 1.027 ± 0.019
1.329HisThr: 1.329 ± 0.023
1.639HisVal: 1.639 ± 0.026
0.365HisTrp: 0.365 ± 0.011
0.459HisTyr: 0.459 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
4.784IleAla: 4.784 ± 0.046
0.289IleCys: 0.289 ± 0.01
2.367IleAsp: 2.367 ± 0.032
2.006IleGlu: 2.006 ± 0.032
0.749IlePhe: 0.749 ± 0.02
3.66IleGly: 3.66 ± 0.047
0.632IleHis: 0.632 ± 0.016
0.914IleIle: 0.914 ± 0.024
0.801IleLys: 0.801 ± 0.019
2.503IleLeu: 2.503 ± 0.036
0.48IleMet: 0.48 ± 0.016
0.747IleAsn: 0.747 ± 0.018
1.817IlePro: 1.817 ± 0.026
0.789IleGln: 0.789 ± 0.019
2.233IleArg: 2.233 ± 0.03
1.788IleSer: 1.788 ± 0.028
2.334IleThr: 2.334 ± 0.035
2.811IleVal: 2.811 ± 0.04
0.382IleTrp: 0.382 ± 0.012
0.546IleTyr: 0.546 ± 0.015
0.0IleXaa: 0.0 ± 0.0
Lys
2.908LysAla: 2.908 ± 0.047
0.117LysCys: 0.117 ± 0.009
1.447LysAsp: 1.447 ± 0.032
1.251LysGlu: 1.251 ± 0.023
0.464LysPhe: 0.464 ± 0.015
1.898LysGly: 1.898 ± 0.037
0.414LysHis: 0.414 ± 0.015
0.915LysIle: 0.915 ± 0.023
0.899LysLys: 0.899 ± 0.029
2.045LysLeu: 2.045 ± 0.033
0.415LysMet: 0.415 ± 0.014
0.562LysAsn: 0.562 ± 0.019
1.427LysPro: 1.427 ± 0.03
0.691LysGln: 0.691 ± 0.02
1.478LysArg: 1.478 ± 0.027
1.28LysSer: 1.28 ± 0.027
1.37LysThr: 1.37 ± 0.028
1.825LysVal: 1.825 ± 0.035
0.285LysTrp: 0.285 ± 0.012
0.463LysTyr: 0.463 ± 0.014
0.0LysXaa: 0.0 ± 0.0
Leu
14.961LeuAla: 14.961 ± 0.117
0.774LeuCys: 0.774 ± 0.019
6.731LeuAsp: 6.731 ± 0.055
4.485LeuGlu: 4.485 ± 0.052
2.591LeuPhe: 2.591 ± 0.034
9.193LeuGly: 9.193 ± 0.072
2.207LeuHis: 2.207 ± 0.037
3.478LeuIle: 3.478 ± 0.047
2.057LeuLys: 2.057 ± 0.036
11.432LeuLeu: 11.432 ± 0.106
1.686LeuMet: 1.686 ± 0.025
1.793LeuAsn: 1.793 ± 0.027
6.428LeuPro: 6.428 ± 0.057
2.048LeuGln: 2.048 ± 0.031
8.606LeuArg: 8.606 ± 0.084
5.502LeuSer: 5.502 ± 0.046
7.096LeuThr: 7.096 ± 0.053
8.798LeuVal: 8.798 ± 0.066
1.309LeuTrp: 1.309 ± 0.025
1.873LeuTyr: 1.873 ± 0.028
0.0LeuXaa: 0.0 ± 0.0
Met
2.237MetAla: 2.237 ± 0.034
0.138MetCys: 0.138 ± 0.008
0.877MetAsp: 0.877 ± 0.021
0.771MetGlu: 0.771 ± 0.019
0.469MetPhe: 0.469 ± 0.014
1.335MetGly: 1.335 ± 0.026
0.33MetHis: 0.33 ± 0.012
0.703MetIle: 0.703 ± 0.018
0.432MetLys: 0.432 ± 0.012
1.717MetLeu: 1.717 ± 0.024
0.307MetMet: 0.307 ± 0.012
0.439MetAsn: 0.439 ± 0.012
1.105MetPro: 1.105 ± 0.021
0.404MetGln: 0.404 ± 0.016
1.44MetArg: 1.44 ± 0.023
1.362MetSer: 1.362 ± 0.022
1.587MetThr: 1.587 ± 0.023
1.29MetVal: 1.29 ± 0.024
0.224MetTrp: 0.224 ± 0.01
0.328MetTyr: 0.328 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.229AsnAla: 2.229 ± 0.032
0.168AsnCys: 0.168 ± 0.008
1.008AsnAsp: 1.008 ± 0.023
0.899AsnGlu: 0.899 ± 0.018
0.519AsnPhe: 0.519 ± 0.015
2.053AsnGly: 2.053 ± 0.038
0.403AsnHis: 0.403 ± 0.013
0.661AsnIle: 0.661 ± 0.021
0.433AsnLys: 0.433 ± 0.014
1.709AsnLeu: 1.709 ± 0.029
0.312AsnMet: 0.312 ± 0.01
0.491AsnAsn: 0.491 ± 0.018
1.368AsnPro: 1.368 ± 0.025
0.532AsnGln: 0.532 ± 0.017
1.277AsnArg: 1.277 ± 0.026
1.09AsnSer: 1.09 ± 0.024
1.215AsnThr: 1.215 ± 0.028
1.398AsnVal: 1.398 ± 0.031
0.303AsnTrp: 0.303 ± 0.012
0.449AsnTyr: 0.449 ± 0.014
0.0AsnXaa: 0.0 ± 0.0
Pro
8.489ProAla: 8.489 ± 0.068
0.303ProCys: 0.303 ± 0.012
4.621ProAsp: 4.621 ± 0.049
4.228ProGlu: 4.228 ± 0.048
1.535ProPhe: 1.535 ± 0.025
6.595ProGly: 6.595 ± 0.054
1.33ProHis: 1.33 ± 0.025
1.274ProIle: 1.274 ± 0.027
1.281ProLys: 1.281 ± 0.029
5.342ProLeu: 5.342 ± 0.057
1.015ProMet: 1.015 ± 0.018
0.972ProAsn: 0.972 ± 0.021
3.607ProPro: 3.607 ± 0.054
1.666ProGln: 1.666 ± 0.03
3.89ProArg: 3.89 ± 0.044
3.158ProSer: 3.158 ± 0.046
3.253ProThr: 3.253 ± 0.038
5.571ProVal: 5.571 ± 0.054
0.899ProTrp: 0.899 ± 0.022
1.431ProTyr: 1.431 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
3.448GlnAla: 3.448 ± 0.039
0.162GlnCys: 0.162 ± 0.008
1.379GlnAsp: 1.379 ± 0.028
1.408GlnGlu: 1.408 ± 0.026
0.713GlnPhe: 0.713 ± 0.019
2.201GlnGly: 2.201 ± 0.037
0.625GlnHis: 0.625 ± 0.017
1.09GlnIle: 1.09 ± 0.025
0.583GlnLys: 0.583 ± 0.017
3.188GlnLeu: 3.188 ± 0.037
0.458GlnMet: 0.458 ± 0.014
0.515GlnAsn: 0.515 ± 0.017
1.618GlnPro: 1.618 ± 0.032
1.22GlnGln: 1.22 ± 0.037
2.324GlnArg: 2.324 ± 0.033
1.303GlnSer: 1.303 ± 0.024
1.343GlnThr: 1.343 ± 0.024
2.234GlnVal: 2.234 ± 0.031
0.448GlnTrp: 0.448 ± 0.013
0.597GlnTyr: 0.597 ± 0.015
0.0GlnXaa: 0.0 ± 0.0
Arg
9.71ArgAla: 9.71 ± 0.08
0.548ArgCys: 0.548 ± 0.015
4.407ArgAsp: 4.407 ± 0.045
4.689ArgGlu: 4.689 ± 0.046
2.287ArgPhe: 2.287 ± 0.035
5.796ArgGly: 5.796 ± 0.05
2.093ArgHis: 2.093 ± 0.028
3.221ArgIle: 3.221 ± 0.038
1.641ArgLys: 1.641 ± 0.03
8.647ArgLeu: 8.647 ± 0.082
1.715ArgMet: 1.715 ± 0.024
1.356ArgAsn: 1.356 ± 0.026
4.818ArgPro: 4.818 ± 0.051
2.266ArgGln: 2.266 ± 0.034
7.483ArgArg: 7.483 ± 0.077
4.038ArgSer: 4.038 ± 0.053
5.528ArgThr: 5.528 ± 0.051
5.811ArgVal: 5.811 ± 0.057
1.342ArgTrp: 1.342 ± 0.025
1.751ArgTyr: 1.751 ± 0.029
0.0ArgXaa: 0.0 ± 0.0
Ser
6.925SerAla: 6.925 ± 0.059
0.388SerCys: 0.388 ± 0.014
2.921SerAsp: 2.921 ± 0.034
2.471SerGlu: 2.471 ± 0.028
1.593SerPhe: 1.593 ± 0.027
6.054SerGly: 6.054 ± 0.063
1.058SerHis: 1.058 ± 0.023
1.511SerIle: 1.511 ± 0.027
1.116SerLys: 1.116 ± 0.025
5.057SerLeu: 5.057 ± 0.048
1.09SerMet: 1.09 ± 0.022
0.914SerAsn: 0.914 ± 0.02
3.251SerPro: 3.251 ± 0.043
1.27SerGln: 1.27 ± 0.024
3.812SerArg: 3.812 ± 0.043
2.89SerSer: 2.89 ± 0.04
3.251SerThr: 3.251 ± 0.043
4.469SerVal: 4.469 ± 0.045
0.939SerTrp: 0.939 ± 0.022
1.309SerTyr: 1.309 ± 0.027
0.0SerXaa: 0.0 ± 0.0
Thr
8.927ThrAla: 8.927 ± 0.071
0.369ThrCys: 0.369 ± 0.014
3.913ThrAsp: 3.913 ± 0.043
3.389ThrGlu: 3.389 ± 0.038
1.655ThrPhe: 1.655 ± 0.026
7.031ThrGly: 7.031 ± 0.062
1.212ThrHis: 1.212 ± 0.024
1.81ThrIle: 1.81 ± 0.034
1.277ThrLys: 1.277 ± 0.028
5.894ThrLeu: 5.894 ± 0.057
1.001ThrMet: 1.001 ± 0.021
1.111ThrAsn: 1.111 ± 0.023
4.158ThrPro: 4.158 ± 0.043
1.356ThrGln: 1.356 ± 0.025
3.989ThrArg: 3.989 ± 0.039
3.295ThrSer: 3.295 ± 0.037
4.074ThrThr: 4.074 ± 0.064
6.257ThrVal: 6.257 ± 0.059
0.866ThrTrp: 0.866 ± 0.02
1.381ThrTyr: 1.381 ± 0.031
0.0ThrXaa: 0.0 ± 0.0
Val
10.63ValAla: 10.63 ± 0.081
0.719ValCys: 0.719 ± 0.016
5.049ValAsp: 5.049 ± 0.051
4.686ValGlu: 4.686 ± 0.052
2.421ValPhe: 2.421 ± 0.037
6.688ValGly: 6.688 ± 0.057
1.875ValHis: 1.875 ± 0.03
2.925ValIle: 2.925 ± 0.035
1.761ValLys: 1.761 ± 0.03
9.386ValLeu: 9.386 ± 0.078
1.438ValMet: 1.438 ± 0.023
1.674ValAsn: 1.674 ± 0.027
5.227ValPro: 5.227 ± 0.048
1.985ValGln: 1.985 ± 0.031
7.157ValArg: 7.157 ± 0.06
4.367ValSer: 4.367 ± 0.043
5.687ValThr: 5.687 ± 0.062
8.008ValVal: 8.008 ± 0.07
1.125ValTrp: 1.125 ± 0.025
1.555ValTyr: 1.555 ± 0.024
0.0ValXaa: 0.0 ± 0.0
Trp
1.624TrpAla: 1.624 ± 0.025
0.136TrpCys: 0.136 ± 0.008
0.846TrpAsp: 0.846 ± 0.02
0.734TrpGlu: 0.734 ± 0.018
0.499TrpPhe: 0.499 ± 0.013
1.051TrpGly: 1.051 ± 0.024
0.361TrpHis: 0.361 ± 0.014
0.554TrpIle: 0.554 ± 0.014
0.356TrpLys: 0.356 ± 0.012
1.747TrpLeu: 1.747 ± 0.03
0.288TrpMet: 0.288 ± 0.01
0.407TrpAsn: 0.407 ± 0.014
0.803TrpPro: 0.803 ± 0.021
0.629TrpGln: 0.629 ± 0.017
1.329TrpArg: 1.329 ± 0.027
0.955TrpSer: 0.955 ± 0.025
1.013TrpThr: 1.013 ± 0.021
0.98TrpVal: 0.98 ± 0.019
0.341TrpTrp: 0.341 ± 0.013
0.353TrpTyr: 0.353 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.731TyrAla: 2.731 ± 0.034
0.158TyrCys: 0.158 ± 0.008
1.535TyrAsp: 1.535 ± 0.03
1.357TyrGlu: 1.357 ± 0.024
0.628TyrPhe: 0.628 ± 0.016
2.374TyrGly: 2.374 ± 0.032
0.372TyrHis: 0.372 ± 0.013
0.507TyrIle: 0.507 ± 0.014
0.379TyrLys: 0.379 ± 0.015
2.042TyrLeu: 2.042 ± 0.031
0.247TyrMet: 0.247 ± 0.011
0.438TyrAsn: 0.438 ± 0.016
1.069TyrPro: 1.069 ± 0.023
0.597TyrGln: 0.597 ± 0.015
1.765TyrArg: 1.765 ± 0.029
0.962TyrSer: 0.962 ± 0.022
1.206TyrThr: 1.206 ± 0.025
1.695TyrVal: 1.695 ± 0.026
0.338TyrTrp: 0.338 ± 0.012
0.424TyrTyr: 0.424 ± 0.012
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7746 proteins (2460520 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
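Bootstrapping at the protein level can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually used by Proteome-pI: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 resamples, and the error is taken to be the standard deviation across resamples. The function names and the choice of standard deviation are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`.

    Assumption: the reported error is the standard deviation over resamples.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.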

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski