Amino acid dipeptide frequency for Streptomyces sp. M41(2017)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
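As a rough illustration of the calculation described above, the following Python sketch counts dipeptides in a list of protein sequences and compares each frequency with the uniform expectation of 2.5‰. The function names are hypothetical, and two details are assumptions rather than the documented method of the database: dipeptides are counted as overlapping pairs, and any letter outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"      # 20 standard amino acids, one-letter codes
EXPECTED_PER_MILLE = 1000.0 / 400      # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            # Assumption: every non-standard or unknown letter becomes X (Xaa).
            pair = "".join(c if c in STANDARD else "X" for c in pair)
            counts[pair] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (red/blue in the table)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"
```

With the values from the table, classify(20.851) labels AlaAla as overrepresented and classify(1.029) labels AlaCys as underrepresented.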

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
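For example, the AlaAla value of 20.851‰ corresponds to a fraction of about 0.020851, or about 2.0851%. A minimal sketch of the conversion, with hypothetical helper names:

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0
```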

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.851 ± 0.143
AlaCys: 1.029 ± 0.02
AlaAsp: 8.384 ± 0.065
AlaGlu: 8.407 ± 0.083
AlaPhe: 3.601 ± 0.045
AlaGly: 13.058 ± 0.096
AlaHis: 2.933 ± 0.038
AlaIle: 3.396 ± 0.04
AlaLys: 2.924 ± 0.048
AlaLeu: 14.291 ± 0.108
AlaMet: 2.489 ± 0.036
AlaAsn: 1.918 ± 0.032
AlaPro: 7.088 ± 0.078
AlaGln: 3.727 ± 0.041
AlaArg: 10.216 ± 0.073
AlaSer: 6.332 ± 0.055
AlaThr: 7.178 ± 0.065
AlaVal: 12.362 ± 0.087
AlaTrp: 1.894 ± 0.029
AlaTyr: 2.838 ± 0.03
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.066 ± 0.02
CysCys: 0.09 ± 0.006
CysAsp: 0.445 ± 0.012
CysGlu: 0.361 ± 0.013
CysPhe: 0.207 ± 0.009
CysGly: 0.963 ± 0.02
CysHis: 0.183 ± 0.008
CysIle: 0.156 ± 0.008
CysLys: 0.111 ± 0.007
CysLeu: 0.707 ± 0.018
CysMet: 0.119 ± 0.007
CysAsn: 0.132 ± 0.009
CysPro: 0.456 ± 0.016
CysGln: 0.153 ± 0.008
CysArg: 0.559 ± 0.017
CysSer: 0.439 ± 0.012
CysThr: 0.507 ± 0.014
CysVal: 0.664 ± 0.018
CysTrp: 0.122 ± 0.008
CysTyr: 0.147 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.89 ± 0.068
AspCys: 0.415 ± 0.013
AspAsp: 3.818 ± 0.049
AspGlu: 3.843 ± 0.047
AspPhe: 1.679 ± 0.024
AspGly: 6.628 ± 0.062
AspHis: 1.475 ± 0.026
AspIle: 1.916 ± 0.028
AspLys: 1.264 ± 0.027
AspLeu: 6.372 ± 0.054
AspMet: 0.841 ± 0.016
AspAsn: 0.993 ± 0.023
AspPro: 4.479 ± 0.048
AspGln: 1.563 ± 0.028
AspArg: 5.012 ± 0.05
AspSer: 2.679 ± 0.034
AspThr: 3.477 ± 0.037
AspVal: 4.937 ± 0.051
AspTrp: 1.035 ± 0.022
AspTyr: 1.083 ± 0.023
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.093 ± 0.069
GluCys: 0.329 ± 0.011
GluAsp: 2.879 ± 0.04
GluGlu: 3.389 ± 0.047
GluPhe: 1.401 ± 0.026
GluGly: 4.294 ± 0.047
GluHis: 1.527 ± 0.025
GluIle: 2.22 ± 0.034
GluLys: 1.428 ± 0.028
GluLeu: 6.52 ± 0.063
GluMet: 0.827 ± 0.018
GluAsn: 1.025 ± 0.021
GluPro: 3.251 ± 0.041
GluGln: 2.194 ± 0.035
GluArg: 5.378 ± 0.057
GluSer: 2.61 ± 0.031
GluThr: 2.904 ± 0.038
GluVal: 4.2 ± 0.046
GluTrp: 0.708 ± 0.017
GluTyr: 1.046 ± 0.021
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.692 ± 0.042
PheCys: 0.277 ± 0.011
PheAsp: 1.984 ± 0.031
PheGlu: 1.4 ± 0.023
PhePhe: 0.892 ± 0.024
PheGly: 3.081 ± 0.037
PheHis: 0.597 ± 0.017
PheIle: 0.698 ± 0.017
PheLys: 0.536 ± 0.015
PheLeu: 2.562 ± 0.036
PheMet: 0.427 ± 0.014
PheAsn: 0.552 ± 0.016
PhePro: 1.371 ± 0.023
PheGln: 0.666 ± 0.019
PheArg: 1.808 ± 0.029
PheSer: 1.503 ± 0.028
PheThr: 2.072 ± 0.027
PheVal: 2.318 ± 0.033
PheTrp: 0.425 ± 0.015
PheTyr: 0.59 ± 0.016
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.275 ± 0.077
GlyCys: 0.834 ± 0.019
GlyAsp: 5.432 ± 0.054
GlyGlu: 4.921 ± 0.044
GlyPhe: 2.929 ± 0.036
GlyGly: 8.95 ± 0.102
GlyHis: 2.417 ± 0.038
GlyIle: 3.484 ± 0.042
GlyLys: 2.4 ± 0.041
GlyLeu: 9.486 ± 0.074
GlyMet: 1.951 ± 0.031
GlyAsn: 1.811 ± 0.031
GlyPro: 5.276 ± 0.065
GlyGln: 2.699 ± 0.04
GlyArg: 7.786 ± 0.063
GlySer: 5.696 ± 0.058
GlyThr: 6.659 ± 0.062
GlyVal: 7.671 ± 0.056
GlyTrp: 1.733 ± 0.026
GlyTyr: 2.276 ± 0.035
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.718 ± 0.042
HisCys: 0.192 ± 0.01
HisAsp: 1.413 ± 0.024
HisGlu: 1.225 ± 0.02
HisPhe: 0.653 ± 0.017
HisGly: 2.432 ± 0.034
HisHis: 0.711 ± 0.021
HisIle: 0.683 ± 0.018
HisLys: 0.353 ± 0.013
HisLeu: 2.43 ± 0.034
HisMet: 0.339 ± 0.012
HisAsn: 0.357 ± 0.012
HisPro: 1.816 ± 0.027
HisGln: 0.631 ± 0.02
HisArg: 2.067 ± 0.032
HisSer: 0.986 ± 0.021
HisThr: 1.379 ± 0.026
HisVal: 1.73 ± 0.027
HisTrp: 0.387 ± 0.013
HisTyr: 0.48 ± 0.016
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.745 ± 0.053
IleCys: 0.27 ± 0.011
IleAsp: 2.259 ± 0.031
IleGlu: 1.929 ± 0.027
IlePhe: 0.704 ± 0.018
IleGly: 3.632 ± 0.046
IleHis: 0.591 ± 0.016
IleIle: 0.878 ± 0.019
IleLys: 0.741 ± 0.019
IleLeu: 2.42 ± 0.037
IleMet: 0.444 ± 0.016
IleAsn: 0.685 ± 0.017
IlePro: 1.738 ± 0.03
IleGln: 0.715 ± 0.016
IleArg: 2.132 ± 0.027
IleSer: 1.678 ± 0.027
IleThr: 2.217 ± 0.033
IleVal: 2.785 ± 0.037
IleTrp: 0.369 ± 0.012
IleTyr: 0.525 ± 0.016
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.01 ± 0.045
LysCys: 0.11 ± 0.007
LysAsp: 1.433 ± 0.028
LysGlu: 1.179 ± 0.026
LysPhe: 0.465 ± 0.012
LysGly: 1.978 ± 0.036
LysHis: 0.412 ± 0.015
LysIle: 0.875 ± 0.022
LysLys: 0.951 ± 0.032
LysLeu: 2.015 ± 0.034
LysMet: 0.378 ± 0.013
LysAsn: 0.57 ± 0.017
LysPro: 1.313 ± 0.024
LysGln: 0.713 ± 0.019
LysArg: 1.401 ± 0.027
LysSer: 1.213 ± 0.028
LysThr: 1.31 ± 0.027
LysVal: 1.941 ± 0.034
LysTrp: 0.271 ± 0.011
LysTyr: 0.474 ± 0.014
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.666 ± 0.091
LeuCys: 0.817 ± 0.021
LeuAsp: 6.86 ± 0.063
LeuGlu: 4.474 ± 0.041
LeuPhe: 2.63 ± 0.036
LeuGly: 9.297 ± 0.081
LeuHis: 2.288 ± 0.03
LeuIle: 3.258 ± 0.039
LeuLys: 2.068 ± 0.031
LeuLeu: 11.154 ± 0.096
LeuMet: 1.638 ± 0.028
LeuAsn: 1.71 ± 0.027
LeuPro: 6.419 ± 0.056
LeuGln: 2.224 ± 0.033
LeuArg: 8.529 ± 0.066
LeuSer: 5.407 ± 0.051
LeuThr: 6.861 ± 0.057
LeuVal: 8.85 ± 0.067
LeuTrp: 1.303 ± 0.026
LeuTyr: 1.863 ± 0.03
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.221 ± 0.034
MetCys: 0.135 ± 0.007
MetAsp: 0.881 ± 0.018
MetGlu: 0.764 ± 0.019
MetPhe: 0.459 ± 0.013
MetGly: 1.312 ± 0.024
MetHis: 0.351 ± 0.013
MetIle: 0.651 ± 0.017
MetLys: 0.42 ± 0.015
MetLeu: 1.612 ± 0.027
MetMet: 0.303 ± 0.013
MetAsn: 0.472 ± 0.013
MetPro: 1.115 ± 0.023
MetGln: 0.439 ± 0.015
MetArg: 1.399 ± 0.024
MetSer: 1.297 ± 0.026
MetThr: 1.59 ± 0.026
MetVal: 1.254 ± 0.026
MetTrp: 0.217 ± 0.01
MetTyr: 0.333 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.247 ± 0.037
AsnCys: 0.163 ± 0.008
AsnAsp: 0.97 ± 0.02
AsnGlu: 0.795 ± 0.02
AsnPhe: 0.469 ± 0.015
AsnGly: 1.912 ± 0.033
AsnHis: 0.386 ± 0.013
AsnIle: 0.679 ± 0.017
AsnLys: 0.437 ± 0.014
AsnLeu: 1.624 ± 0.024
AsnMet: 0.296 ± 0.012
AsnAsn: 0.432 ± 0.015
AsnPro: 1.328 ± 0.024
AsnGln: 0.504 ± 0.016
AsnArg: 1.275 ± 0.021
AsnSer: 0.986 ± 0.023
AsnThr: 1.148 ± 0.022
AsnVal: 1.388 ± 0.027
AsnTrp: 0.319 ± 0.01
AsnTyr: 0.407 ± 0.013
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.578 ± 0.077
ProCys: 0.31 ± 0.012
ProAsp: 4.534 ± 0.048
ProGlu: 4.193 ± 0.05
ProPhe: 1.528 ± 0.028
ProGly: 6.662 ± 0.071
ProHis: 1.395 ± 0.03
ProIle: 1.228 ± 0.024
ProLys: 1.2 ± 0.026
ProLeu: 5.404 ± 0.053
ProMet: 0.975 ± 0.02
ProAsn: 0.868 ± 0.02
ProPro: 3.416 ± 0.057
ProGln: 1.712 ± 0.033
ProArg: 3.907 ± 0.051
ProSer: 3.315 ± 0.038
ProThr: 3.221 ± 0.034
ProVal: 5.497 ± 0.054
ProTrp: 0.876 ± 0.02
ProTyr: 1.41 ± 0.025
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.617 ± 0.041
GlnCys: 0.154 ± 0.008
GlnAsp: 1.468 ± 0.026
GlnGlu: 1.413 ± 0.026
GlnPhe: 0.673 ± 0.018
GlnGly: 2.405 ± 0.033
GlnHis: 0.658 ± 0.018
GlnIle: 1.079 ± 0.02
GlnLys: 0.649 ± 0.019
GlnLeu: 3.006 ± 0.039
GlnMet: 0.469 ± 0.015
GlnAsn: 0.519 ± 0.016
GlnPro: 1.652 ± 0.03
GlnGln: 1.266 ± 0.035
GlnArg: 2.317 ± 0.029
GlnSer: 1.303 ± 0.024
GlnThr: 1.327 ± 0.023
GlnVal: 2.37 ± 0.036
GlnTrp: 0.451 ± 0.015
GlnTyr: 0.622 ± 0.017
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.802 ± 0.08
ArgCys: 0.544 ± 0.016
ArgAsp: 4.346 ± 0.048
ArgGlu: 4.563 ± 0.053
ArgPhe: 2.312 ± 0.033
ArgGly: 5.829 ± 0.053
ArgHis: 2.129 ± 0.03
ArgIle: 3.147 ± 0.039
ArgLys: 1.599 ± 0.026
ArgLeu: 8.631 ± 0.073
ArgMet: 1.673 ± 0.026
ArgAsn: 1.341 ± 0.023
ArgPro: 4.947 ± 0.056
ArgGln: 2.252 ± 0.032
ArgArg: 7.607 ± 0.074
ArgSer: 4.222 ± 0.045
ArgThr: 5.592 ± 0.05
ArgVal: 5.881 ± 0.044
ArgTrp: 1.335 ± 0.024
ArgTyr: 1.722 ± 0.026
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.079 ± 0.065
SerCys: 0.414 ± 0.015
SerAsp: 2.956 ± 0.037
SerGlu: 2.438 ± 0.031
SerPhe: 1.529 ± 0.026
SerGly: 6.312 ± 0.065
SerHis: 1.028 ± 0.02
SerIle: 1.441 ± 0.025
SerLys: 1.116 ± 0.027
SerLeu: 4.952 ± 0.042
SerMet: 1.096 ± 0.022
SerAsn: 0.909 ± 0.022
SerPro: 3.214 ± 0.035
SerGln: 1.278 ± 0.021
SerArg: 3.741 ± 0.039
SerSer: 3.086 ± 0.049
SerThr: 3.285 ± 0.044
SerVal: 4.494 ± 0.051
SerTrp: 0.924 ± 0.021
SerTyr: 1.272 ± 0.021
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.143 ± 0.07
ThrCys: 0.437 ± 0.014
ThrAsp: 3.823 ± 0.04
ThrGlu: 3.262 ± 0.043
ThrPhe: 1.728 ± 0.031
ThrGly: 6.826 ± 0.059
ThrHis: 1.217 ± 0.026
ThrIle: 1.731 ± 0.032
ThrLys: 1.221 ± 0.024
ThrLeu: 5.86 ± 0.058
ThrMet: 0.951 ± 0.019
ThrAsn: 0.985 ± 0.021
ThrPro: 4.235 ± 0.049
ThrGln: 1.363 ± 0.024
ThrArg: 3.992 ± 0.042
ThrSer: 3.383 ± 0.037
ThrThr: 3.968 ± 0.057
ThrVal: 6.381 ± 0.055
ThrTrp: 0.921 ± 0.021
ThrTyr: 1.393 ± 0.027
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.75 ± 0.073
ValCys: 0.739 ± 0.019
ValAsp: 5.112 ± 0.051
ValGlu: 4.7 ± 0.048
ValPhe: 2.422 ± 0.033
ValGly: 6.725 ± 0.053
ValHis: 1.966 ± 0.032
ValIle: 2.835 ± 0.037
ValLys: 1.802 ± 0.03
ValLeu: 9.52 ± 0.074
ValMet: 1.456 ± 0.028
ValAsn: 1.667 ± 0.029
ValPro: 5.285 ± 0.051
ValGln: 2.13 ± 0.03
ValArg: 7.231 ± 0.053
ValSer: 4.454 ± 0.046
ValThr: 5.827 ± 0.056
ValVal: 8.102 ± 0.072
ValTrp: 1.186 ± 0.021
ValTyr: 1.555 ± 0.025
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.651 ± 0.027
TrpCys: 0.157 ± 0.008
TrpAsp: 0.863 ± 0.02
TrpGlu: 0.706 ± 0.02
TrpPhe: 0.521 ± 0.016
TrpGly: 1.08 ± 0.02
TrpHis: 0.359 ± 0.012
TrpIle: 0.597 ± 0.017
TrpLys: 0.384 ± 0.013
TrpLeu: 1.736 ± 0.03
TrpMet: 0.294 ± 0.01
TrpAsn: 0.407 ± 0.014
TrpPro: 0.773 ± 0.018
TrpGln: 0.598 ± 0.018
TrpArg: 1.296 ± 0.021
TrpSer: 0.988 ± 0.021
TrpThr: 1.073 ± 0.02
TrpVal: 0.968 ± 0.02
TrpTrp: 0.353 ± 0.012
TrpTyr: 0.37 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.823 ± 0.034
TyrCys: 0.172 ± 0.009
TyrAsp: 1.542 ± 0.027
TyrGlu: 1.26 ± 0.025
TyrPhe: 0.648 ± 0.015
TyrGly: 2.314 ± 0.034
TyrHis: 0.36 ± 0.011
TyrIle: 0.502 ± 0.014
TyrLys: 0.408 ± 0.014
TyrLeu: 2.013 ± 0.031
TyrMet: 0.258 ± 0.01
TyrAsn: 0.413 ± 0.013
TyrPro: 1.056 ± 0.022
TyrGln: 0.593 ± 0.016
TyrArg: 1.819 ± 0.026
TyrSer: 0.985 ± 0.023
TyrThr: 1.221 ± 0.025
TyrVal: 1.674 ± 0.023
TyrTrp: 0.345 ± 0.013
TyrTyr: 0.443 ± 0.013
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7701 proteins (2489732 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
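One way such a protein-level bootstrap could be set up is sketched below in Python: whole proteins are resampled with replacement, the per-mille frequency of one dipeptide is recomputed for each replicate, and the spread across replicates is reported. The function names, the seed, overlapping-pair counting, and the use of the standard deviation as the error are all assumptions for illustration; the source only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics


def count_pairs(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency
    across bootstrap replicates that resample whole proteins."""
    if not proteins:
        raise ValueError("need at least one protein")
    rng = random.Random(seed)
    # Per-protein (hits, total dipeptides), computed once and then resampled.
    stats = [(count_pairs(s, pair), max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(stats, k=len(stats))  # sampling with replacement
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.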

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski