Amino acid dipepetide frequency for Streptomyces sp. CB01635

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.493AlaAla: 20.493 ± 0.122
1.071AlaCys: 1.071 ± 0.024
8.278AlaAsp: 8.278 ± 0.065
8.231AlaGlu: 8.231 ± 0.081
3.637AlaPhe: 3.637 ± 0.035
12.836AlaGly: 12.836 ± 0.074
2.91AlaHis: 2.91 ± 0.035
3.65AlaIle: 3.65 ± 0.043
3.102AlaLys: 3.102 ± 0.049
14.192AlaLeu: 14.192 ± 0.094
2.572AlaMet: 2.572 ± 0.029
1.945AlaAsn: 1.945 ± 0.029
7.032AlaPro: 7.032 ± 0.068
3.944AlaGln: 3.944 ± 0.043
9.774AlaArg: 9.774 ± 0.062
6.14AlaSer: 6.14 ± 0.047
7.106AlaThr: 7.106 ± 0.052
12.014AlaVal: 12.014 ± 0.08
1.85AlaTrp: 1.85 ± 0.027
2.81AlaTyr: 2.81 ± 0.038
0.0AlaXaa: 0.0 ± 0.0
Cys
1.075CysAla: 1.075 ± 0.022
0.091CysCys: 0.091 ± 0.007
0.482CysAsp: 0.482 ± 0.012
0.419CysGlu: 0.419 ± 0.013
0.216CysPhe: 0.216 ± 0.009
0.954CysGly: 0.954 ± 0.022
0.198CysHis: 0.198 ± 0.008
0.183CysIle: 0.183 ± 0.009
0.124CysLys: 0.124 ± 0.007
0.732CysLeu: 0.732 ± 0.019
0.124CysMet: 0.124 ± 0.006
0.13CysAsn: 0.13 ± 0.007
0.457CysPro: 0.457 ± 0.015
0.158CysGln: 0.158 ± 0.007
0.575CysArg: 0.575 ± 0.015
0.463CysSer: 0.463 ± 0.015
0.522CysThr: 0.522 ± 0.015
0.668CysVal: 0.668 ± 0.018
0.126CysTrp: 0.126 ± 0.008
0.151CysTyr: 0.151 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
7.801AspAla: 7.801 ± 0.064
0.411AspCys: 0.411 ± 0.014
3.72AspAsp: 3.72 ± 0.046
3.964AspGlu: 3.964 ± 0.04
1.713AspPhe: 1.713 ± 0.027
6.413AspGly: 6.413 ± 0.058
1.455AspHis: 1.455 ± 0.027
2.027AspIle: 2.027 ± 0.031
1.409AspLys: 1.409 ± 0.028
6.278AspLeu: 6.278 ± 0.051
0.841AspMet: 0.841 ± 0.018
1.014AspAsn: 1.014 ± 0.023
4.423AspPro: 4.423 ± 0.048
1.593AspGln: 1.593 ± 0.027
4.735AspArg: 4.735 ± 0.05
2.701AspSer: 2.701 ± 0.033
3.253AspThr: 3.253 ± 0.037
4.784AspVal: 4.784 ± 0.043
1.035AspTrp: 1.035 ± 0.017
1.148AspTyr: 1.148 ± 0.019
0.0AspXaa: 0.0 ± 0.0
Glu
7.041GluAla: 7.041 ± 0.068
0.349GluCys: 0.349 ± 0.012
2.703GluAsp: 2.703 ± 0.035
3.212GluGlu: 3.212 ± 0.043
1.513GluPhe: 1.513 ± 0.026
4.329GluGly: 4.329 ± 0.047
1.531GluHis: 1.531 ± 0.021
2.21GluIle: 2.21 ± 0.034
1.447GluLys: 1.447 ± 0.027
6.72GluLeu: 6.72 ± 0.055
0.872GluMet: 0.872 ± 0.017
1.049GluAsn: 1.049 ± 0.019
3.252GluPro: 3.252 ± 0.035
2.222GluGln: 2.222 ± 0.031
5.33GluArg: 5.33 ± 0.055
2.642GluSer: 2.642 ± 0.036
2.818GluThr: 2.818 ± 0.037
4.217GluVal: 4.217 ± 0.043
0.768GluTrp: 0.768 ± 0.016
1.102GluTyr: 1.102 ± 0.023
0.0GluXaa: 0.0 ± 0.0
Phe
3.815PheAla: 3.815 ± 0.038
0.271PheCys: 0.271 ± 0.009
2.001PheAsp: 2.001 ± 0.031
1.437PheGlu: 1.437 ± 0.026
0.938PhePhe: 0.938 ± 0.022
3.186PheGly: 3.186 ± 0.034
0.642PheHis: 0.642 ± 0.016
0.78PheIle: 0.78 ± 0.018
0.579PheLys: 0.579 ± 0.014
2.655PheLeu: 2.655 ± 0.035
0.459PheMet: 0.459 ± 0.011
0.642PheAsn: 0.642 ± 0.018
1.442PhePro: 1.442 ± 0.024
0.713PheGln: 0.713 ± 0.016
1.834PheArg: 1.834 ± 0.029
1.545PheSer: 1.545 ± 0.023
2.123PheThr: 2.123 ± 0.03
2.315PheVal: 2.315 ± 0.029
0.452PheTrp: 0.452 ± 0.014
0.623PheTyr: 0.623 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
11.139GlyAla: 11.139 ± 0.073
0.84GlyCys: 0.84 ± 0.019
5.342GlyAsp: 5.342 ± 0.047
5.005GlyGlu: 5.005 ± 0.044
3.008GlyPhe: 3.008 ± 0.033
8.84GlyGly: 8.84 ± 0.092
2.341GlyHis: 2.341 ± 0.031
3.733GlyIle: 3.733 ± 0.043
2.714GlyLys: 2.714 ± 0.038
9.515GlyLeu: 9.515 ± 0.062
2.016GlyMet: 2.016 ± 0.027
1.832GlyAsn: 1.832 ± 0.026
4.995GlyPro: 4.995 ± 0.055
2.741GlyGln: 2.741 ± 0.036
7.449GlyArg: 7.449 ± 0.055
5.578GlySer: 5.578 ± 0.05
6.281GlyThr: 6.281 ± 0.05
7.552GlyVal: 7.552 ± 0.057
1.692GlyTrp: 1.692 ± 0.025
2.294GlyTyr: 2.294 ± 0.03
0.0GlyXaa: 0.0 ± 0.0
His
2.728HisAla: 2.728 ± 0.036
0.219HisCys: 0.219 ± 0.009
1.432HisAsp: 1.432 ± 0.023
1.26HisGlu: 1.26 ± 0.023
0.68HisPhe: 0.68 ± 0.017
2.468HisGly: 2.468 ± 0.029
0.687HisHis: 0.687 ± 0.017
0.741HisIle: 0.741 ± 0.016
0.395HisLys: 0.395 ± 0.012
2.398HisLeu: 2.398 ± 0.027
0.363HisMet: 0.363 ± 0.013
0.413HisAsn: 0.413 ± 0.013
1.817HisPro: 1.817 ± 0.029
0.648HisGln: 0.648 ± 0.016
2.021HisArg: 2.021 ± 0.029
1.07HisSer: 1.07 ± 0.02
1.36HisThr: 1.36 ± 0.024
1.715HisVal: 1.715 ± 0.025
0.396HisTrp: 0.396 ± 0.014
0.508HisTyr: 0.508 ± 0.015
0.0HisXaa: 0.0 ± 0.0
Ile
5.051IleAla: 5.051 ± 0.05
0.307IleCys: 0.307 ± 0.012
2.366IleAsp: 2.366 ± 0.033
1.986IleGlu: 1.986 ± 0.029
0.763IlePhe: 0.763 ± 0.019
3.852IleGly: 3.852 ± 0.049
0.633IleHis: 0.633 ± 0.014
0.909IleIle: 0.909 ± 0.021
0.8IleLys: 0.8 ± 0.017
2.564IleLeu: 2.564 ± 0.038
0.475IleMet: 0.475 ± 0.013
0.758IleAsn: 0.758 ± 0.017
1.849IlePro: 1.849 ± 0.027
0.741IleGln: 0.741 ± 0.015
2.256IleArg: 2.256 ± 0.027
1.778IleSer: 1.778 ± 0.028
2.291IleThr: 2.291 ± 0.026
2.931IleVal: 2.931 ± 0.039
0.416IleTrp: 0.416 ± 0.012
0.537IleTyr: 0.537 ± 0.015
0.0IleXaa: 0.0 ± 0.0
Lys
3.19LysAla: 3.19 ± 0.04
0.123LysCys: 0.123 ± 0.007
1.555LysAsp: 1.555 ± 0.03
1.269LysGlu: 1.269 ± 0.024
0.559LysPhe: 0.559 ± 0.014
2.087LysGly: 2.087 ± 0.035
0.476LysHis: 0.476 ± 0.014
0.926LysIle: 0.926 ± 0.02
1.007LysLys: 1.007 ± 0.031
2.183LysLeu: 2.183 ± 0.031
0.423LysMet: 0.423 ± 0.015
0.597LysAsn: 0.597 ± 0.018
1.454LysPro: 1.454 ± 0.027
0.767LysGln: 0.767 ± 0.021
1.518LysArg: 1.518 ± 0.025
1.355LysSer: 1.355 ± 0.024
1.386LysThr: 1.386 ± 0.027
2.04LysVal: 2.04 ± 0.032
0.313LysTrp: 0.313 ± 0.011
0.51LysTyr: 0.51 ± 0.016
0.0LysXaa: 0.0 ± 0.0
Leu
14.658LeuAla: 14.658 ± 0.086
0.838LeuCys: 0.838 ± 0.021
6.753LeuAsp: 6.753 ± 0.06
4.472LeuGlu: 4.472 ± 0.049
2.746LeuPhe: 2.746 ± 0.036
9.312LeuGly: 9.312 ± 0.08
2.359LeuHis: 2.359 ± 0.035
3.469LeuIle: 3.469 ± 0.045
2.219LeuLys: 2.219 ± 0.031
10.894LeuLeu: 10.894 ± 0.085
1.725LeuMet: 1.725 ± 0.027
1.772LeuAsn: 1.772 ± 0.028
6.359LeuPro: 6.359 ± 0.053
2.244LeuGln: 2.244 ± 0.029
8.2LeuArg: 8.2 ± 0.065
5.448LeuSer: 5.448 ± 0.048
6.935LeuThr: 6.935 ± 0.053
8.705LeuVal: 8.705 ± 0.07
1.312LeuTrp: 1.312 ± 0.024
1.861LeuTyr: 1.861 ± 0.025
0.0LeuXaa: 0.0 ± 0.0
Met
2.314MetAla: 2.314 ± 0.032
0.164MetCys: 0.164 ± 0.008
0.908MetAsp: 0.908 ± 0.018
0.755MetGlu: 0.755 ± 0.018
0.475MetPhe: 0.475 ± 0.014
1.435MetGly: 1.435 ± 0.026
0.372MetHis: 0.372 ± 0.012
0.659MetIle: 0.659 ± 0.015
0.47MetLys: 0.47 ± 0.013
1.708MetLeu: 1.708 ± 0.026
0.319MetMet: 0.319 ± 0.012
0.473MetAsn: 0.473 ± 0.013
1.151MetPro: 1.151 ± 0.018
0.467MetGln: 0.467 ± 0.011
1.433MetArg: 1.433 ± 0.022
1.368MetSer: 1.368 ± 0.024
1.605MetThr: 1.605 ± 0.023
1.318MetVal: 1.318 ± 0.024
0.222MetTrp: 0.222 ± 0.008
0.357MetTyr: 0.357 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.258AsnAla: 2.258 ± 0.03
0.157AsnCys: 0.157 ± 0.009
1.022AsnAsp: 1.022 ± 0.02
0.853AsnGlu: 0.853 ± 0.018
0.523AsnPhe: 0.523 ± 0.016
1.917AsnGly: 1.917 ± 0.03
0.427AsnHis: 0.427 ± 0.012
0.707AsnIle: 0.707 ± 0.017
0.458AsnLys: 0.458 ± 0.013
1.734AsnLeu: 1.734 ± 0.03
0.328AsnMet: 0.328 ± 0.01
0.465AsnAsn: 0.465 ± 0.017
1.414AsnPro: 1.414 ± 0.026
0.536AsnGln: 0.536 ± 0.019
1.262AsnArg: 1.262 ± 0.022
1.042AsnSer: 1.042 ± 0.022
1.153AsnThr: 1.153 ± 0.022
1.427AsnVal: 1.427 ± 0.023
0.311AsnTrp: 0.311 ± 0.011
0.428AsnTyr: 0.428 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
8.174ProAla: 8.174 ± 0.078
0.324ProCys: 0.324 ± 0.009
4.461ProAsp: 4.461 ± 0.042
4.169ProGlu: 4.169 ± 0.042
1.583ProPhe: 1.583 ± 0.022
6.348ProGly: 6.348 ± 0.058
1.451ProHis: 1.451 ± 0.024
1.339ProIle: 1.339 ± 0.024
1.338ProLys: 1.338 ± 0.023
5.33ProLeu: 5.33 ± 0.051
1.041ProMet: 1.041 ± 0.017
0.953ProAsn: 0.953 ± 0.019
3.26ProPro: 3.26 ± 0.047
1.897ProGln: 1.897 ± 0.03
3.798ProArg: 3.798 ± 0.04
3.23ProSer: 3.23 ± 0.038
3.178ProThr: 3.178 ± 0.042
5.358ProVal: 5.358 ± 0.047
0.887ProTrp: 0.887 ± 0.019
1.431ProTyr: 1.431 ± 0.025
0.0ProXaa: 0.0 ± 0.0
Gln
3.703GlnAla: 3.703 ± 0.042
0.178GlnCys: 0.178 ± 0.008
1.489GlnAsp: 1.489 ± 0.024
1.478GlnGlu: 1.478 ± 0.026
0.778GlnPhe: 0.778 ± 0.017
2.451GlnGly: 2.451 ± 0.037
0.708GlnHis: 0.708 ± 0.015
1.137GlnIle: 1.137 ± 0.021
0.669GlnLys: 0.669 ± 0.017
3.189GlnLeu: 3.189 ± 0.032
0.537GlnMet: 0.537 ± 0.014
0.548GlnAsn: 0.548 ± 0.015
1.661GlnPro: 1.661 ± 0.033
1.301GlnGln: 1.301 ± 0.028
2.329GlnArg: 2.329 ± 0.032
1.361GlnSer: 1.361 ± 0.025
1.411GlnThr: 1.411 ± 0.025
2.41GlnVal: 2.41 ± 0.031
0.479GlnTrp: 0.479 ± 0.013
0.616GlnTyr: 0.616 ± 0.015
0.0GlnXaa: 0.0 ± 0.0
Arg
9.447ArgAla: 9.447 ± 0.067
0.564ArgCys: 0.564 ± 0.016
4.234ArgAsp: 4.234 ± 0.043
4.495ArgGlu: 4.495 ± 0.042
2.251ArgPhe: 2.251 ± 0.03
5.722ArgGly: 5.722 ± 0.046
2.071ArgHis: 2.071 ± 0.027
3.332ArgIle: 3.332 ± 0.031
1.754ArgLys: 1.754 ± 0.023
8.21ArgLeu: 8.21 ± 0.069
1.65ArgMet: 1.65 ± 0.026
1.348ArgAsn: 1.348 ± 0.026
4.655ArgPro: 4.655 ± 0.041
2.197ArgGln: 2.197 ± 0.033
7.257ArgArg: 7.257 ± 0.075
4.006ArgSer: 4.006 ± 0.039
5.313ArgThr: 5.313 ± 0.048
5.634ArgVal: 5.634 ± 0.042
1.301ArgTrp: 1.301 ± 0.025
1.758ArgTyr: 1.758 ± 0.025
0.0ArgXaa: 0.0 ± 0.0
Ser
6.861SerAla: 6.861 ± 0.055
0.415SerCys: 0.415 ± 0.013
2.894SerAsp: 2.894 ± 0.034
2.469SerGlu: 2.469 ± 0.034
1.632SerPhe: 1.632 ± 0.025
6.115SerGly: 6.115 ± 0.049
1.094SerHis: 1.094 ± 0.022
1.538SerIle: 1.538 ± 0.023
1.219SerLys: 1.219 ± 0.022
4.969SerLeu: 4.969 ± 0.051
1.173SerMet: 1.173 ± 0.022
0.923SerAsn: 0.923 ± 0.018
3.184SerPro: 3.184 ± 0.034
1.38SerGln: 1.38 ± 0.022
3.641SerArg: 3.641 ± 0.04
3.024SerSer: 3.024 ± 0.046
3.122SerThr: 3.122 ± 0.035
4.5SerVal: 4.5 ± 0.039
0.941SerTrp: 0.941 ± 0.02
1.319SerTyr: 1.319 ± 0.023
0.0SerXaa: 0.0 ± 0.0
Thr
8.716ThrAla: 8.716 ± 0.062
0.439ThrCys: 0.439 ± 0.014
3.742ThrAsp: 3.742 ± 0.037
3.196ThrGlu: 3.196 ± 0.04
1.711ThrPhe: 1.711 ± 0.025
6.588ThrGly: 6.588 ± 0.054
1.249ThrHis: 1.249 ± 0.023
1.757ThrIle: 1.757 ± 0.03
1.278ThrLys: 1.278 ± 0.024
5.852ThrLeu: 5.852 ± 0.049
0.984ThrMet: 0.984 ± 0.019
1.032ThrAsn: 1.032 ± 0.021
4.1ThrPro: 4.1 ± 0.035
1.477ThrGln: 1.477 ± 0.02
3.844ThrArg: 3.844 ± 0.038
3.292ThrSer: 3.292 ± 0.036
3.834ThrThr: 3.834 ± 0.042
6.133ThrVal: 6.133 ± 0.049
0.919ThrTrp: 0.919 ± 0.018
1.474ThrTyr: 1.474 ± 0.025
0.0ThrXaa: 0.0 ± 0.0
Val
10.66ValAla: 10.66 ± 0.066
0.764ValCys: 0.764 ± 0.017
4.966ValAsp: 4.966 ± 0.042
4.624ValGlu: 4.624 ± 0.049
2.466ValPhe: 2.466 ± 0.032
6.575ValGly: 6.575 ± 0.055
1.942ValHis: 1.942 ± 0.025
3.061ValIle: 3.061 ± 0.039
1.895ValLys: 1.895 ± 0.029
9.264ValLeu: 9.264 ± 0.072
1.474ValMet: 1.474 ± 0.023
1.682ValAsn: 1.682 ± 0.027
5.2ValPro: 5.2 ± 0.044
2.16ValGln: 2.16 ± 0.03
6.916ValArg: 6.916 ± 0.054
4.448ValSer: 4.448 ± 0.04
5.656ValThr: 5.656 ± 0.049
7.834ValVal: 7.834 ± 0.072
1.131ValTrp: 1.131 ± 0.019
1.588ValTyr: 1.588 ± 0.024
0.0ValXaa: 0.0 ± 0.0
Trp
1.625TrpAla: 1.625 ± 0.026
0.146TrpCys: 0.146 ± 0.007
0.855TrpAsp: 0.855 ± 0.017
0.714TrpGlu: 0.714 ± 0.017
0.556TrpPhe: 0.556 ± 0.014
1.129TrpGly: 1.129 ± 0.019
0.384TrpHis: 0.384 ± 0.012
0.578TrpIle: 0.578 ± 0.014
0.395TrpLys: 0.395 ± 0.013
1.773TrpLeu: 1.773 ± 0.028
0.323TrpMet: 0.323 ± 0.011
0.418TrpAsn: 0.418 ± 0.013
0.776TrpPro: 0.776 ± 0.017
0.636TrpGln: 0.636 ± 0.017
1.261TrpArg: 1.261 ± 0.022
0.947TrpSer: 0.947 ± 0.022
1.039TrpThr: 1.039 ± 0.02
0.997TrpVal: 0.997 ± 0.022
0.346TrpTrp: 0.346 ± 0.013
0.375TrpTyr: 0.375 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.839TyrAla: 2.839 ± 0.035
0.176TyrCys: 0.176 ± 0.008
1.515TyrAsp: 1.515 ± 0.031
1.318TyrGlu: 1.318 ± 0.017
0.705TyrPhe: 0.705 ± 0.018
2.352TyrGly: 2.352 ± 0.032
0.392TyrHis: 0.392 ± 0.014
0.525TyrIle: 0.525 ± 0.014
0.466TyrLys: 0.466 ± 0.014
2.122TyrLeu: 2.122 ± 0.028
0.275TyrMet: 0.275 ± 0.011
0.431TyrAsn: 0.431 ± 0.013
1.069TyrPro: 1.069 ± 0.023
0.612TyrGln: 0.612 ± 0.014
1.765TyrArg: 1.765 ± 0.025
0.974TyrSer: 0.974 ± 0.023
1.23TyrThr: 1.23 ± 0.024
1.751TyrVal: 1.751 ± 0.026
0.375TyrTrp: 0.375 ± 0.013
0.477TyrTyr: 0.477 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8519 proteins (2820413 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski