Amino acid dipeptide frequency for Cupriavidus sp. USMAA2-4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
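As an illustration of the calculation described above, here is a minimal Python sketch that counts dipeptides and reports them in per mille. It is not the code used to build this table. The function names (`dipeptide_per_mille`, `expected_uniform_per_mille`, `classify`) are hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and two points are assumptions: pairs are counted only within a protein (never across protein boundaries), and "expected" means the uniform value of 1000/400 = 2.5‰.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs.

    Assumption: pairs are counted within each protein only, and any
    letter outside the 20 standard ones is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def expected_uniform_per_mille(n_residues=20):
    """Frequency each dipeptide would have if all were equally likely."""
    return 1000.0 / n_residues ** 2


def classify(value_per_mille, n_residues=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_per_mille(n_residues)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"
```

Under these assumptions, the AlaAla value from the table (21.958‰) would be classified as "over" and CysCys (0.141‰) as "under". Passing `n_residues=21` gives the expectation when Xaa is counted as a 21st amino acid.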

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.958 ± 0.162
AlaCys: 1.505 ± 0.028
AlaAsp: 7.068 ± 0.056
AlaGlu: 6.989 ± 0.074
AlaPhe: 4.297 ± 0.045
AlaGly: 13.66 ± 0.12
AlaHis: 2.693 ± 0.043
AlaIle: 5.364 ± 0.051
AlaLys: 3.103 ± 0.046
AlaLeu: 16.076 ± 0.104
AlaMet: 3.653 ± 0.043
AlaAsn: 2.753 ± 0.042
AlaPro: 7.12 ± 0.076
AlaGln: 5.892 ± 0.064
AlaArg: 11.209 ± 0.074
AlaSer: 7.017 ± 0.071
AlaThr: 6.328 ± 0.066
AlaVal: 9.507 ± 0.075
AlaTrp: 1.995 ± 0.032
AlaTyr: 2.794 ± 0.033
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.347 ± 0.024
CysCys: 0.141 ± 0.008
CysAsp: 0.521 ± 0.014
CysGlu: 0.496 ± 0.014
CysPhe: 0.329 ± 0.012
CysGly: 1.039 ± 0.023
CysHis: 0.258 ± 0.011
CysIle: 0.332 ± 0.012
CysLys: 0.177 ± 0.01
CysLeu: 0.959 ± 0.023
CysMet: 0.19 ± 0.008
CysAsn: 0.203 ± 0.008
CysPro: 0.471 ± 0.015
CysGln: 0.266 ± 0.011
CysArg: 0.688 ± 0.018
CysSer: 0.491 ± 0.016
CysThr: 0.454 ± 0.015
CysVal: 0.687 ± 0.019
CysTrp: 0.127 ± 0.007
CysTyr: 0.222 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.433 ± 0.063
AspCys: 0.465 ± 0.015
AspAsp: 2.644 ± 0.042
AspGlu: 2.858 ± 0.038
AspPhe: 1.954 ± 0.03
AspGly: 5.131 ± 0.058
AspHis: 1.093 ± 0.027
AspIle: 2.264 ± 0.037
AspLys: 1.528 ± 0.03
AspLeu: 5.163 ± 0.047
AspMet: 1.2 ± 0.022
AspAsn: 1.169 ± 0.024
AspPro: 2.95 ± 0.036
AspGln: 1.504 ± 0.025
AspArg: 3.214 ± 0.038
AspSer: 2.257 ± 0.034
AspThr: 2.505 ± 0.032
AspVal: 3.702 ± 0.044
AspTrp: 0.94 ± 0.021
AspTyr: 1.446 ± 0.026
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.656 ± 0.072
GluCys: 0.375 ± 0.013
GluAsp: 2.224 ± 0.031
GluGlu: 2.365 ± 0.044
GluPhe: 1.522 ± 0.029
GluGly: 3.565 ± 0.046
GluHis: 1.355 ± 0.026
GluIle: 2.592 ± 0.037
GluLys: 1.403 ± 0.031
GluLeu: 5.572 ± 0.053
GluMet: 1.2 ± 0.023
GluAsn: 1.099 ± 0.021
GluPro: 2.441 ± 0.039
GluGln: 2.565 ± 0.036
GluArg: 4.912 ± 0.057
GluSer: 2.272 ± 0.037
GluThr: 2.373 ± 0.031
GluVal: 3.794 ± 0.047
GluTrp: 0.684 ± 0.017
GluTyr: 1.042 ± 0.022
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.473 ± 0.044
PheCys: 0.401 ± 0.013
PheAsp: 2.348 ± 0.036
PheGlu: 1.768 ± 0.032
PhePhe: 1.246 ± 0.029
PheGly: 3.535 ± 0.044
PheHis: 0.779 ± 0.021
PheIle: 1.284 ± 0.028
PheLys: 0.84 ± 0.023
PheLeu: 3.117 ± 0.042
PheMet: 0.664 ± 0.019
PheAsn: 0.953 ± 0.018
PhePro: 1.556 ± 0.027
PheGln: 1.041 ± 0.023
PheArg: 2.115 ± 0.031
PheSer: 2.057 ± 0.031
PheThr: 1.729 ± 0.026
PheVal: 2.597 ± 0.034
PheTrp: 0.478 ± 0.017
PheTyr: 0.895 ± 0.019
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.926 ± 0.097
GlyCys: 0.901 ± 0.021
GlyAsp: 4.015 ± 0.055
GlyGlu: 4.715 ± 0.048
GlyPhe: 3.238 ± 0.036
GlyGly: 7.975 ± 0.107
GlyHis: 1.935 ± 0.031
GlyIle: 4.108 ± 0.048
GlyLys: 3.227 ± 0.04
GlyLeu: 9.161 ± 0.07
GlyMet: 2.475 ± 0.038
GlyAsn: 2.333 ± 0.047
GlyPro: 3.319 ± 0.037
GlyGln: 3.36 ± 0.04
GlyArg: 6.17 ± 0.06
GlySer: 4.899 ± 0.072
GlyThr: 4.962 ± 0.078
GlyVal: 6.548 ± 0.058
GlyTrp: 1.5 ± 0.026
GlyTyr: 2.508 ± 0.039
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.158 ± 0.042
HisCys: 0.27 ± 0.012
HisAsp: 1.177 ± 0.025
HisGlu: 1.09 ± 0.021
HisPhe: 0.883 ± 0.02
HisGly: 2.279 ± 0.033
HisHis: 0.6 ± 0.02
HisIle: 0.846 ± 0.023
HisLys: 0.445 ± 0.014
HisLeu: 2.221 ± 0.032
HisMet: 0.461 ± 0.015
HisAsn: 0.464 ± 0.015
HisPro: 1.492 ± 0.03
HisGln: 0.684 ± 0.018
HisArg: 1.503 ± 0.029
HisSer: 0.935 ± 0.019
HisThr: 0.993 ± 0.022
HisVal: 1.589 ± 0.026
HisTrp: 0.416 ± 0.014
HisTyr: 0.634 ± 0.014
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.974 ± 0.045
IleCys: 0.402 ± 0.013
IleAsp: 2.816 ± 0.032
IleGlu: 2.812 ± 0.039
IlePhe: 1.174 ± 0.025
IleGly: 4.206 ± 0.047
IleHis: 0.856 ± 0.019
IleIle: 1.308 ± 0.024
IleLys: 1.092 ± 0.022
IleLeu: 3.293 ± 0.043
IleMet: 0.627 ± 0.016
IleAsn: 1.159 ± 0.026
IlePro: 1.967 ± 0.03
IleGln: 1.216 ± 0.023
IleArg: 2.813 ± 0.039
IleSer: 2.136 ± 0.032
IleThr: 2.08 ± 0.027
IleVal: 3.434 ± 0.05
IleTrp: 0.467 ± 0.017
IleTyr: 0.908 ± 0.022
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.423 ± 0.051
LysCys: 0.123 ± 0.008
LysAsp: 1.286 ± 0.03
LysGlu: 1.267 ± 0.026
LysPhe: 0.712 ± 0.021
LysGly: 1.961 ± 0.034
LysHis: 0.532 ± 0.016
LysIle: 1.201 ± 0.024
LysLys: 0.868 ± 0.031
LysLeu: 2.985 ± 0.038
LysMet: 0.608 ± 0.017
LysAsn: 0.613 ± 0.018
LysPro: 1.654 ± 0.025
LysGln: 1.066 ± 0.024
LysArg: 1.95 ± 0.033
LysSer: 1.326 ± 0.026
LysThr: 1.532 ± 0.028
LysVal: 2.166 ± 0.036
LysTrp: 0.318 ± 0.013
LysTyr: 0.567 ± 0.018
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.92 ± 0.117
LeuCys: 1.141 ± 0.023
LeuAsp: 5.905 ± 0.062
LeuGlu: 5.152 ± 0.054
LeuPhe: 3.439 ± 0.046
LeuGly: 9.07 ± 0.078
LeuHis: 2.336 ± 0.036
LeuIle: 3.822 ± 0.044
LeuLys: 2.856 ± 0.041
LeuLeu: 11.629 ± 0.112
LeuMet: 2.29 ± 0.038
LeuAsn: 2.433 ± 0.035
LeuPro: 6.655 ± 0.065
LeuGln: 3.971 ± 0.046
LeuArg: 8.396 ± 0.074
LeuSer: 6.274 ± 0.063
LeuThr: 5.055 ± 0.055
LeuVal: 7.944 ± 0.065
LeuTrp: 1.253 ± 0.025
LeuTyr: 2.213 ± 0.031
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.975 ± 0.042
MetCys: 0.156 ± 0.009
MetAsp: 0.993 ± 0.022
MetGlu: 1.048 ± 0.024
MetPhe: 0.675 ± 0.018
MetGly: 1.562 ± 0.031
MetHis: 0.523 ± 0.015
MetIle: 0.932 ± 0.022
MetLys: 0.731 ± 0.019
MetLeu: 2.703 ± 0.038
MetMet: 0.566 ± 0.017
MetAsn: 0.69 ± 0.018
MetPro: 1.603 ± 0.028
MetGln: 1.077 ± 0.023
MetArg: 1.846 ± 0.024
MetSer: 1.52 ± 0.025
MetThr: 1.462 ± 0.025
MetVal: 1.63 ± 0.028
MetTrp: 0.194 ± 0.009
MetTyr: 0.351 ± 0.012
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.121 ± 0.046
AsnCys: 0.231 ± 0.012
AsnAsp: 1.147 ± 0.026
AsnGlu: 0.979 ± 0.02
AsnPhe: 0.805 ± 0.019
AsnGly: 2.259 ± 0.053
AsnHis: 0.458 ± 0.015
AsnIle: 1.001 ± 0.025
AsnLys: 0.622 ± 0.02
AsnLeu: 2.49 ± 0.036
AsnMet: 0.478 ± 0.014
AsnAsn: 0.619 ± 0.02
AsnPro: 1.622 ± 0.026
AsnGln: 0.792 ± 0.019
AsnArg: 1.616 ± 0.024
AsnSer: 1.06 ± 0.026
AsnThr: 1.231 ± 0.026
AsnVal: 1.878 ± 0.037
AsnTrp: 0.378 ± 0.012
AsnTyr: 0.611 ± 0.016
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.821 ± 0.077
ProCys: 0.402 ± 0.015
ProAsp: 3.287 ± 0.043
ProGlu: 3.211 ± 0.037
ProPhe: 1.796 ± 0.027
ProGly: 5.17 ± 0.05
ProHis: 1.165 ± 0.023
ProIle: 1.785 ± 0.025
ProLys: 1.191 ± 0.027
ProLeu: 5.653 ± 0.061
ProMet: 1.185 ± 0.026
ProAsn: 1.157 ± 0.021
ProPro: 2.899 ± 0.05
ProGln: 2.009 ± 0.029
ProArg: 3.346 ± 0.033
ProSer: 2.624 ± 0.033
ProThr: 2.277 ± 0.035
ProVal: 4.31 ± 0.047
ProTrp: 0.758 ± 0.02
ProTyr: 1.31 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.27 ± 0.063
GlnCys: 0.297 ± 0.011
GlnAsp: 1.695 ± 0.029
GlnGlu: 1.619 ± 0.026
GlnPhe: 1.185 ± 0.021
GlnGly: 3.074 ± 0.039
GlnHis: 0.883 ± 0.02
GlnIle: 1.604 ± 0.028
GlnLys: 0.908 ± 0.021
GlnLeu: 3.777 ± 0.047
GlnMet: 0.924 ± 0.02
GlnAsn: 0.762 ± 0.018
GlnPro: 2.296 ± 0.027
GlnGln: 1.964 ± 0.044
GlnArg: 3.352 ± 0.041
GlnSer: 1.799 ± 0.031
GlnThr: 1.69 ± 0.028
GlnVal: 2.962 ± 0.034
GlnTrp: 0.599 ± 0.018
GlnTyr: 0.845 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.516 ± 0.076
ArgCys: 0.637 ± 0.017
ArgAsp: 3.812 ± 0.049
ArgGlu: 4.456 ± 0.053
ArgPhe: 2.954 ± 0.043
ArgGly: 5.141 ± 0.053
ArgHis: 2.077 ± 0.032
ArgIle: 3.508 ± 0.039
ArgLys: 1.931 ± 0.036
ArgLeu: 8.904 ± 0.076
ArgMet: 1.887 ± 0.027
ArgAsn: 1.699 ± 0.03
ArgPro: 3.77 ± 0.049
ArgGln: 3.329 ± 0.044
ArgArg: 6.157 ± 0.072
ArgSer: 3.459 ± 0.039
ArgThr: 3.363 ± 0.041
ArgVal: 5.109 ± 0.044
ArgTrp: 1.248 ± 0.024
ArgTyr: 2.069 ± 0.032
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.784 ± 0.064
SerCys: 0.432 ± 0.013
SerAsp: 2.413 ± 0.032
SerGlu: 2.38 ± 0.031
SerPhe: 1.886 ± 0.031
SerGly: 5.467 ± 0.08
SerHis: 1.169 ± 0.023
SerIle: 2.133 ± 0.036
SerLys: 1.268 ± 0.027
SerLeu: 5.63 ± 0.054
SerMet: 1.234 ± 0.024
SerAsn: 1.274 ± 0.026
SerPro: 2.793 ± 0.033
SerGln: 1.778 ± 0.032
SerArg: 3.514 ± 0.045
SerSer: 2.799 ± 0.05
SerThr: 2.658 ± 0.038
SerVal: 3.822 ± 0.049
SerTrp: 0.706 ± 0.017
SerTyr: 1.242 ± 0.023
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.049 ± 0.064
ThrCys: 0.374 ± 0.012
ThrAsp: 2.317 ± 0.038
ThrGlu: 2.106 ± 0.032
ThrPhe: 1.683 ± 0.031
ThrGly: 4.584 ± 0.055
ThrHis: 1.063 ± 0.02
ThrIle: 2.088 ± 0.034
ThrLys: 0.93 ± 0.027
ThrLeu: 6.336 ± 0.078
ThrMet: 1.061 ± 0.023
ThrAsn: 1.056 ± 0.029
ThrPro: 3.349 ± 0.041
ThrGln: 1.67 ± 0.029
ThrArg: 3.417 ± 0.035
ThrSer: 2.388 ± 0.033
ThrThr: 2.554 ± 0.043
ThrVal: 4.322 ± 0.052
ThrTrp: 0.626 ± 0.016
ThrTyr: 1.107 ± 0.026
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.032 ± 0.066
ValCys: 0.733 ± 0.018
ValAsp: 3.848 ± 0.041
ValGlu: 4.034 ± 0.049
ValPhe: 2.65 ± 0.035
ValGly: 5.474 ± 0.052
ValHis: 1.469 ± 0.03
ValIle: 3.169 ± 0.046
ValLys: 2.078 ± 0.039
ValLeu: 8.332 ± 0.064
ValMet: 1.751 ± 0.03
ValAsn: 2.002 ± 0.04
ValPro: 4.435 ± 0.049
ValGln: 2.559 ± 0.036
ValArg: 5.513 ± 0.058
ValSer: 4.172 ± 0.045
ValThr: 3.93 ± 0.047
ValVal: 6.213 ± 0.06
ValTrp: 0.856 ± 0.017
ValTyr: 1.56 ± 0.027
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.241 ± 0.024
TrpCys: 0.158 ± 0.008
TrpAsp: 0.587 ± 0.016
TrpGlu: 0.578 ± 0.018
TrpPhe: 0.534 ± 0.016
TrpGly: 0.929 ± 0.021
TrpHis: 0.414 ± 0.014
TrpIle: 0.642 ± 0.015
TrpLys: 0.383 ± 0.015
TrpLeu: 2.157 ± 0.034
TrpMet: 0.389 ± 0.012
TrpAsn: 0.394 ± 0.013
TrpPro: 0.71 ± 0.017
TrpGln: 0.822 ± 0.018
TrpArg: 1.298 ± 0.024
TrpSer: 0.772 ± 0.019
TrpThr: 0.67 ± 0.017
TrpVal: 0.919 ± 0.019
TrpTrp: 0.25 ± 0.01
TrpTyr: 0.308 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.903 ± 0.037
TyrCys: 0.252 ± 0.012
TyrAsp: 1.316 ± 0.029
TyrGlu: 1.087 ± 0.022
TyrPhe: 0.913 ± 0.018
TyrGly: 2.192 ± 0.035
TyrHis: 0.482 ± 0.016
TyrIle: 0.757 ± 0.02
TyrLys: 0.58 ± 0.018
TyrLeu: 2.598 ± 0.034
TyrMet: 0.394 ± 0.013
TyrAsn: 0.563 ± 0.017
TyrPro: 1.259 ± 0.022
TyrGln: 0.939 ± 0.022
TyrArg: 1.997 ± 0.033
TyrSer: 1.127 ± 0.024
TyrThr: 1.233 ± 0.032
TyrVal: 1.679 ± 0.028
TyrTrp: 0.36 ± 0.013
TyrTyr: 0.662 ± 0.019
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7301 proteins (2332933 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
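The note above can be illustrated with a short Python sketch of a protein-level bootstrap. This is only a sketch under stated assumptions, not the procedure actually used for this table: the source says only that bootstrapping (x100) was done at the protein level, so treating the ± value as the standard deviation across resamples is an assumption, and the names `pair_counts` and `bootstrap_error` are hypothetical. Sequences are given in one-letter codes.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count adjacent residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Whole proteins are resampled with replacement, so residues from the
    same protein always stay together in a resample.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the spread across resamples.
    return statistics.mean(estimates), statistics.stdev(estimates)
```

For example, `bootstrap_error(sequences, "AA")` would return an estimate and error for AlaAla, where `sequences` is a list of protein strings. Resampling proteins rather than individual dipeptides keeps the within-protein correlation of residues intact in each replicate.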

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski