Amino acid dipeptide frequency for Aliiroseovarius sediminilitoris

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
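As a rough illustration of what such a calculation could look like, the sketch below counts overlapping pairs of adjacent residues in a list of protein sequences and reports each pair as a per mille share of all pairs. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice of overlapping windows that never span two proteins are assumptions made for this example, not a description of how the database itself computes the values.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: sequences use one-letter amino acid codes and pairs are
    counted with overlapping windows inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: "ACD" -> "AC", "CD"
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome)
freqs = dipeptide_frequencies(["AAC", "CA"])
```

With this toy input the three observed pairs (AA, AC, CA) each account for one third of all pairs.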

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
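The arithmetic behind the expected value and the unit conversion can be sketched as follows. The helper names `classify` and `per_mille_to_fraction` are hypothetical, and using the uniform expectation of 1/400 as the threshold for over- and underrepresentation is an assumption based on the description above; the page does not state the exact rule used for colouring. The three sample values are taken from the table.

```python
N_STANDARD = 20
# Uniform expectation over 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


# Values copied from the table (per mille)
table_values = {"AlaAla": 14.472, "CysCys": 0.107, "IleIle": 2.539}
labels = {name: classify(value) for name, value in table_values.items()}
ala_ala_fraction = per_mille_to_fraction(table_values["AlaAla"])
```

Under this assumption AlaAla and IleIle fall above the 2.5‰ expectation and CysCys falls below it.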

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
14.472AlaAla: 14.472 ± 0.169
1.007AlaCys: 1.007 ± 0.036
7.105AlaAsp: 7.105 ± 0.079
7.196AlaGlu: 7.196 ± 0.102
4.217AlaPhe: 4.217 ± 0.066
9.657AlaGly: 9.657 ± 0.109
2.235AlaHis: 2.235 ± 0.046
6.187AlaIle: 6.187 ± 0.091
4.168AlaLys: 4.168 ± 0.07
12.623AlaLeu: 12.623 ± 0.137
3.649AlaMet: 3.649 ± 0.07
2.957AlaAsn: 2.957 ± 0.053
5.35AlaPro: 5.35 ± 0.092
4.326AlaGln: 4.326 ± 0.077
8.014AlaArg: 8.014 ± 0.101
5.601AlaSer: 5.601 ± 0.083
5.925AlaThr: 5.925 ± 0.076
7.918AlaVal: 7.918 ± 0.101
1.381AlaTrp: 1.381 ± 0.042
2.526AlaTyr: 2.526 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
1.0CysAla: 1.0 ± 0.036
0.107CysCys: 0.107 ± 0.011
0.63CysAsp: 0.63 ± 0.028
0.437CysGlu: 0.437 ± 0.021
0.312CysPhe: 0.312 ± 0.018
0.906CysGly: 0.906 ± 0.033
0.307CysHis: 0.307 ± 0.017
0.437CysIle: 0.437 ± 0.021
0.246CysLys: 0.246 ± 0.013
0.778CysLeu: 0.778 ± 0.031
0.184CysMet: 0.184 ± 0.015
0.238CysAsn: 0.238 ± 0.014
0.541CysPro: 0.541 ± 0.026
0.255CysGln: 0.255 ± 0.017
0.479CysArg: 0.479 ± 0.023
0.437CysSer: 0.437 ± 0.02
0.439CysThr: 0.439 ± 0.024
0.588CysVal: 0.588 ± 0.023
0.101CysTrp: 0.101 ± 0.01
0.221CysTyr: 0.221 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
7.581AspAla: 7.581 ± 0.103
0.543AspCys: 0.543 ± 0.025
3.974AspAsp: 3.974 ± 0.085
3.939AspGlu: 3.939 ± 0.067
2.396AspPhe: 2.396 ± 0.045
5.885AspGly: 5.885 ± 0.088
1.487AspHis: 1.487 ± 0.043
3.504AspIle: 3.504 ± 0.064
2.086AspLys: 2.086 ± 0.047
6.489AspLeu: 6.489 ± 0.082
1.927AspMet: 1.927 ± 0.044
1.481AspAsn: 1.481 ± 0.04
3.653AspPro: 3.653 ± 0.061
2.189AspGln: 2.189 ± 0.053
4.359AspArg: 4.359 ± 0.065
2.183AspSer: 2.183 ± 0.048
3.355AspThr: 3.355 ± 0.065
4.473AspVal: 4.473 ± 0.075
1.127AspTrp: 1.127 ± 0.03
1.592AspTyr: 1.592 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
7.258GluAla: 7.258 ± 0.101
0.385GluCys: 0.385 ± 0.019
3.615GluAsp: 3.615 ± 0.072
3.267GluGlu: 3.267 ± 0.07
1.834GluPhe: 1.834 ± 0.047
4.486GluGly: 4.486 ± 0.084
1.174GluHis: 1.174 ± 0.04
3.481GluIle: 3.481 ± 0.065
2.342GluLys: 2.342 ± 0.051
5.308GluLeu: 5.308 ± 0.087
1.823GluMet: 1.823 ± 0.043
1.849GluAsn: 1.849 ± 0.043
2.32GluPro: 2.32 ± 0.052
1.982GluGln: 1.982 ± 0.052
3.917GluArg: 3.917 ± 0.068
2.083GluSer: 2.083 ± 0.048
3.609GluThr: 3.609 ± 0.064
4.333GluVal: 4.333 ± 0.069
0.676GluTrp: 0.676 ± 0.025
1.078GluTyr: 1.078 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
4.406PheAla: 4.406 ± 0.066
0.452PheCys: 0.452 ± 0.021
2.99PheAsp: 2.99 ± 0.055
2.149PheGlu: 2.149 ± 0.046
1.562PhePhe: 1.562 ± 0.045
3.846PheGly: 3.846 ± 0.064
0.801PheHis: 0.801 ± 0.029
1.785PheIle: 1.785 ± 0.045
1.08PheLys: 1.08 ± 0.033
3.634PheLeu: 3.634 ± 0.067
0.957PheMet: 0.957 ± 0.033
1.121PheAsn: 1.121 ± 0.032
1.642PhePro: 1.642 ± 0.039
1.105PheGln: 1.105 ± 0.033
2.077PheArg: 2.077 ± 0.045
2.196PheSer: 2.196 ± 0.046
2.19PheThr: 2.19 ± 0.052
2.75PheVal: 2.75 ± 0.05
0.637PheTrp: 0.637 ± 0.031
0.952PheTyr: 0.952 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
9.416GlyAla: 9.416 ± 0.108
0.84GlyCys: 0.84 ± 0.031
4.992GlyAsp: 4.992 ± 0.082
4.546GlyGlu: 4.546 ± 0.071
3.897GlyPhe: 3.897 ± 0.063
7.096GlyGly: 7.096 ± 0.116
1.952GlyHis: 1.952 ± 0.05
4.529GlyIle: 4.529 ± 0.064
3.529GlyLys: 3.529 ± 0.059
8.917GlyLeu: 8.917 ± 0.116
2.68GlyMet: 2.68 ± 0.053
2.247GlyAsn: 2.247 ± 0.065
3.525GlyPro: 3.525 ± 0.064
3.198GlyGln: 3.198 ± 0.061
5.256GlyArg: 5.256 ± 0.079
4.262GlySer: 4.262 ± 0.071
4.455GlyThr: 4.455 ± 0.078
6.536GlyVal: 6.536 ± 0.075
1.447GlyTrp: 1.447 ± 0.039
2.326GlyTyr: 2.326 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
2.222HisAla: 2.222 ± 0.056
0.214HisCys: 0.214 ± 0.014
1.366HisAsp: 1.366 ± 0.042
1.098HisGlu: 1.098 ± 0.034
0.825HisPhe: 0.825 ± 0.03
1.902HisGly: 1.902 ± 0.046
0.58HisHis: 0.58 ± 0.03
1.024HisIle: 1.024 ± 0.029
0.637HisLys: 0.637 ± 0.027
2.119HisLeu: 2.119 ± 0.049
0.605HisMet: 0.605 ± 0.025
0.494HisAsn: 0.494 ± 0.018
1.407HisPro: 1.407 ± 0.034
0.657HisGln: 0.657 ± 0.023
1.326HisArg: 1.326 ± 0.035
0.911HisSer: 0.911 ± 0.032
0.859HisThr: 0.859 ± 0.031
1.622HisVal: 1.622 ± 0.041
0.372HisTrp: 0.372 ± 0.022
0.548HisTyr: 0.548 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
7.118IleAla: 7.118 ± 0.09
0.622IleCys: 0.622 ± 0.026
3.582IleAsp: 3.582 ± 0.056
3.585IleGlu: 3.585 ± 0.063
1.892IlePhe: 1.892 ± 0.056
5.01IleGly: 5.01 ± 0.084
0.936IleHis: 0.936 ± 0.032
2.539IleIle: 2.539 ± 0.066
1.705IleLys: 1.705 ± 0.034
4.96IleLeu: 4.96 ± 0.083
1.229IleMet: 1.229 ± 0.035
1.533IleAsn: 1.533 ± 0.041
2.502IlePro: 2.502 ± 0.051
1.299IleGln: 1.299 ± 0.033
3.249IleArg: 3.249 ± 0.053
3.246IleSer: 3.246 ± 0.07
3.089IleThr: 3.089 ± 0.06
4.029IleVal: 4.029 ± 0.065
0.8IleTrp: 0.8 ± 0.029
1.254IleTyr: 1.254 ± 0.037
0.0IleXaa: 0.0 ± 0.0
Lys
4.157LysAla: 4.157 ± 0.066
0.208LysCys: 0.208 ± 0.015
2.153LysAsp: 2.153 ± 0.043
1.718LysGlu: 1.718 ± 0.043
1.079LysPhe: 1.079 ± 0.035
3.053LysGly: 3.053 ± 0.062
0.74LysHis: 0.74 ± 0.028
1.955LysIle: 1.955 ± 0.043
1.526LysLys: 1.526 ± 0.051
3.406LysLeu: 3.406 ± 0.062
1.028LysMet: 1.028 ± 0.032
1.022LysAsn: 1.022 ± 0.031
2.011LysPro: 2.011 ± 0.055
1.08LysGln: 1.08 ± 0.031
2.601LysArg: 2.601 ± 0.056
2.045LysSer: 2.045 ± 0.05
2.283LysThr: 2.283 ± 0.05
2.593LysVal: 2.593 ± 0.058
0.451LysTrp: 0.451 ± 0.019
0.741LysTyr: 0.741 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
11.971LeuAla: 11.971 ± 0.121
0.84LeuCys: 0.84 ± 0.032
6.298LeuAsp: 6.298 ± 0.083
5.207LeuGlu: 5.207 ± 0.08
3.657LeuPhe: 3.657 ± 0.068
8.412LeuGly: 8.412 ± 0.097
1.825LeuHis: 1.825 ± 0.045
5.389LeuIle: 5.389 ± 0.087
3.491LeuLys: 3.491 ± 0.053
8.609LeuLeu: 8.609 ± 0.114
2.67LeuMet: 2.67 ± 0.053
2.817LeuAsn: 2.817 ± 0.047
5.304LeuPro: 5.304 ± 0.083
2.579LeuGln: 2.579 ± 0.051
6.435LeuArg: 6.435 ± 0.089
6.761LeuSer: 6.761 ± 0.086
6.322LeuThr: 6.322 ± 0.075
6.907LeuVal: 6.907 ± 0.095
1.263LeuTrp: 1.263 ± 0.038
1.949LeuTyr: 1.949 ± 0.041
0.0LeuXaa: 0.0 ± 0.0
Met
3.355MetAla: 3.355 ± 0.058
0.201MetCys: 0.201 ± 0.015
1.545MetAsp: 1.545 ± 0.037
1.348MetGlu: 1.348 ± 0.036
0.964MetPhe: 0.964 ± 0.028
2.434MetGly: 2.434 ± 0.053
0.488MetHis: 0.488 ± 0.024
1.684MetIle: 1.684 ± 0.047
1.256MetLys: 1.256 ± 0.038
2.6MetLeu: 2.6 ± 0.052
0.851MetMet: 0.851 ± 0.033
1.031MetAsn: 1.031 ± 0.035
1.486MetPro: 1.486 ± 0.044
0.952MetGln: 0.952 ± 0.028
1.922MetArg: 1.922 ± 0.042
1.961MetSer: 1.961 ± 0.042
2.152MetThr: 2.152 ± 0.044
2.018MetVal: 2.018 ± 0.044
0.261MetTrp: 0.261 ± 0.017
0.349MetTyr: 0.349 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.314AsnAla: 3.314 ± 0.056
0.274AsnCys: 0.274 ± 0.016
1.658AsnAsp: 1.658 ± 0.044
1.369AsnGlu: 1.369 ± 0.035
0.994AsnPhe: 0.994 ± 0.032
2.456AsnGly: 2.456 ± 0.058
0.596AsnHis: 0.596 ± 0.025
1.54AsnIle: 1.54 ± 0.041
0.874AsnLys: 0.874 ± 0.03
2.63AsnLeu: 2.63 ± 0.055
0.776AsnMet: 0.776 ± 0.028
0.75AsnAsn: 0.75 ± 0.03
1.94AsnPro: 1.94 ± 0.044
0.872AsnGln: 0.872 ± 0.029
1.837AsnArg: 1.837 ± 0.035
1.292AsnSer: 1.292 ± 0.035
1.537AsnThr: 1.537 ± 0.04
2.013AsnVal: 2.013 ± 0.048
0.522AsnTrp: 0.522 ± 0.022
0.675AsnTyr: 0.675 ± 0.023
0.0AsnXaa: 0.0 ± 0.0
Pro
5.15ProAla: 5.15 ± 0.075
0.36ProCys: 0.36 ± 0.018
4.228ProAsp: 4.228 ± 0.062
3.648ProGlu: 3.648 ± 0.071
2.029ProPhe: 2.029 ± 0.051
4.198ProGly: 4.198 ± 0.067
1.107ProHis: 1.107 ± 0.037
2.463ProIle: 2.463 ± 0.052
1.868ProLys: 1.868 ± 0.044
4.393ProLeu: 4.393 ± 0.069
1.352ProMet: 1.352 ± 0.034
1.432ProAsn: 1.432 ± 0.036
2.142ProPro: 2.142 ± 0.059
1.464ProGln: 1.464 ± 0.038
2.508ProArg: 2.508 ± 0.054
2.553ProSer: 2.553 ± 0.057
2.645ProThr: 2.645 ± 0.049
4.119ProVal: 4.119 ± 0.065
0.674ProTrp: 0.674 ± 0.029
1.168ProTyr: 1.168 ± 0.037
0.0ProXaa: 0.0 ± 0.0
Gln
3.893GlnAla: 3.893 ± 0.064
0.201GlnCys: 0.201 ± 0.014
1.901GlnAsp: 1.901 ± 0.047
1.455GlnGlu: 1.455 ± 0.037
1.131GlnPhe: 1.131 ± 0.033
2.539GlnGly: 2.539 ± 0.048
0.604GlnHis: 0.604 ± 0.021
2.117GlnIle: 2.117 ± 0.052
1.237GlnLys: 1.237 ± 0.037
2.857GlnLeu: 2.857 ± 0.052
1.135GlnMet: 1.135 ± 0.035
1.041GlnAsn: 1.041 ± 0.032
1.601GlnPro: 1.601 ± 0.037
1.034GlnGln: 1.034 ± 0.035
2.009GlnArg: 2.009 ± 0.048
1.946GlnSer: 1.946 ± 0.053
1.924GlnThr: 1.924 ± 0.049
2.391GlnVal: 2.391 ± 0.049
0.418GlnTrp: 0.418 ± 0.017
0.596GlnTyr: 0.596 ± 0.022
0.0GlnXaa: 0.0 ± 0.0
Arg
7.351ArgAla: 7.351 ± 0.087
0.449ArgCys: 0.449 ± 0.021
4.346ArgAsp: 4.346 ± 0.078
3.577ArgGlu: 3.577 ± 0.061
2.674ArgPhe: 2.674 ± 0.052
4.302ArgGly: 4.302 ± 0.066
1.473ArgHis: 1.473 ± 0.037
3.68ArgIle: 3.68 ± 0.065
2.533ArgLys: 2.533 ± 0.059
6.808ArgLeu: 6.808 ± 0.097
1.93ArgMet: 1.93 ± 0.039
1.857ArgAsn: 1.857 ± 0.045
2.899ArgPro: 2.899 ± 0.059
2.184ArgGln: 2.184 ± 0.045
4.407ArgArg: 4.407 ± 0.08
3.225ArgSer: 3.225 ± 0.048
3.013ArgThr: 3.013 ± 0.054
4.689ArgVal: 4.689 ± 0.063
0.907ArgTrp: 0.907 ± 0.028
1.515ArgTyr: 1.515 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
5.639SerAla: 5.639 ± 0.081
0.454SerCys: 0.454 ± 0.024
3.523SerAsp: 3.523 ± 0.063
2.881SerGlu: 2.881 ± 0.055
2.391SerPhe: 2.391 ± 0.051
5.275SerGly: 5.275 ± 0.074
1.085SerHis: 1.085 ± 0.039
2.693SerIle: 2.693 ± 0.05
1.808SerLys: 1.808 ± 0.043
5.096SerLeu: 5.096 ± 0.071
1.484SerMet: 1.484 ± 0.037
1.508SerAsn: 1.508 ± 0.038
2.5SerPro: 2.5 ± 0.045
1.692SerGln: 1.692 ± 0.038
3.238SerArg: 3.238 ± 0.056
2.655SerSer: 2.655 ± 0.055
2.593SerThr: 2.593 ± 0.048
3.865SerVal: 3.865 ± 0.062
0.701SerTrp: 0.701 ± 0.026
1.312SerTyr: 1.312 ± 0.036
0.0SerXaa: 0.0 ± 0.0
Thr
6.012ThrAla: 6.012 ± 0.079
0.526ThrCys: 0.526 ± 0.024
3.426ThrAsp: 3.426 ± 0.06
2.992ThrGlu: 2.992 ± 0.054
2.006ThrPhe: 2.006 ± 0.046
5.403ThrGly: 5.403 ± 0.077
1.231ThrHis: 1.231 ± 0.033
2.986ThrIle: 2.986 ± 0.055
1.754ThrLys: 1.754 ± 0.046
5.93ThrLeu: 5.93 ± 0.07
1.402ThrMet: 1.402 ± 0.038
1.414ThrAsn: 1.414 ± 0.04
3.517ThrPro: 3.517 ± 0.057
1.732ThrGln: 1.732 ± 0.037
3.683ThrArg: 3.683 ± 0.061
2.832ThrSer: 2.832 ± 0.053
2.959ThrThr: 2.959 ± 0.059
4.052ThrVal: 4.052 ± 0.066
0.67ThrTrp: 0.67 ± 0.029
1.394ThrTyr: 1.394 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
8.436ValAla: 8.436 ± 0.101
0.585ValCys: 0.585 ± 0.026
4.406ValAsp: 4.406 ± 0.065
4.47ValGlu: 4.47 ± 0.068
3.01ValPhe: 3.01 ± 0.053
5.639ValGly: 5.639 ± 0.081
1.344ValHis: 1.344 ± 0.04
4.464ValIle: 4.464 ± 0.071
2.561ValLys: 2.561 ± 0.047
7.562ValLeu: 7.562 ± 0.093
2.175ValMet: 2.175 ± 0.052
2.091ValAsn: 2.091 ± 0.043
3.392ValPro: 3.392 ± 0.067
2.134ValGln: 2.134 ± 0.048
3.951ValArg: 3.951 ± 0.06
4.22ValSer: 4.22 ± 0.061
4.681ValThr: 4.681 ± 0.073
5.816ValVal: 5.816 ± 0.088
0.926ValTrp: 0.926 ± 0.029
1.495ValTyr: 1.495 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
1.327TrpAla: 1.327 ± 0.039
0.145TrpCys: 0.145 ± 0.011
0.823TrpAsp: 0.823 ± 0.029
0.617TrpGlu: 0.617 ± 0.026
0.56TrpPhe: 0.56 ± 0.027
1.057TrpGly: 1.057 ± 0.034
0.314TrpHis: 0.314 ± 0.015
0.772TrpIle: 0.772 ± 0.032
0.479TrpLys: 0.479 ± 0.021
1.6TrpLeu: 1.6 ± 0.044
0.459TrpMet: 0.459 ± 0.019
0.414TrpAsn: 0.414 ± 0.02
0.716TrpPro: 0.716 ± 0.03
0.552TrpGln: 0.552 ± 0.022
1.007TrpArg: 1.007 ± 0.031
0.805TrpSer: 0.805 ± 0.027
0.772TrpThr: 0.772 ± 0.028
1.028TrpVal: 1.028 ± 0.034
0.212TrpTrp: 0.212 ± 0.014
0.278TrpTyr: 0.278 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.439TyrAla: 2.439 ± 0.047
0.227TyrCys: 0.227 ± 0.016
1.659TyrAsp: 1.659 ± 0.042
1.322TyrGlu: 1.322 ± 0.041
0.906TyrPhe: 0.906 ± 0.03
2.091TyrGly: 2.091 ± 0.038
0.509TyrHis: 0.509 ± 0.024
0.953TyrIle: 0.953 ± 0.035
0.669TyrLys: 0.669 ± 0.025
2.389TyrLeu: 2.389 ± 0.056
0.539TyrMet: 0.539 ± 0.024
0.622TyrAsn: 0.622 ± 0.025
1.052TyrPro: 1.052 ± 0.033
0.743TyrGln: 0.743 ± 0.029
1.54TyrArg: 1.54 ± 0.038
1.182TyrSer: 1.182 ± 0.037
1.121TyrThr: 1.121 ± 0.032
1.616TyrVal: 1.616 ± 0.042
0.389TyrTrp: 0.389 ± 0.02
0.596TyrTyr: 0.596 ± 0.024
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3295 proteins (1032283 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
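A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the fixed random seed, and the use of the sample standard deviation as the reported error are assumptions for illustration only; the exact procedure used for the values above is not specified here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round resamples whole proteins with replacement,
    keeping the number of proteins equal to the original.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome)
mean_aa, err_aa = bootstrap_error(["AAAA", "ACAC", "CCCC"], "AA")
```

Resampling at the protein level rather than at the residue level keeps each protein's internal composition intact, so the error reflects variation between proteins.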

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski