Amino acid dipepetide frequency for Poriferisphaera corsica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.582AlaAla: 8.582 ± 0.13
0.956AlaCys: 0.956 ± 0.033
5.022AlaAsp: 5.022 ± 0.084
5.263AlaGlu: 5.263 ± 0.099
3.086AlaPhe: 3.086 ± 0.056
6.58AlaGly: 6.58 ± 0.109
1.612AlaHis: 1.612 ± 0.035
6.006AlaIle: 6.006 ± 0.084
4.654AlaLys: 4.654 ± 0.073
7.411AlaLeu: 7.411 ± 0.095
2.806AlaMet: 2.806 ± 0.051
3.392AlaAsn: 3.392 ± 0.059
2.699AlaPro: 2.699 ± 0.059
2.99AlaGln: 2.99 ± 0.061
3.902AlaArg: 3.902 ± 0.068
5.317AlaSer: 5.317 ± 0.075
4.549AlaThr: 4.549 ± 0.079
5.954AlaVal: 5.954 ± 0.087
1.133AlaTrp: 1.133 ± 0.035
2.635AlaTyr: 2.635 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.842CysAla: 0.842 ± 0.03
0.212CysCys: 0.212 ± 0.014
0.7CysAsp: 0.7 ± 0.028
0.778CysGlu: 0.778 ± 0.026
0.404CysPhe: 0.404 ± 0.019
1.121CysGly: 1.121 ± 0.035
0.268CysHis: 0.268 ± 0.015
0.621CysIle: 0.621 ± 0.024
0.514CysLys: 0.514 ± 0.022
0.93CysLeu: 0.93 ± 0.029
0.276CysMet: 0.276 ± 0.017
0.384CysAsn: 0.384 ± 0.017
0.566CysPro: 0.566 ± 0.025
0.269CysGln: 0.269 ± 0.016
0.534CysArg: 0.534 ± 0.02
0.61CysSer: 0.61 ± 0.03
0.497CysThr: 0.497 ± 0.022
0.851CysVal: 0.851 ± 0.03
0.139CysTrp: 0.139 ± 0.011
0.323CysTyr: 0.323 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
5.203AspAla: 5.203 ± 0.076
0.566AspCys: 0.566 ± 0.021
3.756AspAsp: 3.756 ± 0.065
4.57AspGlu: 4.57 ± 0.088
2.351AspPhe: 2.351 ± 0.044
4.894AspGly: 4.894 ± 0.092
1.435AspHis: 1.435 ± 0.039
3.604AspIle: 3.604 ± 0.054
2.692AspLys: 2.692 ± 0.054
5.706AspLeu: 5.706 ± 0.082
1.519AspMet: 1.519 ± 0.037
2.252AspAsn: 2.252 ± 0.048
2.747AspPro: 2.747 ± 0.05
2.614AspGln: 2.614 ± 0.05
3.17AspArg: 3.17 ± 0.059
3.145AspSer: 3.145 ± 0.059
2.973AspThr: 2.973 ± 0.05
4.508AspVal: 4.508 ± 0.066
1.086AspTrp: 1.086 ± 0.03
1.875AspTyr: 1.875 ± 0.037
0.0AspXaa: 0.0 ± 0.0
Glu
5.378GluAla: 5.378 ± 0.094
0.516GluCys: 0.516 ± 0.022
3.357GluAsp: 3.357 ± 0.06
3.721GluGlu: 3.721 ± 0.08
1.927GluPhe: 1.927 ± 0.04
4.338GluGly: 4.338 ± 0.073
1.407GluHis: 1.407 ± 0.033
3.989GluIle: 3.989 ± 0.072
3.619GluLys: 3.619 ± 0.067
5.955GluLeu: 5.955 ± 0.07
2.091GluMet: 2.091 ± 0.048
2.474GluAsn: 2.474 ± 0.045
2.311GluPro: 2.311 ± 0.044
3.192GluGln: 3.192 ± 0.064
3.523GluArg: 3.523 ± 0.069
3.556GluSer: 3.556 ± 0.053
3.033GluThr: 3.033 ± 0.051
4.611GluVal: 4.611 ± 0.096
0.728GluTrp: 0.728 ± 0.025
1.806GluTyr: 1.806 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
3.219PheAla: 3.219 ± 0.062
0.455PheCys: 0.455 ± 0.02
2.766PheAsp: 2.766 ± 0.054
2.515PheGlu: 2.515 ± 0.052
1.512PhePhe: 1.512 ± 0.04
3.241PheGly: 3.241 ± 0.07
0.708PheHis: 0.708 ± 0.026
2.19PheIle: 2.19 ± 0.044
1.725PheLys: 1.725 ± 0.036
3.094PheLeu: 3.094 ± 0.065
0.905PheMet: 0.905 ± 0.027
1.75PheAsn: 1.75 ± 0.041
1.295PhePro: 1.295 ± 0.034
1.051PheGln: 1.051 ± 0.033
1.567PheArg: 1.567 ± 0.035
2.399PheSer: 2.399 ± 0.047
2.378PheThr: 2.378 ± 0.049
2.696PheVal: 2.696 ± 0.05
0.573PheTrp: 0.573 ± 0.025
1.216PheTyr: 1.216 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
5.429GlyAla: 5.429 ± 0.088
0.999GlyCys: 0.999 ± 0.03
4.62GlyAsp: 4.62 ± 0.089
5.089GlyGlu: 5.089 ± 0.096
3.185GlyPhe: 3.185 ± 0.056
6.532GlyGly: 6.532 ± 0.121
1.778GlyHis: 1.778 ± 0.044
4.904GlyIle: 4.904 ± 0.069
4.141GlyLys: 4.141 ± 0.072
7.153GlyLeu: 7.153 ± 0.091
2.697GlyMet: 2.697 ± 0.051
2.997GlyAsn: 2.997 ± 0.07
2.153GlyPro: 2.153 ± 0.045
2.814GlyGln: 2.814 ± 0.052
3.85GlyArg: 3.85 ± 0.065
4.726GlySer: 4.726 ± 0.077
3.835GlyThr: 3.835 ± 0.087
5.782GlyVal: 5.782 ± 0.101
1.218GlyTrp: 1.218 ± 0.033
2.683GlyTyr: 2.683 ± 0.046
0.0GlyXaa: 0.0 ± 0.0
His
1.928HisAla: 1.928 ± 0.042
0.276HisCys: 0.276 ± 0.016
1.422HisAsp: 1.422 ± 0.039
1.46HisGlu: 1.46 ± 0.037
0.926HisPhe: 0.926 ± 0.028
1.663HisGly: 1.663 ± 0.038
0.767HisHis: 0.767 ± 0.028
1.417HisIle: 1.417 ± 0.035
0.954HisLys: 0.954 ± 0.03
2.133HisLeu: 2.133 ± 0.049
0.522HisMet: 0.522 ± 0.022
0.93HisAsn: 0.93 ± 0.028
1.191HisPro: 1.191 ± 0.034
0.883HisGln: 0.883 ± 0.028
1.2HisArg: 1.2 ± 0.034
1.062HisSer: 1.062 ± 0.034
1.286HisThr: 1.286 ± 0.033
1.525HisVal: 1.525 ± 0.036
0.368HisTrp: 0.368 ± 0.018
0.734HisTyr: 0.734 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
6.524IleAla: 6.524 ± 0.085
0.784IleCys: 0.784 ± 0.025
4.698IleAsp: 4.698 ± 0.065
4.551IleGlu: 4.551 ± 0.068
2.292IlePhe: 2.292 ± 0.055
5.132IleGly: 5.132 ± 0.081
1.416IleHis: 1.416 ± 0.037
3.877IleIle: 3.877 ± 0.066
3.189IleLys: 3.189 ± 0.058
5.373IleLeu: 5.373 ± 0.087
1.259IleMet: 1.259 ± 0.029
3.132IleAsn: 3.132 ± 0.065
2.814IlePro: 2.814 ± 0.05
2.213IleGln: 2.213 ± 0.04
3.089IleArg: 3.089 ± 0.054
3.965IleSer: 3.965 ± 0.066
4.195IleThr: 4.195 ± 0.08
4.332IleVal: 4.332 ± 0.066
0.765IleTrp: 0.765 ± 0.027
1.872IleTyr: 1.872 ± 0.041
0.0IleXaa: 0.0 ± 0.0
Lys
4.212LysAla: 4.212 ± 0.071
0.443LysCys: 0.443 ± 0.02
2.802LysAsp: 2.802 ± 0.052
2.666LysGlu: 2.666 ± 0.061
1.481LysPhe: 1.481 ± 0.037
3.057LysGly: 3.057 ± 0.06
1.404LysHis: 1.404 ± 0.036
3.145LysIle: 3.145 ± 0.056
3.113LysLys: 3.113 ± 0.077
5.035LysLeu: 5.035 ± 0.073
1.478LysMet: 1.478 ± 0.032
2.139LysAsn: 2.139 ± 0.042
2.581LysPro: 2.581 ± 0.05
2.75LysGln: 2.75 ± 0.06
3.308LysArg: 3.308 ± 0.061
2.976LysSer: 2.976 ± 0.059
2.874LysThr: 2.874 ± 0.053
3.133LysVal: 3.133 ± 0.059
0.64LysTrp: 0.64 ± 0.026
1.609LysTyr: 1.609 ± 0.039
0.0LysXaa: 0.0 ± 0.0
Leu
8.453LeuAla: 8.453 ± 0.098
1.097LeuCys: 1.097 ± 0.028
5.597LeuAsp: 5.597 ± 0.074
5.139LeuGlu: 5.139 ± 0.085
3.379LeuPhe: 3.379 ± 0.062
7.226LeuGly: 7.226 ± 0.086
1.877LeuHis: 1.877 ± 0.041
6.401LeuIle: 6.401 ± 0.09
4.693LeuLys: 4.693 ± 0.061
9.09LeuLeu: 9.09 ± 0.109
2.461LeuMet: 2.461 ± 0.047
4.178LeuAsn: 4.178 ± 0.068
4.607LeuPro: 4.607 ± 0.068
3.203LeuGln: 3.203 ± 0.066
4.906LeuArg: 4.906 ± 0.071
6.151LeuSer: 6.151 ± 0.075
5.695LeuThr: 5.695 ± 0.08
6.003LeuVal: 6.003 ± 0.075
1.121LeuTrp: 1.121 ± 0.034
2.467LeuTyr: 2.467 ± 0.05
0.0LeuXaa: 0.0 ± 0.0
Met
2.167MetAla: 2.167 ± 0.044
0.315MetCys: 0.315 ± 0.018
1.415MetAsp: 1.415 ± 0.04
1.162MetGlu: 1.162 ± 0.029
0.987MetPhe: 0.987 ± 0.033
2.292MetGly: 2.292 ± 0.052
0.601MetHis: 0.601 ± 0.023
1.909MetIle: 1.909 ± 0.038
1.594MetLys: 1.594 ± 0.037
2.857MetLeu: 2.857 ± 0.053
0.988MetMet: 0.988 ± 0.027
1.329MetAsn: 1.329 ± 0.032
1.37MetPro: 1.37 ± 0.031
1.122MetGln: 1.122 ± 0.034
1.781MetArg: 1.781 ± 0.037
1.897MetSer: 1.897 ± 0.037
1.74MetThr: 1.74 ± 0.039
1.796MetVal: 1.796 ± 0.04
0.347MetTrp: 0.347 ± 0.019
0.671MetTyr: 0.671 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
3.609AsnAla: 3.609 ± 0.066
0.354AsnCys: 0.354 ± 0.02
2.712AsnAsp: 2.712 ± 0.056
2.409AsnGlu: 2.409 ± 0.049
1.499AsnPhe: 1.499 ± 0.035
3.142AsnGly: 3.142 ± 0.077
1.065AsnHis: 1.065 ± 0.028
2.785AsnIle: 2.785 ± 0.063
2.016AsnLys: 2.016 ± 0.048
4.001AsnLeu: 4.001 ± 0.07
1.016AsnMet: 1.016 ± 0.03
2.188AsnAsn: 2.188 ± 0.062
2.393AsnPro: 2.393 ± 0.055
2.066AsnGln: 2.066 ± 0.048
2.166AsnArg: 2.166 ± 0.042
2.287AsnSer: 2.287 ± 0.055
2.615AsnThr: 2.615 ± 0.064
2.634AsnVal: 2.634 ± 0.052
0.589AsnTrp: 0.589 ± 0.024
1.285AsnTyr: 1.285 ± 0.036
0.0AsnXaa: 0.0 ± 0.0
Pro
3.502ProAla: 3.502 ± 0.063
0.357ProCys: 0.357 ± 0.018
2.748ProAsp: 2.748 ± 0.06
3.148ProGlu: 3.148 ± 0.059
1.553ProPhe: 1.553 ± 0.034
2.854ProGly: 2.854 ± 0.052
1.006ProHis: 1.006 ± 0.033
2.991ProIle: 2.991 ± 0.056
2.044ProLys: 2.044 ± 0.04
3.561ProLeu: 3.561 ± 0.062
1.103ProMet: 1.103 ± 0.038
2.11ProAsn: 2.11 ± 0.045
1.668ProPro: 1.668 ± 0.049
1.674ProGln: 1.674 ± 0.044
1.858ProArg: 1.858 ± 0.046
2.728ProSer: 2.728 ± 0.054
2.764ProThr: 2.764 ± 0.058
2.822ProVal: 2.822 ± 0.054
0.578ProTrp: 0.578 ± 0.027
1.24ProTyr: 1.24 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
3.359GlnAla: 3.359 ± 0.062
0.347GlnCys: 0.347 ± 0.017
1.807GlnAsp: 1.807 ± 0.043
1.816GlnGlu: 1.816 ± 0.039
1.441GlnPhe: 1.441 ± 0.037
2.346GlnGly: 2.346 ± 0.048
0.957GlnHis: 0.957 ± 0.03
2.998GlnIle: 2.998 ± 0.06
1.948GlnLys: 1.948 ± 0.041
4.099GlnLeu: 4.099 ± 0.069
1.226GlnMet: 1.226 ± 0.032
1.736GlnAsn: 1.736 ± 0.045
1.821GlnPro: 1.821 ± 0.052
2.129GlnGln: 2.129 ± 0.059
2.106GlnArg: 2.106 ± 0.052
2.73GlnSer: 2.73 ± 0.059
2.531GlnThr: 2.531 ± 0.05
2.41GlnVal: 2.41 ± 0.042
0.523GlnTrp: 0.523 ± 0.023
1.325GlnTyr: 1.325 ± 0.034
0.0GlnXaa: 0.0 ± 0.0
Arg
3.624ArgAla: 3.624 ± 0.061
0.563ArgCys: 0.563 ± 0.025
3.038ArgAsp: 3.038 ± 0.053
3.559ArgGlu: 3.559 ± 0.067
2.219ArgPhe: 2.219 ± 0.05
3.556ArgGly: 3.556 ± 0.069
1.149ArgHis: 1.149 ± 0.03
3.553ArgIle: 3.553 ± 0.06
2.917ArgLys: 2.917 ± 0.059
5.131ArgLeu: 5.131 ± 0.074
1.611ArgMet: 1.611 ± 0.035
1.954ArgAsn: 1.954 ± 0.038
1.877ArgPro: 1.877 ± 0.047
2.064ArgGln: 2.064 ± 0.044
3.191ArgArg: 3.191 ± 0.069
2.875ArgSer: 2.875 ± 0.05
2.485ArgThr: 2.485 ± 0.044
3.741ArgVal: 3.741 ± 0.076
0.828ArgTrp: 0.828 ± 0.028
1.879ArgTyr: 1.879 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
4.563SerAla: 4.563 ± 0.07
0.61SerCys: 0.61 ± 0.023
3.713SerAsp: 3.713 ± 0.059
3.665SerGlu: 3.665 ± 0.052
2.454SerPhe: 2.454 ± 0.047
5.063SerGly: 5.063 ± 0.097
1.392SerHis: 1.392 ± 0.034
3.977SerIle: 3.977 ± 0.066
2.902SerLys: 2.902 ± 0.05
5.652SerLeu: 5.652 ± 0.072
1.639SerMet: 1.639 ± 0.038
2.848SerAsn: 2.848 ± 0.062
2.657SerPro: 2.657 ± 0.05
2.386SerGln: 2.386 ± 0.042
3.019SerArg: 3.019 ± 0.058
4.026SerSer: 4.026 ± 0.08
3.42SerThr: 3.42 ± 0.061
3.91SerVal: 3.91 ± 0.063
0.827SerTrp: 0.827 ± 0.026
1.819SerTyr: 1.819 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
4.776ThrAla: 4.776 ± 0.066
0.547ThrCys: 0.547 ± 0.026
3.28ThrAsp: 3.28 ± 0.054
2.891ThrGlu: 2.891 ± 0.051
2.131ThrPhe: 2.131 ± 0.052
4.654ThrGly: 4.654 ± 0.085
1.401ThrHis: 1.401 ± 0.036
4.045ThrIle: 4.045 ± 0.08
2.613ThrLys: 2.613 ± 0.052
5.569ThrLeu: 5.569 ± 0.089
1.259ThrMet: 1.259 ± 0.032
2.425ThrAsn: 2.425 ± 0.056
3.055ThrPro: 3.055 ± 0.062
2.234ThrGln: 2.234 ± 0.051
2.41ThrArg: 2.41 ± 0.046
3.533ThrSer: 3.533 ± 0.068
3.62ThrThr: 3.62 ± 0.073
3.731ThrVal: 3.731 ± 0.061
0.716ThrTrp: 0.716 ± 0.028
1.727ThrTyr: 1.727 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
5.52ValAla: 5.52 ± 0.085
0.891ValCys: 0.891 ± 0.029
4.328ValAsp: 4.328 ± 0.068
4.35ValGlu: 4.35 ± 0.072
2.59ValPhe: 2.59 ± 0.057
5.514ValGly: 5.514 ± 0.099
1.301ValHis: 1.301 ± 0.032
4.605ValIle: 4.605 ± 0.066
3.579ValLys: 3.579 ± 0.06
6.501ValLeu: 6.501 ± 0.094
2.264ValMet: 2.264 ± 0.057
2.711ValAsn: 2.711 ± 0.049
2.635ValPro: 2.635 ± 0.049
2.034ValGln: 2.034 ± 0.045
3.6ValArg: 3.6 ± 0.07
4.138ValSer: 4.138 ± 0.062
3.724ValThr: 3.724 ± 0.059
5.705ValVal: 5.705 ± 0.112
1.011ValTrp: 1.011 ± 0.032
2.049ValTyr: 2.049 ± 0.049
0.0ValXaa: 0.0 ± 0.0
Trp
1.058TrpAla: 1.058 ± 0.03
0.187TrpCys: 0.187 ± 0.015
0.853TrpAsp: 0.853 ± 0.03
0.747TrpGlu: 0.747 ± 0.025
0.598TrpPhe: 0.598 ± 0.022
1.115TrpGly: 1.115 ± 0.034
0.368TrpHis: 0.368 ± 0.02
0.753TrpIle: 0.753 ± 0.027
0.588TrpLys: 0.588 ± 0.023
1.531TrpLeu: 1.531 ± 0.041
0.459TrpMet: 0.459 ± 0.019
0.539TrpAsn: 0.539 ± 0.021
0.58TrpPro: 0.58 ± 0.022
0.686TrpGln: 0.686 ± 0.024
0.796TrpArg: 0.796 ± 0.027
0.78TrpSer: 0.78 ± 0.028
0.651TrpThr: 0.651 ± 0.026
0.955TrpVal: 0.955 ± 0.035
0.297TrpTrp: 0.297 ± 0.019
0.46TrpTyr: 0.46 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.6TyrAla: 2.6 ± 0.053
0.362TyrCys: 0.362 ± 0.019
2.031TyrAsp: 2.031 ± 0.053
2.034TyrGlu: 2.034 ± 0.044
1.24TyrPhe: 1.24 ± 0.031
2.267TyrGly: 2.267 ± 0.054
0.736TyrHis: 0.736 ± 0.028
1.699TyrIle: 1.699 ± 0.035
1.35TyrLys: 1.35 ± 0.036
2.961TyrLeu: 2.961 ± 0.058
0.757TyrMet: 0.757 ± 0.026
1.362TyrAsn: 1.362 ± 0.036
1.289TyrPro: 1.289 ± 0.034
1.285TyrGln: 1.285 ± 0.035
1.839TyrArg: 1.839 ± 0.038
1.642TyrSer: 1.642 ± 0.04
1.753TyrThr: 1.753 ± 0.044
1.953TyrVal: 1.953 ± 0.044
0.515TyrTrp: 0.515 ± 0.02
1.041TyrTyr: 1.041 ± 0.034
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3659 proteins (1197420 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski