Amino acid dipeptide frequency for Pelagibaca abyssi

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
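As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the code used by Proteome-pI: the function name dipeptide_per_mille and the counting rule (overlapping adjacent pairs, counted within each protein and never across protein boundaries) are assumptions made for this example. Any residue outside the 20 standard one-letter codes is mapped to Xaa.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the column order used by the table; Xaa stands for
# any non-standard or unknown residue.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: dipeptides are overlapping adjacent pairs counted within
    each protein, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(res, "Xaa") for res in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


# Toy input: MAAL gives MA, AA, AL and AAG gives AA, AG (5 pairs in total).
freqs = dipeptide_per_mille(["MAAL", "AAG"])
print(freqs["AlaAla"])  # 2 of the 5 dipeptides are AlaAla
```

The returned dictionary always has 441 keys, so dipeptides that never occur (such as those involving Xaa in this proteome) show up as 0.0 rather than being missing.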

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
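The arithmetic behind the expected value can be sketched in a few lines of Python. The page does not state exactly which baseline is used for the colouring, so the classify helper below, which compares a value against the uniform expectation, is an assumption for illustration only. The observed values are copied from the table.

```python
# Expected frequency of each dipeptide if all were equally likely.
N_STANDARD = 20 ** 2        # 400 dipeptides from the 20 standard residues
N_WITH_XAA = 21 ** 2        # 441 when Xaa is counted as a 21st residue

expected_per_mille = 1000.0 / N_STANDARD       # 2.5 per mille, i.e. 0.25%
expected_with_xaa = 1000.0 / N_WITH_XAA        # about 2.27 per mille

# A few observed values (per mille) copied from the table.
observed = {"AlaAla": 17.582, "LeuLeu": 9.436, "CysCys": 0.131, "TrpTrp": 0.199}


def classify(value, expected=expected_per_mille):
    """Label a value relative to the uniform expectation (assumed baseline)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


labels = {name: classify(value) for name, value in observed.items()}
print(labels)
```

Under this assumed baseline, AlaAla and LeuLeu come out as overrepresented and CysCys and TrpTrp as underrepresented.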

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
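As a small worked example of this unit conversion, the following Python sketch turns a per mille value into a fraction and into percent. The helper names are made up for this example; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 17.582  # AlaAla value from the table, in per mille
print(round(per_mille_to_fraction(ala_ala), 6))  # 0.017582
print(round(per_mille_to_percent(ala_ala), 4))   # 1.7582
```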

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.582 ± 0.166
AlaCys: 1.145 ± 0.03
AlaAsp: 6.779 ± 0.087
AlaGlu: 9.478 ± 0.106
AlaPhe: 4.282 ± 0.051
AlaGly: 11.254 ± 0.09
AlaHis: 2.363 ± 0.04
AlaIle: 5.422 ± 0.073
AlaLys: 3.353 ± 0.06
AlaLeu: 15.206 ± 0.149
AlaMet: 3.849 ± 0.052
AlaAsn: 2.475 ± 0.047
AlaPro: 6.536 ± 0.092
AlaGln: 4.701 ± 0.052
AlaArg: 9.775 ± 0.106
AlaSer: 5.635 ± 0.065
AlaThr: 5.839 ± 0.067
AlaVal: 8.436 ± 0.089
AlaTrp: 1.489 ± 0.033
AlaTyr: 2.538 ± 0.038
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.11 ± 0.028
CysCys: 0.131 ± 0.011
CysAsp: 0.602 ± 0.02
CysGlu: 0.467 ± 0.018
CysPhe: 0.327 ± 0.014
CysGly: 0.94 ± 0.033
CysHis: 0.269 ± 0.016
CysIle: 0.377 ± 0.015
CysLys: 0.202 ± 0.011
CysLeu: 0.866 ± 0.025
CysMet: 0.178 ± 0.01
CysAsn: 0.2 ± 0.01
CysPro: 0.511 ± 0.019
CysGln: 0.222 ± 0.012
CysArg: 0.621 ± 0.024
CysSer: 0.409 ± 0.019
CysThr: 0.454 ± 0.019
CysVal: 0.597 ± 0.018
CysTrp: 0.112 ± 0.008
CysTyr: 0.189 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.389 ± 0.08
AspCys: 0.503 ± 0.018
AspAsp: 3.316 ± 0.062
AspGlu: 3.78 ± 0.065
AspPhe: 2.199 ± 0.039
AspGly: 5.815 ± 0.114
AspHis: 1.252 ± 0.031
AspIle: 2.841 ± 0.048
AspLys: 1.482 ± 0.038
AspLeu: 6.286 ± 0.073
AspMet: 1.667 ± 0.036
AspAsn: 1.121 ± 0.031
AspPro: 3.858 ± 0.053
AspGln: 1.475 ± 0.031
AspArg: 4.331 ± 0.057
AspSer: 2.227 ± 0.047
AspThr: 3.118 ± 0.075
AspVal: 3.899 ± 0.055
AspTrp: 1.195 ± 0.034
AspTyr: 1.563 ± 0.032
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.236 ± 0.095
GluCys: 0.356 ± 0.016
GluAsp: 3.564 ± 0.047
GluGlu: 4.238 ± 0.067
GluPhe: 1.742 ± 0.033
GluGly: 5.304 ± 0.062
GluHis: 1.196 ± 0.03
GluIle: 4.154 ± 0.065
GluLys: 2.147 ± 0.044
GluLeu: 5.638 ± 0.062
GluMet: 1.971 ± 0.035
GluAsn: 1.651 ± 0.033
GluPro: 2.819 ± 0.047
GluGln: 1.997 ± 0.039
GluArg: 4.747 ± 0.062
GluSer: 2.495 ± 0.047
GluThr: 4.324 ± 0.062
GluVal: 4.235 ± 0.059
GluTrp: 0.671 ± 0.021
GluTyr: 1.102 ± 0.024
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.487 ± 0.052
PheCys: 0.392 ± 0.017
PheAsp: 2.733 ± 0.043
PheGlu: 2.25 ± 0.042
PhePhe: 1.379 ± 0.038
PheGly: 3.618 ± 0.052
PheHis: 0.762 ± 0.02
PheIle: 1.454 ± 0.035
PheLys: 0.766 ± 0.022
PheLeu: 3.366 ± 0.057
PheMet: 0.83 ± 0.025
PheAsn: 0.926 ± 0.027
PhePro: 1.514 ± 0.034
PheGln: 0.972 ± 0.026
PheArg: 2.259 ± 0.038
PheSer: 2.017 ± 0.042
PheThr: 2.021 ± 0.038
PheVal: 2.593 ± 0.048
PheTrp: 0.561 ± 0.018
PheTyr: 0.872 ± 0.027
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.625 ± 0.097
GlyCys: 0.88 ± 0.025
GlyAsp: 4.92 ± 0.084
GlyGlu: 5.259 ± 0.067
GlyPhe: 3.68 ± 0.053
GlyGly: 7.918 ± 0.116
GlyHis: 2.0 ± 0.041
GlyIle: 4.472 ± 0.066
GlyLys: 2.955 ± 0.05
GlyLeu: 9.56 ± 0.091
GlyMet: 2.67 ± 0.051
GlyAsn: 2.156 ± 0.061
GlyPro: 3.761 ± 0.051
GlyGln: 3.074 ± 0.047
GlyArg: 6.064 ± 0.069
GlySer: 4.433 ± 0.07
GlyThr: 4.774 ± 0.071
GlyVal: 6.359 ± 0.067
GlyTrp: 1.574 ± 0.032
GlyTyr: 2.36 ± 0.036
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.379 ± 0.044
HisCys: 0.227 ± 0.012
HisAsp: 1.303 ± 0.029
HisGlu: 1.106 ± 0.029
HisPhe: 0.821 ± 0.026
HisGly: 1.996 ± 0.044
HisHis: 0.548 ± 0.022
HisIle: 0.898 ± 0.025
HisLys: 0.503 ± 0.02
HisLeu: 2.073 ± 0.04
HisMet: 0.577 ± 0.02
HisAsn: 0.412 ± 0.017
HisPro: 1.365 ± 0.033
HisGln: 0.492 ± 0.019
HisArg: 1.399 ± 0.033
HisSer: 0.869 ± 0.026
HisThr: 0.753 ± 0.019
HisVal: 1.461 ± 0.03
HisTrp: 0.36 ± 0.015
HisTyr: 0.546 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.977 ± 0.068
IleCys: 0.562 ± 0.019
IleAsp: 3.269 ± 0.049
IleGlu: 3.393 ± 0.046
IlePhe: 1.706 ± 0.036
IleGly: 4.604 ± 0.057
IleHis: 0.902 ± 0.025
IleIle: 1.779 ± 0.04
IleLys: 1.128 ± 0.028
IleLeu: 4.545 ± 0.056
IleMet: 1.012 ± 0.028
IleAsn: 1.143 ± 0.031
IlePro: 2.235 ± 0.041
IleGln: 1.062 ± 0.025
IleArg: 3.283 ± 0.049
IleSer: 2.811 ± 0.053
IleThr: 2.645 ± 0.051
IleVal: 3.788 ± 0.062
IleTrp: 0.695 ± 0.024
IleTyr: 1.103 ± 0.026
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.559 ± 0.057
LysCys: 0.164 ± 0.011
LysAsp: 1.476 ± 0.035
LysGlu: 1.475 ± 0.038
LysPhe: 0.776 ± 0.025
LysGly: 2.415 ± 0.039
LysHis: 0.538 ± 0.019
LysIle: 1.474 ± 0.034
LysLys: 1.044 ± 0.032
LysLeu: 2.802 ± 0.045
LysMet: 0.716 ± 0.026
LysAsn: 0.592 ± 0.019
LysPro: 1.705 ± 0.036
LysGln: 0.841 ± 0.023
LysArg: 2.258 ± 0.044
LysSer: 1.535 ± 0.034
LysThr: 1.788 ± 0.034
LysVal: 2.027 ± 0.045
LysTrp: 0.342 ± 0.015
LysTyr: 0.548 ± 0.02
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.623 ± 0.114
LeuCys: 0.999 ± 0.027
LeuAsp: 6.247 ± 0.076
LeuGlu: 5.582 ± 0.065
LeuPhe: 3.508 ± 0.063
LeuGly: 8.782 ± 0.086
LeuHis: 1.905 ± 0.038
LeuIle: 4.886 ± 0.066
LeuLys: 2.941 ± 0.046
LeuLeu: 9.436 ± 0.126
LeuMet: 2.592 ± 0.049
LeuAsn: 2.451 ± 0.043
LeuPro: 5.965 ± 0.066
LeuGln: 2.559 ± 0.047
LeuArg: 7.759 ± 0.084
LeuSer: 6.899 ± 0.069
LeuThr: 6.023 ± 0.068
LeuVal: 6.933 ± 0.082
LeuTrp: 1.38 ± 0.033
LeuTyr: 2.022 ± 0.037
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.456 ± 0.047
MetCys: 0.196 ± 0.012
MetAsp: 1.357 ± 0.035
MetGlu: 1.411 ± 0.036
MetPhe: 0.744 ± 0.025
MetGly: 2.24 ± 0.045
MetHis: 0.457 ± 0.018
MetIle: 1.477 ± 0.033
MetLys: 0.965 ± 0.028
MetLeu: 2.66 ± 0.044
MetMet: 0.772 ± 0.03
MetAsn: 0.707 ± 0.02
MetPro: 1.573 ± 0.035
MetGln: 0.976 ± 0.029
MetArg: 2.002 ± 0.037
MetSer: 1.871 ± 0.034
MetThr: 1.998 ± 0.036
MetVal: 1.821 ± 0.034
MetTrp: 0.217 ± 0.014
MetTyr: 0.331 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.996 ± 0.045
AsnCys: 0.222 ± 0.011
AsnAsp: 1.435 ± 0.055
AsnGlu: 1.105 ± 0.03
AsnPhe: 0.84 ± 0.024
AsnGly: 2.285 ± 0.05
AsnHis: 0.427 ± 0.017
AsnIle: 1.264 ± 0.032
AsnLys: 0.539 ± 0.021
AsnLeu: 2.239 ± 0.036
AsnMet: 0.613 ± 0.019
AsnAsn: 0.551 ± 0.021
AsnPro: 1.67 ± 0.035
AsnGln: 0.575 ± 0.021
AsnArg: 1.641 ± 0.037
AsnSer: 0.974 ± 0.026
AsnThr: 1.225 ± 0.031
AsnVal: 1.65 ± 0.032
AsnTrp: 0.387 ± 0.016
AsnTyr: 0.585 ± 0.019
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.316 ± 0.073
ProCys: 0.369 ± 0.015
ProAsp: 3.807 ± 0.059
ProGlu: 4.961 ± 0.065
ProPhe: 1.935 ± 0.037
ProGly: 5.007 ± 0.064
ProHis: 1.057 ± 0.029
ProIle: 2.003 ± 0.04
ProLys: 1.481 ± 0.03
ProLeu: 4.895 ± 0.062
ProMet: 1.311 ± 0.031
ProAsn: 1.155 ± 0.027
ProPro: 2.476 ± 0.05
ProGln: 1.777 ± 0.034
ProArg: 3.061 ± 0.046
ProSer: 2.494 ± 0.04
ProThr: 2.179 ± 0.043
ProVal: 4.479 ± 0.06
ProTrp: 0.661 ± 0.022
ProTyr: 1.157 ± 0.027
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.091 ± 0.061
GlnCys: 0.201 ± 0.012
GlnAsp: 1.589 ± 0.034
GlnGlu: 1.615 ± 0.037
GlnPhe: 0.929 ± 0.026
GlnGly: 2.619 ± 0.046
GlnHis: 0.544 ± 0.021
GlnIle: 1.957 ± 0.04
GlnLys: 0.943 ± 0.029
GlnLeu: 2.788 ± 0.05
GlnMet: 0.941 ± 0.024
GlnAsn: 0.795 ± 0.023
GlnPro: 1.611 ± 0.035
GlnGln: 0.999 ± 0.027
GlnArg: 2.22 ± 0.038
GlnSer: 1.767 ± 0.035
GlnThr: 1.692 ± 0.036
GlnVal: 2.218 ± 0.043
GlnTrp: 0.379 ± 0.018
GlnTyr: 0.563 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.959 ± 0.104
ArgCys: 0.547 ± 0.023
ArgAsp: 4.461 ± 0.059
ArgGlu: 4.381 ± 0.055
ArgPhe: 2.83 ± 0.044
ArgGly: 5.059 ± 0.058
ArgHis: 1.684 ± 0.035
ArgIle: 3.964 ± 0.052
ArgLys: 2.186 ± 0.041
ArgLeu: 7.979 ± 0.098
ArgMet: 2.016 ± 0.035
ArgAsn: 1.776 ± 0.036
ArgPro: 3.518 ± 0.056
ArgGln: 2.395 ± 0.048
ArgArg: 5.647 ± 0.078
ArgSer: 3.36 ± 0.048
ArgThr: 2.671 ± 0.049
ArgVal: 4.831 ± 0.066
ArgTrp: 0.981 ± 0.027
ArgTyr: 1.672 ± 0.035
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.174 ± 0.068
SerCys: 0.461 ± 0.017
SerAsp: 3.255 ± 0.061
SerGlu: 3.145 ± 0.044
SerPhe: 2.207 ± 0.04
SerGly: 5.609 ± 0.079
SerHis: 1.066 ± 0.03
SerIle: 2.179 ± 0.04
SerLys: 1.315 ± 0.033
SerLeu: 4.811 ± 0.059
SerMet: 1.344 ± 0.028
SerAsn: 1.184 ± 0.031
SerPro: 2.542 ± 0.039
SerGln: 1.534 ± 0.034
SerArg: 3.265 ± 0.052
SerSer: 2.543 ± 0.043
SerThr: 2.386 ± 0.041
SerVal: 3.823 ± 0.052
SerTrp: 0.783 ± 0.026
SerTyr: 1.369 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.242 ± 0.081
ThrCys: 0.446 ± 0.018
ThrAsp: 3.06 ± 0.049
ThrGlu: 3.204 ± 0.056
ThrPhe: 1.748 ± 0.033
ThrGly: 5.64 ± 0.069
ThrHis: 1.023 ± 0.029
ThrIle: 2.471 ± 0.048
ThrLys: 1.162 ± 0.031
ThrLeu: 6.019 ± 0.065
ThrMet: 1.212 ± 0.028
ThrAsn: 1.112 ± 0.026
ThrPro: 3.458 ± 0.053
ThrGln: 1.51 ± 0.035
ThrArg: 3.734 ± 0.058
ThrSer: 2.468 ± 0.05
ThrThr: 2.481 ± 0.048
ThrVal: 4.273 ± 0.064
ThrTrp: 0.635 ± 0.023
ThrTyr: 1.181 ± 0.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.966 ± 0.088
ValCys: 0.617 ± 0.023
ValAsp: 3.786 ± 0.059
ValGlu: 4.583 ± 0.056
ValPhe: 2.782 ± 0.046
ValGly: 5.18 ± 0.065
ValHis: 1.253 ± 0.031
ValIle: 3.982 ± 0.056
ValLys: 1.979 ± 0.045
ValLeu: 7.489 ± 0.075
ValMet: 2.071 ± 0.043
ValAsn: 1.882 ± 0.039
ValPro: 3.625 ± 0.056
ValGln: 2.038 ± 0.037
ValArg: 4.03 ± 0.057
ValSer: 4.317 ± 0.055
ValThr: 4.848 ± 0.068
ValVal: 5.369 ± 0.067
ValTrp: 0.943 ± 0.025
ValTyr: 1.453 ± 0.03
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.367 ± 0.03
TrpCys: 0.137 ± 0.01
TrpAsp: 0.742 ± 0.027
TrpGlu: 0.764 ± 0.024
TrpPhe: 0.51 ± 0.02
TrpGly: 1.039 ± 0.03
TrpHis: 0.348 ± 0.015
TrpIle: 0.695 ± 0.021
TrpLys: 0.468 ± 0.019
TrpLeu: 1.64 ± 0.034
TrpMet: 0.4 ± 0.016
TrpAsn: 0.396 ± 0.017
TrpPro: 0.718 ± 0.024
TrpGln: 0.631 ± 0.021
TrpArg: 1.161 ± 0.026
TrpSer: 0.78 ± 0.022
TrpThr: 0.761 ± 0.023
TrpVal: 0.861 ± 0.024
TrpTrp: 0.199 ± 0.011
TrpTyr: 0.285 ± 0.014
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.605 ± 0.043
TyrCys: 0.232 ± 0.012
TyrAsp: 1.617 ± 0.034
TyrGlu: 1.391 ± 0.031
TyrPhe: 0.826 ± 0.025
TyrGly: 2.168 ± 0.034
TyrHis: 0.492 ± 0.018
TyrIle: 0.891 ± 0.026
TyrLys: 0.518 ± 0.018
TyrLeu: 2.191 ± 0.043
TyrMet: 0.486 ± 0.018
TyrAsn: 0.537 ± 0.02
TyrPro: 1.125 ± 0.028
TyrGln: 0.632 ± 0.02
TyrArg: 1.659 ± 0.036
TyrSer: 1.092 ± 0.026
TyrThr: 1.099 ± 0.034
TyrVal: 1.538 ± 0.036
TyrTrp: 0.339 ± 0.014
TyrTyr: 0.58 ± 0.018
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5048 proteins (1524221 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
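A protein-level bootstrap of this kind could be sketched as follows in Python. This is an illustration under stated assumptions, not the Proteome-pI implementation: the function names are made up for this example, whole proteins are resampled with replacement, dipeptides are counted as overlapping pairs in one-letter code, and the reported error is taken to be the standard deviation over the resampled estimates (the page does not say which spread measure is used).

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency (per mille).

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency; the standard deviation over rounds is returned.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Toy proteome in one-letter code, invented for this example.
toy_proteome = ["MAALAAG", "MKKLLAA", "MGAVAAL", "MSTAAAR"]
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.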

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski