Amino acid dipepetide frequency for Fistulifera solaris (Oleaginous diatom)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.174AlaAla: 8.174 ± 0.037
1.339AlaCys: 1.339 ± 0.011
3.983AlaAsp: 3.983 ± 0.021
4.881AlaGlu: 4.881 ± 0.03
3.307AlaPhe: 3.307 ± 0.019
4.486AlaGly: 4.486 ± 0.028
1.523AlaHis: 1.523 ± 0.011
3.999AlaIle: 3.999 ± 0.023
3.996AlaLys: 3.996 ± 0.022
8.201AlaLeu: 8.201 ± 0.04
2.132AlaMet: 2.132 ± 0.015
2.809AlaAsn: 2.809 ± 0.017
3.932AlaPro: 3.932 ± 0.035
3.04AlaGln: 3.04 ± 0.019
4.165AlaArg: 4.165 ± 0.025
6.695AlaSer: 6.695 ± 0.029
5.039AlaThr: 5.039 ± 0.023
5.602AlaVal: 5.602 ± 0.026
1.025AlaTrp: 1.025 ± 0.011
1.985AlaTyr: 1.985 ± 0.016
0.0AlaXaa: 0.0 ± 0.0
Cys
1.11CysAla: 1.11 ± 0.012
0.458CysCys: 0.458 ± 0.009
0.954CysAsp: 0.954 ± 0.012
0.863CysGlu: 0.863 ± 0.009
0.744CysPhe: 0.744 ± 0.01
1.124CysGly: 1.124 ± 0.018
0.452CysHis: 0.452 ± 0.007
0.836CysIle: 0.836 ± 0.011
0.633CysLys: 0.633 ± 0.009
1.667CysLeu: 1.667 ± 0.014
0.351CysMet: 0.351 ± 0.006
0.585CysAsn: 0.585 ± 0.009
0.807CysPro: 0.807 ± 0.011
0.719CysGln: 0.719 ± 0.009
1.009CysArg: 1.009 ± 0.011
1.271CysSer: 1.271 ± 0.014
0.862CysThr: 0.862 ± 0.01
1.079CysVal: 1.079 ± 0.012
0.249CysTrp: 0.249 ± 0.004
0.451CysTyr: 0.451 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
4.853AspAla: 4.853 ± 0.022
0.93AspCys: 0.93 ± 0.013
5.741AspAsp: 5.741 ± 0.045
5.243AspGlu: 5.243 ± 0.032
2.377AspPhe: 2.377 ± 0.017
3.926AspGly: 3.926 ± 0.03
1.308AspHis: 1.308 ± 0.011
2.855AspIle: 2.855 ± 0.018
2.481AspLys: 2.481 ± 0.019
5.196AspLeu: 5.196 ± 0.026
1.394AspMet: 1.394 ± 0.013
2.146AspAsn: 2.146 ± 0.019
3.12AspPro: 3.12 ± 0.019
1.99AspGln: 1.99 ± 0.016
2.916AspArg: 2.916 ± 0.019
4.389AspSer: 4.389 ± 0.025
3.052AspThr: 3.052 ± 0.017
3.921AspVal: 3.921 ± 0.021
0.85AspTrp: 0.85 ± 0.012
1.668AspTyr: 1.668 ± 0.014
0.0AspXaa: 0.0 ± 0.0
Glu
5.325GluAla: 5.325 ± 0.028
0.92GluCys: 0.92 ± 0.011
4.481GluAsp: 4.481 ± 0.025
6.519GluGlu: 6.519 ± 0.04
2.084GluPhe: 2.084 ± 0.014
3.545GluGly: 3.545 ± 0.021
1.481GluHis: 1.481 ± 0.012
3.222GluIle: 3.222 ± 0.019
4.166GluLys: 4.166 ± 0.03
5.836GluLeu: 5.836 ± 0.033
1.761GluMet: 1.761 ± 0.013
2.848GluAsn: 2.848 ± 0.017
2.494GluPro: 2.494 ± 0.021
3.05GluGln: 3.05 ± 0.02
3.959GluArg: 3.959 ± 0.023
5.092GluSer: 5.092 ± 0.027
3.979GluThr: 3.979 ± 0.02
3.538GluVal: 3.538 ± 0.019
0.919GluTrp: 0.919 ± 0.01
1.774GluTyr: 1.774 ± 0.014
0.0GluXaa: 0.0 ± 0.0
Phe
3.042PheAla: 3.042 ± 0.018
0.82PheCys: 0.82 ± 0.009
2.469PheAsp: 2.469 ± 0.016
2.284PheGlu: 2.284 ± 0.018
1.742PhePhe: 1.742 ± 0.017
2.72PheGly: 2.72 ± 0.023
1.025PheHis: 1.025 ± 0.011
1.557PheIle: 1.557 ± 0.014
1.246PheLys: 1.246 ± 0.011
4.092PheLeu: 4.092 ± 0.021
0.83PheMet: 0.83 ± 0.009
1.222PheAsn: 1.222 ± 0.012
1.859PhePro: 1.859 ± 0.014
1.692PheGln: 1.692 ± 0.013
2.166PheArg: 2.166 ± 0.016
3.087PheSer: 3.087 ± 0.02
1.984PheThr: 1.984 ± 0.015
2.895PheVal: 2.895 ± 0.018
0.562PheTrp: 0.562 ± 0.008
1.158PheTyr: 1.158 ± 0.012
0.0PheXaa: 0.0 ± 0.0
Gly
4.288GlyAla: 4.288 ± 0.025
1.06GlyCys: 1.06 ± 0.015
3.615GlyAsp: 3.615 ± 0.026
3.352GlyGlu: 3.352 ± 0.022
2.433GlyPhe: 2.433 ± 0.018
4.672GlyGly: 4.672 ± 0.038
1.387GlyHis: 1.387 ± 0.015
3.16GlyIle: 3.16 ± 0.022
3.42GlyLys: 3.42 ± 0.024
5.127GlyLeu: 5.127 ± 0.028
1.53GlyMet: 1.53 ± 0.015
2.536GlyAsn: 2.536 ± 0.018
2.325GlyPro: 2.325 ± 0.019
2.041GlyGln: 2.041 ± 0.015
3.422GlyArg: 3.422 ± 0.025
4.916GlySer: 4.916 ± 0.029
3.58GlyThr: 3.58 ± 0.024
3.923GlyVal: 3.923 ± 0.022
0.887GlyTrp: 0.887 ± 0.011
1.747GlyTyr: 1.747 ± 0.017
0.0GlyXaa: 0.0 ± 0.0
His
1.863HisAla: 1.863 ± 0.014
0.449HisCys: 0.449 ± 0.007
1.423HisAsp: 1.423 ± 0.014
1.452HisGlu: 1.452 ± 0.012
1.003HisPhe: 1.003 ± 0.01
1.511HisGly: 1.511 ± 0.014
0.895HisHis: 0.895 ± 0.012
1.056HisIle: 1.056 ± 0.01
0.978HisLys: 0.978 ± 0.009
2.266HisLeu: 2.266 ± 0.017
0.507HisMet: 0.507 ± 0.007
0.898HisAsn: 0.898 ± 0.011
1.484HisPro: 1.484 ± 0.014
1.04HisGln: 1.04 ± 0.011
1.514HisArg: 1.514 ± 0.015
1.834HisSer: 1.834 ± 0.015
1.133HisThr: 1.133 ± 0.012
1.637HisVal: 1.637 ± 0.015
0.365HisTrp: 0.365 ± 0.006
0.751HisTyr: 0.751 ± 0.009
0.0HisXaa: 0.0 ± 0.0
Ile
3.986IleAla: 3.986 ± 0.022
0.827IleCys: 0.827 ± 0.009
3.027IleAsp: 3.027 ± 0.018
3.048IleGlu: 3.048 ± 0.017
1.76IlePhe: 1.76 ± 0.016
2.741IleGly: 2.741 ± 0.016
1.253IleHis: 1.253 ± 0.013
2.071IleIle: 2.071 ± 0.014
1.927IleLys: 1.927 ± 0.015
4.807IleLeu: 4.807 ± 0.026
1.055IleMet: 1.055 ± 0.011
1.548IleAsn: 1.548 ± 0.012
2.863IlePro: 2.863 ± 0.021
2.27IleGln: 2.27 ± 0.017
2.843IleArg: 2.843 ± 0.016
3.501IleSer: 3.501 ± 0.021
2.466IleThr: 2.466 ± 0.018
3.457IleVal: 3.457 ± 0.019
0.559IleTrp: 0.559 ± 0.008
1.202IleTyr: 1.202 ± 0.013
0.0IleXaa: 0.0 ± 0.0
Lys
4.174LysAla: 4.174 ± 0.024
0.644LysCys: 0.644 ± 0.009
2.992LysAsp: 2.992 ± 0.017
4.058LysGlu: 4.058 ± 0.025
1.417LysPhe: 1.417 ± 0.012
2.854LysGly: 2.854 ± 0.022
1.177LysHis: 1.177 ± 0.011
2.328LysIle: 2.328 ± 0.016
3.978LysLys: 3.978 ± 0.035
4.295LysLeu: 4.295 ± 0.026
1.236LysMet: 1.236 ± 0.012
2.049LysAsn: 2.049 ± 0.015
2.268LysPro: 2.268 ± 0.017
2.38LysGln: 2.38 ± 0.018
3.507LysArg: 3.507 ± 0.025
3.8LysSer: 3.8 ± 0.023
2.916LysThr: 2.916 ± 0.018
3.047LysVal: 3.047 ± 0.019
0.617LysTrp: 0.617 ± 0.007
1.339LysTyr: 1.339 ± 0.013
0.0LysXaa: 0.0 ± 0.0
Leu
7.515LeuAla: 7.515 ± 0.03
1.659LeuCys: 1.659 ± 0.015
5.119LeuAsp: 5.119 ± 0.029
6.243LeuGlu: 6.243 ± 0.034
3.788LeuPhe: 3.788 ± 0.022
5.204LeuGly: 5.204 ± 0.032
2.444LeuHis: 2.444 ± 0.019
3.918LeuIle: 3.918 ± 0.022
4.449LeuLys: 4.449 ± 0.028
10.199LeuLeu: 10.199 ± 0.049
2.103LeuMet: 2.103 ± 0.014
3.267LeuAsn: 3.267 ± 0.019
5.099LeuPro: 5.099 ± 0.028
5.288LeuGln: 5.288 ± 0.03
5.69LeuArg: 5.69 ± 0.033
7.938LeuSer: 7.938 ± 0.041
4.98LeuThr: 4.98 ± 0.026
5.919LeuVal: 5.919 ± 0.03
1.236LeuTrp: 1.236 ± 0.013
2.45LeuTyr: 2.45 ± 0.017
0.0LeuXaa: 0.0 ± 0.0
Met
2.008MetAla: 2.008 ± 0.015
0.298MetCys: 0.298 ± 0.006
1.476MetAsp: 1.476 ± 0.012
1.786MetGlu: 1.786 ± 0.014
0.807MetPhe: 0.807 ± 0.011
1.457MetGly: 1.457 ± 0.014
0.534MetHis: 0.534 ± 0.008
1.167MetIle: 1.167 ± 0.013
1.525MetLys: 1.525 ± 0.013
2.032MetLeu: 2.032 ± 0.016
0.758MetMet: 0.758 ± 0.01
1.067MetAsn: 1.067 ± 0.011
1.093MetPro: 1.093 ± 0.012
1.087MetGln: 1.087 ± 0.011
1.257MetArg: 1.257 ± 0.012
1.871MetSer: 1.871 ± 0.012
1.583MetThr: 1.583 ± 0.014
1.427MetVal: 1.427 ± 0.012
0.24MetTrp: 0.24 ± 0.005
0.562MetTyr: 0.562 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
3.196AsnAla: 3.196 ± 0.017
0.579AsnCys: 0.579 ± 0.008
2.678AsnAsp: 2.678 ± 0.02
2.758AsnGlu: 2.758 ± 0.016
1.329AsnPhe: 1.329 ± 0.012
2.599AsnGly: 2.599 ± 0.017
0.979AsnHis: 0.979 ± 0.011
1.785AsnIle: 1.785 ± 0.012
1.903AsnLys: 1.903 ± 0.016
3.31AsnLeu: 3.31 ± 0.022
0.948AsnMet: 0.948 ± 0.011
2.046AsnAsn: 2.046 ± 0.024
2.091AsnPro: 2.091 ± 0.016
1.596AsnGln: 1.596 ± 0.013
2.083AsnArg: 2.083 ± 0.014
2.868AsnSer: 2.868 ± 0.017
2.134AsnThr: 2.134 ± 0.017
2.548AsnVal: 2.548 ± 0.016
0.486AsnTrp: 0.486 ± 0.007
1.023AsnTyr: 1.023 ± 0.011
0.0AsnXaa: 0.0 ± 0.0
Pro
3.737ProAla: 3.737 ± 0.025
0.625ProCys: 0.625 ± 0.009
2.801ProAsp: 2.801 ± 0.021
3.183ProGlu: 3.183 ± 0.02
2.051ProPhe: 2.051 ± 0.015
2.628ProGly: 2.628 ± 0.02
1.172ProHis: 1.172 ± 0.012
2.314ProIle: 2.314 ± 0.016
2.363ProLys: 2.363 ± 0.019
4.562ProLeu: 4.562 ± 0.025
1.052ProMet: 1.052 ± 0.011
1.949ProAsn: 1.949 ± 0.014
3.712ProPro: 3.712 ± 0.036
1.913ProGln: 1.913 ± 0.018
2.526ProArg: 2.526 ± 0.016
5.13ProSer: 5.13 ± 0.047
3.584ProThr: 3.584 ± 0.041
3.582ProVal: 3.582 ± 0.031
0.622ProTrp: 0.622 ± 0.009
1.301ProTyr: 1.301 ± 0.014
0.0ProXaa: 0.0 ± 0.0
Gln
3.513GlnAla: 3.513 ± 0.019
0.651GlnCys: 0.651 ± 0.008
2.121GlnAsp: 2.121 ± 0.015
2.935GlnGlu: 2.935 ± 0.02
1.455GlnPhe: 1.455 ± 0.012
2.194GlnGly: 2.194 ± 0.016
1.266GlnHis: 1.266 ± 0.013
2.033GlnIle: 2.033 ± 0.015
2.398GlnLys: 2.398 ± 0.019
4.192GlnLeu: 4.192 ± 0.025
1.085GlnMet: 1.085 ± 0.011
1.715GlnAsn: 1.715 ± 0.014
2.051GlnPro: 2.051 ± 0.018
3.421GlnGln: 3.421 ± 0.037
2.983GlnArg: 2.983 ± 0.022
3.427GlnSer: 3.427 ± 0.023
2.329GlnThr: 2.329 ± 0.015
2.656GlnVal: 2.656 ± 0.015
0.612GlnTrp: 0.612 ± 0.008
1.195GlnTyr: 1.195 ± 0.012
0.0GlnXaa: 0.0 ± 0.0
Arg
4.021ArgAla: 4.021 ± 0.023
0.91ArgCys: 0.91 ± 0.01
3.042ArgAsp: 3.042 ± 0.021
3.525ArgGlu: 3.525 ± 0.021
2.191ArgPhe: 2.191 ± 0.016
3.092ArgGly: 3.092 ± 0.021
1.517ArgHis: 1.517 ± 0.014
2.968ArgIle: 2.968 ± 0.018
3.63ArgLys: 3.63 ± 0.025
5.462ArgLeu: 5.462 ± 0.028
1.412ArgMet: 1.412 ± 0.013
2.517ArgAsn: 2.517 ± 0.016
2.702ArgPro: 2.702 ± 0.018
2.771ArgGln: 2.771 ± 0.017
4.585ArgArg: 4.585 ± 0.03
4.539ArgSer: 4.539 ± 0.026
3.032ArgThr: 3.032 ± 0.018
3.378ArgVal: 3.378 ± 0.019
0.762ArgTrp: 0.762 ± 0.01
1.585ArgTyr: 1.585 ± 0.014
0.0ArgXaa: 0.0 ± 0.0
Ser
6.015SerAla: 6.015 ± 0.028
1.252SerCys: 1.252 ± 0.013
4.692SerAsp: 4.692 ± 0.028
4.571SerGlu: 4.571 ± 0.023
3.614SerPhe: 3.614 ± 0.022
4.693SerGly: 4.693 ± 0.027
1.838SerHis: 1.838 ± 0.015
4.071SerIle: 4.071 ± 0.016
4.073SerLys: 4.073 ± 0.025
7.921SerLeu: 7.921 ± 0.034
1.949SerMet: 1.949 ± 0.014
3.399SerAsn: 3.399 ± 0.02
4.342SerPro: 4.342 ± 0.034
3.197SerGln: 3.197 ± 0.021
4.352SerArg: 4.352 ± 0.027
9.329SerSer: 9.329 ± 0.048
5.219SerThr: 5.219 ± 0.025
5.204SerVal: 5.204 ± 0.022
1.021SerTrp: 1.021 ± 0.01
2.0SerTyr: 2.0 ± 0.014
0.0SerXaa: 0.0 ± 0.0
Thr
4.914ThrAla: 4.914 ± 0.023
0.946ThrCys: 0.946 ± 0.011
3.058ThrAsp: 3.058 ± 0.02
3.456ThrGlu: 3.456 ± 0.022
2.232ThrPhe: 2.232 ± 0.016
3.538ThrGly: 3.538 ± 0.021
1.161ThrHis: 1.161 ± 0.013
3.091ThrIle: 3.091 ± 0.018
2.983ThrLys: 2.983 ± 0.02
5.372ThrLeu: 5.372 ± 0.024
1.398ThrMet: 1.398 ± 0.012
2.324ThrAsn: 2.324 ± 0.016
3.377ThrPro: 3.377 ± 0.024
2.117ThrGln: 2.117 ± 0.015
2.908ThrArg: 2.908 ± 0.018
5.054ThrSer: 5.054 ± 0.028
4.334ThrThr: 4.334 ± 0.03
3.981ThrVal: 3.981 ± 0.02
0.715ThrTrp: 0.715 ± 0.011
1.339ThrTyr: 1.339 ± 0.012
0.0ThrXaa: 0.0 ± 0.0
Val
5.841ValAla: 5.841 ± 0.026
1.089ValCys: 1.089 ± 0.011
4.108ValAsp: 4.108 ± 0.022
4.317ValGlu: 4.317 ± 0.024
2.475ValPhe: 2.475 ± 0.015
3.823ValGly: 3.823 ± 0.025
1.551ValHis: 1.551 ± 0.012
2.913ValIle: 2.913 ± 0.017
2.886ValLys: 2.886 ± 0.018
6.303ValLeu: 6.303 ± 0.029
1.463ValMet: 1.463 ± 0.011
2.227ValAsn: 2.227 ± 0.013
3.471ValPro: 3.471 ± 0.028
2.795ValGln: 2.795 ± 0.018
3.396ValArg: 3.396 ± 0.02
5.049ValSer: 5.049 ± 0.025
3.899ValThr: 3.899 ± 0.02
4.803ValVal: 4.803 ± 0.025
0.822ValTrp: 0.822 ± 0.01
1.703ValTyr: 1.703 ± 0.012
0.0ValXaa: 0.0 ± 0.0
Trp
0.788TrpAla: 0.788 ± 0.01
0.235TrpCys: 0.235 ± 0.004
0.811TrpAsp: 0.811 ± 0.01
0.817TrpGlu: 0.817 ± 0.008
0.485TrpPhe: 0.485 ± 0.008
0.73TrpGly: 0.73 ± 0.009
0.314TrpHis: 0.314 ± 0.005
0.738TrpIle: 0.738 ± 0.009
0.927TrpLys: 0.927 ± 0.008
1.199TrpLeu: 1.199 ± 0.016
0.445TrpMet: 0.445 ± 0.007
0.707TrpAsn: 0.707 ± 0.009
0.447TrpPro: 0.447 ± 0.006
0.584TrpGln: 0.584 ± 0.008
0.77TrpArg: 0.77 ± 0.009
1.009TrpSer: 1.009 ± 0.009
0.835TrpThr: 0.835 ± 0.011
0.711TrpVal: 0.711 ± 0.008
0.239TrpTrp: 0.239 ± 0.005
0.389TrpTyr: 0.389 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.949TyrAla: 1.949 ± 0.016
0.534TyrCys: 0.534 ± 0.008
1.764TyrAsp: 1.764 ± 0.016
1.701TyrGlu: 1.701 ± 0.015
1.158TyrPhe: 1.158 ± 0.011
1.881TyrGly: 1.881 ± 0.016
0.778TyrHis: 0.778 ± 0.009
1.13TyrIle: 1.13 ± 0.011
1.114TyrLys: 1.114 ± 0.012
2.469TyrLeu: 2.469 ± 0.017
0.628TyrMet: 0.628 ± 0.008
1.08TyrAsn: 1.08 ± 0.012
1.27TyrPro: 1.27 ± 0.013
1.245TyrGln: 1.245 ± 0.012
1.591TyrArg: 1.591 ± 0.015
1.954TyrSer: 1.954 ± 0.015
1.357TyrThr: 1.357 ± 0.012
1.625TyrVal: 1.625 ± 0.013
0.391TyrTrp: 0.391 ± 0.007
0.952TyrTyr: 0.952 ± 0.012
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 20319 proteins (10061793 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski