Amino acid dipepetide frequency for Octopus vulgaris (Common octopus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.563AlaAla: 4.563 ± 0.036
1.018AlaCys: 1.018 ± 0.011
2.836AlaAsp: 2.836 ± 0.017
3.612AlaGlu: 3.612 ± 0.026
1.972AlaPhe: 1.972 ± 0.012
2.74AlaGly: 2.74 ± 0.017
1.097AlaHis: 1.097 ± 0.01
2.921AlaIle: 2.921 ± 0.017
3.538AlaLys: 3.538 ± 0.026
4.434AlaLeu: 4.434 ± 0.02
1.208AlaMet: 1.208 ± 0.008
2.557AlaAsn: 2.557 ± 0.014
2.528AlaPro: 2.528 ± 0.024
2.022AlaGln: 2.022 ± 0.015
2.171AlaArg: 2.171 ± 0.014
4.822AlaSer: 4.822 ± 0.024
3.629AlaThr: 3.629 ± 0.03
3.704AlaVal: 3.704 ± 0.019
0.413AlaTrp: 0.413 ± 0.006
1.339AlaTyr: 1.339 ± 0.01
0.0AlaXaa: 0.0 ± 0.0
Cys
0.952CysAla: 0.952 ± 0.018
0.577CysCys: 0.577 ± 0.007
1.795CysAsp: 1.795 ± 0.027
1.418CysGlu: 1.418 ± 0.018
0.791CysPhe: 0.791 ± 0.008
1.837CysGly: 1.837 ± 0.03
0.678CysHis: 0.678 ± 0.013
1.205CysIle: 1.205 ± 0.011
1.402CysLys: 1.402 ± 0.014
1.848CysLeu: 1.848 ± 0.013
0.411CysMet: 0.411 ± 0.005
1.247CysAsn: 1.247 ± 0.016
1.045CysPro: 1.045 ± 0.013
1.087CysGln: 1.087 ± 0.018
1.063CysArg: 1.063 ± 0.012
1.979CysSer: 1.979 ± 0.019
1.102CysThr: 1.102 ± 0.015
1.208CysVal: 1.208 ± 0.012
0.208CysTrp: 0.208 ± 0.003
0.599CysTyr: 0.599 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
2.582AspAla: 2.582 ± 0.015
1.094AspCys: 1.094 ± 0.011
3.962AspAsp: 3.962 ± 0.033
3.924AspGlu: 3.924 ± 0.021
2.197AspPhe: 2.197 ± 0.014
3.203AspGly: 3.203 ± 0.026
1.253AspHis: 1.253 ± 0.011
4.105AspIle: 4.105 ± 0.025
3.758AspLys: 3.758 ± 0.025
4.671AspLeu: 4.671 ± 0.023
1.24AspMet: 1.24 ± 0.009
2.988AspAsn: 2.988 ± 0.02
2.343AspPro: 2.343 ± 0.031
1.909AspGln: 1.909 ± 0.015
2.337AspArg: 2.337 ± 0.016
5.065AspSer: 5.065 ± 0.025
3.004AspThr: 3.004 ± 0.018
3.681AspVal: 3.681 ± 0.034
0.521AspTrp: 0.521 ± 0.006
1.773AspTyr: 1.773 ± 0.013
0.0AspXaa: 0.0 ± 0.0
Glu
3.541GluAla: 3.541 ± 0.023
1.244GluCys: 1.244 ± 0.016
4.295GluAsp: 4.295 ± 0.031
6.647GluGlu: 6.647 ± 0.054
2.131GluPhe: 2.131 ± 0.013
2.791GluGly: 2.791 ± 0.018
1.412GluHis: 1.412 ± 0.01
4.062GluIle: 4.062 ± 0.024
6.424GluLys: 6.424 ± 0.038
5.369GluLeu: 5.369 ± 0.034
1.705GluMet: 1.705 ± 0.013
4.116GluAsn: 4.116 ± 0.023
2.614GluPro: 2.614 ± 0.033
2.726GluGln: 2.726 ± 0.02
3.236GluArg: 3.236 ± 0.021
5.146GluSer: 5.146 ± 0.03
3.938GluThr: 3.938 ± 0.024
3.604GluVal: 3.604 ± 0.023
0.568GluTrp: 0.568 ± 0.007
1.774GluTyr: 1.774 ± 0.013
0.001GluXaa: 0.001 ± 0.0
Phe
1.814PheAla: 1.814 ± 0.011
0.889PheCys: 0.889 ± 0.009
2.066PheAsp: 2.066 ± 0.013
2.208PheGlu: 2.208 ± 0.014
1.467PhePhe: 1.467 ± 0.013
2.073PheGly: 2.073 ± 0.018
1.117PheHis: 1.117 ± 0.01
2.161PheIle: 2.161 ± 0.017
2.133PheLys: 2.133 ± 0.012
3.397PheLeu: 3.397 ± 0.018
0.766PheMet: 0.766 ± 0.007
1.813PheAsn: 1.813 ± 0.012
1.699PhePro: 1.699 ± 0.014
1.69PheGln: 1.69 ± 0.011
1.685PheArg: 1.685 ± 0.014
3.895PheSer: 3.895 ± 0.026
2.232PheThr: 2.232 ± 0.014
2.384PheVal: 2.384 ± 0.03
0.393PheTrp: 0.393 ± 0.006
1.297PheTyr: 1.297 ± 0.009
0.002PheXaa: 0.002 ± 0.0
Gly
2.463GlyAla: 2.463 ± 0.016
0.98GlyCys: 0.98 ± 0.015
2.732GlyAsp: 2.732 ± 0.019
3.608GlyGlu: 3.608 ± 0.03
2.006GlyPhe: 2.006 ± 0.013
3.916GlyGly: 3.916 ± 0.065
1.397GlyHis: 1.397 ± 0.018
2.929GlyIle: 2.929 ± 0.019
4.342GlyLys: 4.342 ± 0.044
3.746GlyLeu: 3.746 ± 0.019
1.129GlyMet: 1.129 ± 0.01
2.847GlyAsn: 2.847 ± 0.017
2.054GlyPro: 2.054 ± 0.029
1.98GlyGln: 1.98 ± 0.016
2.423GlyArg: 2.423 ± 0.019
4.894GlySer: 4.894 ± 0.033
2.97GlyThr: 2.97 ± 0.03
2.808GlyVal: 2.808 ± 0.018
0.52GlyTrp: 0.52 ± 0.007
1.731GlyTyr: 1.731 ± 0.016
0.001GlyXaa: 0.001 ± 0.0
His
1.029HisAla: 1.029 ± 0.009
1.069HisCys: 1.069 ± 0.019
1.06HisAsp: 1.06 ± 0.008
1.306HisGlu: 1.306 ± 0.011
1.035HisPhe: 1.035 ± 0.008
1.273HisGly: 1.273 ± 0.011
1.158HisHis: 1.158 ± 0.018
1.571HisIle: 1.571 ± 0.012
2.103HisLys: 2.103 ± 0.022
2.687HisLeu: 2.687 ± 0.022
0.599HisMet: 0.599 ± 0.006
1.256HisAsn: 1.256 ± 0.027
1.296HisPro: 1.296 ± 0.012
1.356HisGln: 1.356 ± 0.012
1.428HisArg: 1.428 ± 0.01
2.441HisSer: 2.441 ± 0.018
2.166HisThr: 2.166 ± 0.03
1.425HisVal: 1.425 ± 0.009
0.261HisTrp: 0.261 ± 0.004
0.881HisTyr: 0.881 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
2.876IleAla: 2.876 ± 0.014
1.929IleCys: 1.929 ± 0.027
3.031IleAsp: 3.031 ± 0.016
3.343IleGlu: 3.343 ± 0.019
2.206IlePhe: 2.206 ± 0.015
2.634IleGly: 2.634 ± 0.018
2.01IleHis: 2.01 ± 0.027
3.233IleIle: 3.233 ± 0.021
3.651IleLys: 3.651 ± 0.018
4.938IleLeu: 4.938 ± 0.023
1.084IleMet: 1.084 ± 0.009
2.98IleAsn: 2.98 ± 0.017
2.939IlePro: 2.939 ± 0.019
2.36IleGln: 2.36 ± 0.015
2.488IleArg: 2.488 ± 0.015
5.044IleSer: 5.044 ± 0.02
3.444IleThr: 3.444 ± 0.023
3.091IleVal: 3.091 ± 0.017
0.532IleTrp: 0.532 ± 0.006
1.739IleTyr: 1.739 ± 0.011
0.001IleXaa: 0.001 ± 0.0
Lys
3.546LysAla: 3.546 ± 0.019
1.554LysCys: 1.554 ± 0.018
4.325LysAsp: 4.325 ± 0.036
5.517LysGlu: 5.517 ± 0.032
2.322LysPhe: 2.322 ± 0.012
2.964LysGly: 2.964 ± 0.018
1.86LysHis: 1.86 ± 0.013
3.926LysIle: 3.926 ± 0.018
6.058LysLys: 6.058 ± 0.028
5.967LysLeu: 5.967 ± 0.028
1.678LysMet: 1.678 ± 0.01
3.683LysAsn: 3.683 ± 0.015
3.869LysPro: 3.869 ± 0.032
3.065LysGln: 3.065 ± 0.02
4.117LysArg: 4.117 ± 0.04
6.442LysSer: 6.442 ± 0.039
4.454LysThr: 4.454 ± 0.037
3.804LysVal: 3.804 ± 0.02
0.649LysTrp: 0.649 ± 0.008
2.171LysTyr: 2.171 ± 0.013
0.001LysXaa: 0.001 ± 0.0
Leu
4.742LeuAla: 4.742 ± 0.025
1.796LeuCys: 1.796 ± 0.011
4.437LeuAsp: 4.437 ± 0.021
5.691LeuGlu: 5.691 ± 0.04
3.133LeuPhe: 3.133 ± 0.017
3.613LeuGly: 3.613 ± 0.019
2.511LeuHis: 2.511 ± 0.018
4.204LeuIle: 4.204 ± 0.019
6.508LeuLys: 6.508 ± 0.029
8.046LeuLeu: 8.046 ± 0.04
1.871LeuMet: 1.871 ± 0.012
4.39LeuAsn: 4.39 ± 0.02
4.457LeuPro: 4.457 ± 0.019
4.614LeuGln: 4.614 ± 0.029
4.014LeuArg: 4.014 ± 0.022
7.671LeuSer: 7.671 ± 0.031
5.288LeuThr: 5.288 ± 0.025
4.471LeuVal: 4.471 ± 0.022
0.79LeuTrp: 0.79 ± 0.008
2.449LeuTyr: 2.449 ± 0.014
0.001LeuXaa: 0.001 ± 0.0
Met
1.48MetAla: 1.48 ± 0.011
0.464MetCys: 0.464 ± 0.005
1.246MetAsp: 1.246 ± 0.009
1.642MetGlu: 1.642 ± 0.011
0.993MetPhe: 0.993 ± 0.024
0.936MetGly: 0.936 ± 0.008
0.48MetHis: 0.48 ± 0.006
0.998MetIle: 0.998 ± 0.009
1.778MetLys: 1.778 ± 0.01
1.864MetLeu: 1.864 ± 0.012
0.621MetMet: 0.621 ± 0.007
1.154MetAsn: 1.154 ± 0.009
1.029MetPro: 1.029 ± 0.009
0.939MetGln: 0.939 ± 0.008
0.981MetArg: 0.981 ± 0.008
1.977MetSer: 1.977 ± 0.011
1.328MetThr: 1.328 ± 0.008
1.258MetVal: 1.258 ± 0.009
0.203MetTrp: 0.203 ± 0.003
0.616MetTyr: 0.616 ± 0.007
0.0MetXaa: 0.0 ± 0.0
Asn
2.562AsnAla: 2.562 ± 0.014
1.259AsnCys: 1.259 ± 0.021
2.793AsnAsp: 2.793 ± 0.021
3.191AsnGlu: 3.191 ± 0.017
2.033AsnPhe: 2.033 ± 0.014
2.983AsnGly: 2.983 ± 0.026
1.3AsnHis: 1.3 ± 0.011
3.659AsnIle: 3.659 ± 0.019
3.574AsnLys: 3.574 ± 0.017
4.744AsnLeu: 4.744 ± 0.023
1.238AsnMet: 1.238 ± 0.009
4.113AsnAsn: 4.113 ± 0.049
2.342AsnPro: 2.342 ± 0.016
2.342AsnGln: 2.342 ± 0.015
2.305AsnArg: 2.305 ± 0.017
5.302AsnSer: 5.302 ± 0.031
3.211AsnThr: 3.211 ± 0.017
3.197AsnVal: 3.197 ± 0.017
0.495AsnTrp: 0.495 ± 0.006
1.687AsnTyr: 1.687 ± 0.01
0.001AsnXaa: 0.001 ± 0.0
Pro
2.901ProAla: 2.901 ± 0.024
0.889ProCys: 0.889 ± 0.014
2.656ProAsp: 2.656 ± 0.023
3.186ProGlu: 3.186 ± 0.024
1.797ProPhe: 1.797 ± 0.015
2.52ProGly: 2.52 ± 0.028
1.255ProHis: 1.255 ± 0.013
2.224ProIle: 2.224 ± 0.015
3.121ProLys: 3.121 ± 0.022
3.911ProLeu: 3.911 ± 0.022
1.042ProMet: 1.042 ± 0.024
2.427ProAsn: 2.427 ± 0.018
4.378ProPro: 4.378 ± 0.044
2.232ProGln: 2.232 ± 0.018
1.924ProArg: 1.924 ± 0.016
5.099ProSer: 5.099 ± 0.039
3.32ProThr: 3.32 ± 0.038
3.374ProVal: 3.374 ± 0.026
0.363ProTrp: 0.363 ± 0.005
1.88ProTyr: 1.88 ± 0.024
0.001ProXaa: 0.001 ± 0.0
Gln
2.223GlnAla: 2.223 ± 0.018
1.055GlnCys: 1.055 ± 0.018
2.059GlnAsp: 2.059 ± 0.016
2.93GlnGlu: 2.93 ± 0.025
1.447GlnPhe: 1.447 ± 0.01
1.9GlnGly: 1.9 ± 0.022
1.369GlnHis: 1.369 ± 0.012
2.349GlnIle: 2.349 ± 0.015
3.169GlnLys: 3.169 ± 0.021
4.155GlnLeu: 4.155 ± 0.029
1.13GlnMet: 1.13 ± 0.009
2.591GlnAsn: 2.591 ± 0.015
2.26GlnPro: 2.26 ± 0.02
4.426GlnGln: 4.426 ± 0.07
2.242GlnArg: 2.242 ± 0.017
3.658GlnSer: 3.658 ± 0.023
2.666GlnThr: 2.666 ± 0.025
2.309GlnVal: 2.309 ± 0.016
0.395GlnTrp: 0.395 ± 0.006
1.245GlnTyr: 1.245 ± 0.01
0.001GlnXaa: 0.001 ± 0.0
Arg
2.091ArgAla: 2.091 ± 0.013
1.065ArgCys: 1.065 ± 0.012
2.446ArgAsp: 2.446 ± 0.023
3.118ArgGlu: 3.118 ± 0.024
1.623ArgPhe: 1.623 ± 0.011
2.133ArgGly: 2.133 ± 0.02
1.465ArgHis: 1.465 ± 0.012
2.841ArgIle: 2.841 ± 0.022
3.802ArgLys: 3.802 ± 0.02
3.907ArgLeu: 3.907 ± 0.024
1.025ArgMet: 1.025 ± 0.008
2.626ArgAsn: 2.626 ± 0.015
2.08ArgPro: 2.08 ± 0.014
2.094ArgGln: 2.094 ± 0.014
3.21ArgArg: 3.21 ± 0.021
3.962ArgSer: 3.962 ± 0.023
2.596ArgThr: 2.596 ± 0.015
2.307ArgVal: 2.307 ± 0.015
0.5ArgTrp: 0.5 ± 0.006
1.475ArgTyr: 1.475 ± 0.012
0.001ArgXaa: 0.001 ± 0.0
Ser
4.718SerAla: 4.718 ± 0.028
1.786SerCys: 1.786 ± 0.015
5.091SerAsp: 5.091 ± 0.028
5.622SerGlu: 5.622 ± 0.033
3.807SerPhe: 3.807 ± 0.028
4.983SerGly: 4.983 ± 0.031
2.474SerHis: 2.474 ± 0.022
4.378SerIle: 4.378 ± 0.018
6.181SerLys: 6.181 ± 0.036
7.768SerLeu: 7.768 ± 0.029
1.839SerMet: 1.839 ± 0.012
5.07SerAsn: 5.07 ± 0.025
5.353SerPro: 5.353 ± 0.044
4.261SerGln: 4.261 ± 0.024
4.098SerArg: 4.098 ± 0.025
12.137SerSer: 12.137 ± 0.1
6.109SerThr: 6.109 ± 0.054
5.324SerVal: 5.324 ± 0.024
0.751SerTrp: 0.751 ± 0.008
2.48SerTyr: 2.48 ± 0.014
0.001SerXaa: 0.001 ± 0.0
Thr
3.866ThrAla: 3.866 ± 0.029
1.351ThrCys: 1.351 ± 0.019
3.412ThrAsp: 3.412 ± 0.019
4.157ThrGlu: 4.157 ± 0.037
2.269ThrPhe: 2.269 ± 0.016
3.917ThrGly: 3.917 ± 0.044
1.59ThrHis: 1.59 ± 0.021
3.2ThrIle: 3.2 ± 0.02
3.803ThrLys: 3.803 ± 0.021
4.914ThrLeu: 4.914 ± 0.021
1.182ThrMet: 1.182 ± 0.009
3.15ThrAsn: 3.15 ± 0.017
3.7ThrPro: 3.7 ± 0.036
2.259ThrGln: 2.259 ± 0.02
2.28ThrArg: 2.28 ± 0.014
6.258ThrSer: 6.258 ± 0.054
5.987ThrThr: 5.987 ± 0.142
3.991ThrVal: 3.991 ± 0.023
0.526ThrTrp: 0.526 ± 0.007
1.695ThrTyr: 1.695 ± 0.013
0.001ThrXaa: 0.001 ± 0.0
Val
3.458ValAla: 3.458 ± 0.02
1.489ValCys: 1.489 ± 0.015
3.278ValAsp: 3.278 ± 0.019
3.835ValGlu: 3.835 ± 0.027
2.189ValPhe: 2.189 ± 0.015
2.854ValGly: 2.854 ± 0.031
1.541ValHis: 1.541 ± 0.01
3.259ValIle: 3.259 ± 0.014
4.007ValLys: 4.007 ± 0.019
4.765ValLeu: 4.765 ± 0.021
1.282ValMet: 1.282 ± 0.009
3.122ValAsn: 3.122 ± 0.016
2.846ValPro: 2.846 ± 0.022
2.341ValGln: 2.341 ± 0.015
2.374ValArg: 2.374 ± 0.016
5.173ValSer: 5.173 ± 0.023
3.906ValThr: 3.906 ± 0.028
3.655ValVal: 3.655 ± 0.02
0.551ValTrp: 0.551 ± 0.006
1.725ValTyr: 1.725 ± 0.013
0.001ValXaa: 0.001 ± 0.0
Trp
0.408TrpAla: 0.408 ± 0.005
0.2TrpCys: 0.2 ± 0.003
0.486TrpAsp: 0.486 ± 0.006
0.53TrpGlu: 0.53 ± 0.006
0.387TrpPhe: 0.387 ± 0.005
0.401TrpGly: 0.401 ± 0.006
0.234TrpHis: 0.234 ± 0.004
0.548TrpIle: 0.548 ± 0.006
0.78TrpLys: 0.78 ± 0.009
0.911TrpLeu: 0.911 ± 0.011
0.263TrpMet: 0.263 ± 0.005
0.564TrpAsn: 0.564 ± 0.006
0.327TrpPro: 0.327 ± 0.004
0.43TrpGln: 0.43 ± 0.005
0.467TrpArg: 0.467 ± 0.005
0.73TrpSer: 0.73 ± 0.008
0.521TrpThr: 0.521 ± 0.007
0.462TrpVal: 0.462 ± 0.006
0.134TrpTrp: 0.134 ± 0.003
0.296TrpTyr: 0.296 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.311TyrAla: 1.311 ± 0.01
0.745TyrCys: 0.745 ± 0.008
1.603TyrAsp: 1.603 ± 0.011
1.855TyrGlu: 1.855 ± 0.013
1.375TyrPhe: 1.375 ± 0.011
1.804TyrGly: 1.804 ± 0.021
1.198TyrHis: 1.198 ± 0.014
1.777TyrIle: 1.777 ± 0.014
1.84TyrLys: 1.84 ± 0.013
2.652TyrLeu: 2.652 ± 0.015
0.673TyrMet: 0.673 ± 0.006
1.637TyrAsn: 1.637 ± 0.013
1.341TyrPro: 1.341 ± 0.013
1.445TyrGln: 1.445 ± 0.017
1.525TyrArg: 1.525 ± 0.011
2.536TyrSer: 2.536 ± 0.013
1.635TyrThr: 1.635 ± 0.012
1.593TyrVal: 1.593 ± 0.011
0.304TyrTrp: 0.304 ± 0.005
1.1TyrTyr: 1.1 ± 0.011
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.002XaaLeu: 0.002 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 30330 proteins (20536984 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski