Amino acid dipepetide frequency for Octopus bimaculoides (California two-spotted octopus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.298AlaAla: 4.298 ± 0.036
1.025AlaCys: 1.025 ± 0.011
2.56AlaAsp: 2.56 ± 0.017
3.255AlaGlu: 3.255 ± 0.021
1.991AlaPhe: 1.991 ± 0.018
2.514AlaGly: 2.514 ± 0.019
1.14AlaHis: 1.14 ± 0.012
3.021AlaIle: 3.021 ± 0.021
3.249AlaLys: 3.249 ± 0.022
4.432AlaLeu: 4.432 ± 0.029
1.225AlaMet: 1.225 ± 0.012
2.408AlaAsn: 2.408 ± 0.017
2.141AlaPro: 2.141 ± 0.018
1.918AlaGln: 1.918 ± 0.017
2.248AlaArg: 2.248 ± 0.021
4.419AlaSer: 4.419 ± 0.027
3.507AlaThr: 3.507 ± 0.032
3.58AlaVal: 3.58 ± 0.021
0.402AlaTrp: 0.402 ± 0.007
1.359AlaTyr: 1.359 ± 0.012
0.0AlaXaa: 0.0 ± 0.0
Cys
0.937CysAla: 0.937 ± 0.012
0.675CysCys: 0.675 ± 0.011
2.096CysAsp: 2.096 ± 0.036
1.422CysGlu: 1.422 ± 0.02
0.914CysPhe: 0.914 ± 0.011
2.205CysGly: 2.205 ± 0.036
0.749CysHis: 0.749 ± 0.01
1.304CysIle: 1.304 ± 0.016
1.468CysLys: 1.468 ± 0.015
1.998CysLeu: 1.998 ± 0.018
0.515CysMet: 0.515 ± 0.008
1.395CysAsn: 1.395 ± 0.018
1.009CysPro: 1.009 ± 0.015
0.992CysGln: 0.992 ± 0.015
1.088CysArg: 1.088 ± 0.014
2.073CysSer: 2.073 ± 0.02
1.204CysThr: 1.204 ± 0.018
1.42CysVal: 1.42 ± 0.017
0.231CysTrp: 0.231 ± 0.005
0.704CysTyr: 0.704 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
2.444AspAla: 2.444 ± 0.016
1.094AspCys: 1.094 ± 0.015
5.913AspAsp: 5.913 ± 0.1
3.718AspGlu: 3.718 ± 0.026
2.028AspPhe: 2.028 ± 0.016
2.988AspGly: 2.988 ± 0.024
1.215AspHis: 1.215 ± 0.012
4.187AspIle: 4.187 ± 0.029
3.411AspLys: 3.411 ± 0.025
4.34AspLeu: 4.34 ± 0.025
1.201AspMet: 1.201 ± 0.011
2.986AspAsn: 2.986 ± 0.021
1.885AspPro: 1.885 ± 0.021
1.75AspGln: 1.75 ± 0.016
2.274AspArg: 2.274 ± 0.02
4.501AspSer: 4.501 ± 0.031
2.738AspThr: 2.738 ± 0.02
3.363AspVal: 3.363 ± 0.021
0.505AspTrp: 0.505 ± 0.008
1.653AspTyr: 1.653 ± 0.015
0.0AspXaa: 0.0 ± 0.0
Glu
3.238GluAla: 3.238 ± 0.025
1.209GluCys: 1.209 ± 0.014
3.808GluAsp: 3.808 ± 0.027
7.691GluGlu: 7.691 ± 0.091
2.042GluPhe: 2.042 ± 0.016
2.768GluGly: 2.768 ± 0.022
1.362GluHis: 1.362 ± 0.012
3.888GluIle: 3.888 ± 0.027
6.585GluLys: 6.585 ± 0.047
4.978GluLeu: 4.978 ± 0.032
1.665GluMet: 1.665 ± 0.012
3.793GluAsn: 3.793 ± 0.024
2.016GluPro: 2.016 ± 0.029
2.487GluGln: 2.487 ± 0.022
4.336GluArg: 4.336 ± 0.048
4.559GluSer: 4.559 ± 0.035
3.571GluThr: 3.571 ± 0.025
3.416GluVal: 3.416 ± 0.023
0.565GluTrp: 0.565 ± 0.008
1.637GluTyr: 1.637 ± 0.014
0.001GluXaa: 0.001 ± 0.0
Phe
1.873PheAla: 1.873 ± 0.014
1.027PheCys: 1.027 ± 0.013
1.96PheAsp: 1.96 ± 0.015
2.02PheGlu: 2.02 ± 0.016
1.796PhePhe: 1.796 ± 0.022
1.954PheGly: 1.954 ± 0.017
1.267PheHis: 1.267 ± 0.013
2.415PheIle: 2.415 ± 0.02
2.139PheLys: 2.139 ± 0.017
3.804PheLeu: 3.804 ± 0.028
0.869PheMet: 0.869 ± 0.01
1.827PheAsn: 1.827 ± 0.016
1.722PhePro: 1.722 ± 0.014
1.661PheGln: 1.661 ± 0.014
1.734PheArg: 1.734 ± 0.013
4.259PheSer: 4.259 ± 0.031
2.394PheThr: 2.394 ± 0.016
2.589PheVal: 2.589 ± 0.027
0.419PheTrp: 0.419 ± 0.007
1.38PheTyr: 1.38 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
2.312GlyAla: 2.312 ± 0.019
0.999GlyCys: 0.999 ± 0.013
2.72GlyAsp: 2.72 ± 0.02
3.934GlyGlu: 3.934 ± 0.041
1.966GlyPhe: 1.966 ± 0.016
3.582GlyGly: 3.582 ± 0.044
1.428GlyHis: 1.428 ± 0.015
2.802GlyIle: 2.802 ± 0.019
4.318GlyLys: 4.318 ± 0.038
3.607GlyLeu: 3.607 ± 0.025
1.113GlyMet: 1.113 ± 0.013
2.734GlyAsn: 2.734 ± 0.022
1.654GlyPro: 1.654 ± 0.021
1.901GlyGln: 1.901 ± 0.019
2.481GlyArg: 2.481 ± 0.022
4.355GlySer: 4.355 ± 0.033
2.722GlyThr: 2.722 ± 0.022
2.665GlyVal: 2.665 ± 0.019
0.501GlyTrp: 0.501 ± 0.008
1.655GlyTyr: 1.655 ± 0.017
0.0GlyXaa: 0.0 ± 0.0
His
1.073HisAla: 1.073 ± 0.01
1.398HisCys: 1.398 ± 0.025
1.063HisAsp: 1.063 ± 0.009
1.259HisGlu: 1.259 ± 0.01
1.12HisPhe: 1.12 ± 0.01
1.206HisGly: 1.206 ± 0.012
2.081HisHis: 2.081 ± 0.049
1.848HisIle: 1.848 ± 0.016
2.423HisLys: 2.423 ± 0.029
2.787HisLeu: 2.787 ± 0.019
0.688HisMet: 0.688 ± 0.009
1.316HisAsn: 1.316 ± 0.013
1.257HisPro: 1.257 ± 0.011
1.407HisGln: 1.407 ± 0.014
1.563HisArg: 1.563 ± 0.016
2.531HisSer: 2.531 ± 0.018
2.818HisThr: 2.818 ± 0.039
1.503HisVal: 1.503 ± 0.013
0.277HisTrp: 0.277 ± 0.005
1.036HisTyr: 1.036 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
3.008IleAla: 3.008 ± 0.023
2.385IleCys: 2.385 ± 0.035
2.882IleAsp: 2.882 ± 0.019
3.157IleGlu: 3.157 ± 0.022
2.46IlePhe: 2.46 ± 0.021
2.574IleGly: 2.574 ± 0.019
2.475IleHis: 2.475 ± 0.03
5.033IleIle: 5.033 ± 0.077
3.547IleLys: 3.547 ± 0.021
5.189IleLeu: 5.189 ± 0.034
1.187IleMet: 1.187 ± 0.012
3.1IleAsn: 3.1 ± 0.018
2.832IlePro: 2.832 ± 0.021
2.312IleGln: 2.312 ± 0.016
2.593IleArg: 2.593 ± 0.019
5.101IleSer: 5.101 ± 0.028
3.795IleThr: 3.795 ± 0.03
3.5IleVal: 3.5 ± 0.024
0.55IleTrp: 0.55 ± 0.008
2.01IleTyr: 2.01 ± 0.017
0.001IleXaa: 0.001 ± 0.0
Lys
3.375LysAla: 3.375 ± 0.019
1.577LysCys: 1.577 ± 0.02
3.713LysAsp: 3.713 ± 0.029
5.546LysGlu: 5.546 ± 0.039
2.28LysPhe: 2.28 ± 0.017
2.919LysGly: 2.919 ± 0.024
1.998LysHis: 1.998 ± 0.02
3.828LysIle: 3.828 ± 0.023
7.194LysLys: 7.194 ± 0.081
5.79LysLeu: 5.79 ± 0.033
1.689LysMet: 1.689 ± 0.012
3.69LysAsn: 3.69 ± 0.023
3.746LysPro: 3.746 ± 0.035
2.952LysGln: 2.952 ± 0.022
4.288LysArg: 4.288 ± 0.034
6.179LysSer: 6.179 ± 0.041
4.21LysThr: 4.21 ± 0.03
3.679LysVal: 3.679 ± 0.021
0.669LysTrp: 0.669 ± 0.008
2.198LysTyr: 2.198 ± 0.015
0.001LysXaa: 0.001 ± 0.0
Leu
4.605LeuAla: 4.605 ± 0.027
1.969LeuCys: 1.969 ± 0.016
4.108LeuAsp: 4.108 ± 0.027
5.324LeuGlu: 5.324 ± 0.035
3.43LeuPhe: 3.43 ± 0.025
3.48LeuGly: 3.48 ± 0.024
2.702LeuHis: 2.702 ± 0.024
4.552LeuIle: 4.552 ± 0.024
6.156LeuLys: 6.156 ± 0.034
8.812LeuLeu: 8.812 ± 0.064
1.955LeuMet: 1.955 ± 0.016
4.352LeuAsn: 4.352 ± 0.025
4.388LeuPro: 4.388 ± 0.027
4.48LeuGln: 4.48 ± 0.032
4.112LeuArg: 4.112 ± 0.026
7.774LeuSer: 7.774 ± 0.042
5.561LeuThr: 5.561 ± 0.029
4.528LeuVal: 4.528 ± 0.026
0.863LeuTrp: 0.863 ± 0.01
2.66LeuTyr: 2.66 ± 0.019
0.001LeuXaa: 0.001 ± 0.0
Met
1.497MetAla: 1.497 ± 0.015
0.531MetCys: 0.531 ± 0.007
1.168MetAsp: 1.168 ± 0.012
1.576MetGlu: 1.576 ± 0.013
0.938MetPhe: 0.938 ± 0.01
0.932MetGly: 0.932 ± 0.011
0.546MetHis: 0.546 ± 0.007
1.145MetIle: 1.145 ± 0.012
1.854MetLys: 1.854 ± 0.013
2.009MetLeu: 2.009 ± 0.016
0.791MetMet: 0.791 ± 0.018
1.162MetAsn: 1.162 ± 0.011
0.973MetPro: 0.973 ± 0.011
0.901MetGln: 0.901 ± 0.009
1.021MetArg: 1.021 ± 0.01
1.98MetSer: 1.98 ± 0.015
1.385MetThr: 1.385 ± 0.013
1.346MetVal: 1.346 ± 0.013
0.226MetTrp: 0.226 ± 0.004
0.757MetTyr: 0.757 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
2.463AsnAla: 2.463 ± 0.016
1.244AsnCys: 1.244 ± 0.016
2.845AsnAsp: 2.845 ± 0.022
3.107AsnGlu: 3.107 ± 0.023
2.029AsnPhe: 2.029 ± 0.017
2.905AsnGly: 2.905 ± 0.024
1.408AsnHis: 1.408 ± 0.012
3.802AsnIle: 3.802 ± 0.022
3.542AsnLys: 3.542 ± 0.022
4.632AsnLeu: 4.632 ± 0.025
1.221AsnMet: 1.221 ± 0.011
4.894AsnAsn: 4.894 ± 0.067
2.124AsnPro: 2.124 ± 0.016
2.206AsnGln: 2.206 ± 0.018
2.307AsnArg: 2.307 ± 0.017
5.026AsnSer: 5.026 ± 0.035
3.199AsnThr: 3.199 ± 0.025
3.219AsnVal: 3.219 ± 0.019
0.503AsnTrp: 0.503 ± 0.006
1.721AsnTyr: 1.721 ± 0.013
0.001AsnXaa: 0.001 ± 0.0
Pro
2.441ProAla: 2.441 ± 0.018
0.914ProCys: 0.914 ± 0.017
2.2ProAsp: 2.2 ± 0.018
2.695ProGlu: 2.695 ± 0.021
1.9ProPhe: 1.9 ± 0.015
2.186ProGly: 2.186 ± 0.026
1.239ProHis: 1.239 ± 0.013
2.096ProIle: 2.096 ± 0.016
2.696ProLys: 2.696 ± 0.024
3.89ProLeu: 3.89 ± 0.028
0.873ProMet: 0.873 ± 0.012
2.162ProAsn: 2.162 ± 0.017
3.817ProPro: 3.817 ± 0.048
1.951ProGln: 1.951 ± 0.018
1.804ProArg: 1.804 ± 0.018
4.497ProSer: 4.497 ± 0.035
2.914ProThr: 2.914 ± 0.034
2.953ProVal: 2.953 ± 0.019
0.366ProTrp: 0.366 ± 0.007
2.085ProTyr: 2.085 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
2.019GlnAla: 2.019 ± 0.016
1.073GlnCys: 1.073 ± 0.016
1.845GlnAsp: 1.845 ± 0.015
2.749GlnGlu: 2.749 ± 0.025
1.438GlnPhe: 1.438 ± 0.012
1.678GlnGly: 1.678 ± 0.018
1.409GlnHis: 1.409 ± 0.017
2.298GlnIle: 2.298 ± 0.018
3.009GlnLys: 3.009 ± 0.021
4.06GlnLeu: 4.06 ± 0.028
1.109GlnMet: 1.109 ± 0.01
2.452GlnAsn: 2.452 ± 0.019
1.98GlnPro: 1.98 ± 0.021
4.015GlnGln: 4.015 ± 0.076
2.236GlnArg: 2.236 ± 0.019
3.355GlnSer: 3.355 ± 0.027
2.63GlnThr: 2.63 ± 0.021
2.12GlnVal: 2.12 ± 0.017
0.401GlnTrp: 0.401 ± 0.006
1.218GlnTyr: 1.218 ± 0.01
0.0GlnXaa: 0.0 ± 0.0
Arg
2.046ArgAla: 2.046 ± 0.017
1.131ArgCys: 1.131 ± 0.013
2.333ArgAsp: 2.333 ± 0.021
3.652ArgGlu: 3.652 ± 0.041
1.671ArgPhe: 1.671 ± 0.013
2.265ArgGly: 2.265 ± 0.021
1.647ArgHis: 1.647 ± 0.015
3.099ArgIle: 3.099 ± 0.023
4.181ArgLys: 4.181 ± 0.028
3.95ArgLeu: 3.95 ± 0.026
1.148ArgMet: 1.148 ± 0.012
2.65ArgAsn: 2.65 ± 0.017
1.971ArgPro: 1.971 ± 0.017
2.114ArgGln: 2.114 ± 0.016
3.803ArgArg: 3.803 ± 0.042
4.074ArgSer: 4.074 ± 0.035
2.959ArgThr: 2.959 ± 0.041
2.325ArgVal: 2.325 ± 0.017
0.53ArgTrp: 0.53 ± 0.007
1.503ArgTyr: 1.503 ± 0.014
0.0ArgXaa: 0.0 ± 0.0
Ser
4.307SerAla: 4.307 ± 0.027
1.847SerCys: 1.847 ± 0.019
4.646SerAsp: 4.646 ± 0.032
5.111SerGlu: 5.111 ± 0.039
4.522SerPhe: 4.522 ± 0.037
4.565SerGly: 4.565 ± 0.025
2.572SerHis: 2.572 ± 0.018
4.561SerIle: 4.561 ± 0.025
5.495SerLys: 5.495 ± 0.042
7.86SerLeu: 7.86 ± 0.038
1.854SerMet: 1.854 ± 0.015
4.985SerAsn: 4.985 ± 0.034
4.683SerPro: 4.683 ± 0.047
3.898SerGln: 3.898 ± 0.025
3.904SerArg: 3.904 ± 0.029
11.766SerSer: 11.766 ± 0.113
5.624SerThr: 5.624 ± 0.045
5.152SerVal: 5.152 ± 0.027
0.762SerTrp: 0.762 ± 0.011
2.552SerTyr: 2.552 ± 0.02
0.001SerXaa: 0.001 ± 0.0
Thr
3.679ThrAla: 3.679 ± 0.029
1.415ThrCys: 1.415 ± 0.019
3.099ThrAsp: 3.099 ± 0.024
3.818ThrGlu: 3.818 ± 0.036
2.369ThrPhe: 2.369 ± 0.021
3.997ThrGly: 3.997 ± 0.037
1.97ThrHis: 1.97 ± 0.021
3.611ThrIle: 3.611 ± 0.026
3.857ThrLys: 3.857 ± 0.024
4.993ThrLeu: 4.993 ± 0.025
1.265ThrMet: 1.265 ± 0.011
3.222ThrAsn: 3.222 ± 0.026
3.274ThrPro: 3.274 ± 0.032
2.098ThrGln: 2.098 ± 0.021
2.369ThrArg: 2.369 ± 0.019
5.917ThrSer: 5.917 ± 0.047
6.919ThrThr: 6.919 ± 0.138
3.861ThrVal: 3.861 ± 0.025
0.52ThrTrp: 0.52 ± 0.008
1.957ThrTyr: 1.957 ± 0.024
0.001ThrXaa: 0.001 ± 0.0
Val
3.366ValAla: 3.366 ± 0.023
1.825ValCys: 1.825 ± 0.021
3.185ValAsp: 3.185 ± 0.024
3.365ValGlu: 3.365 ± 0.024
2.306ValPhe: 2.306 ± 0.018
2.668ValGly: 2.668 ± 0.017
1.668ValHis: 1.668 ± 0.016
3.579ValIle: 3.579 ± 0.025
3.727ValLys: 3.727 ± 0.02
4.847ValLeu: 4.847 ± 0.028
1.36ValMet: 1.36 ± 0.013
2.961ValAsn: 2.961 ± 0.018
2.587ValPro: 2.587 ± 0.016
2.215ValGln: 2.215 ± 0.018
2.832ValArg: 2.832 ± 0.034
4.99ValSer: 4.99 ± 0.029
3.811ValThr: 3.811 ± 0.032
4.395ValVal: 4.395 ± 0.046
0.582ValTrp: 0.582 ± 0.008
1.76ValTyr: 1.76 ± 0.015
0.0ValXaa: 0.0 ± 0.0
Trp
0.404TrpAla: 0.404 ± 0.006
0.221TrpCys: 0.221 ± 0.004
0.479TrpAsp: 0.479 ± 0.007
0.533TrpGlu: 0.533 ± 0.007
0.399TrpPhe: 0.399 ± 0.008
0.412TrpGly: 0.412 ± 0.007
0.254TrpHis: 0.254 ± 0.006
0.533TrpIle: 0.533 ± 0.007
0.817TrpLys: 0.817 ± 0.011
0.948TrpLeu: 0.948 ± 0.011
0.265TrpMet: 0.265 ± 0.005
0.566TrpAsn: 0.566 ± 0.008
0.339TrpPro: 0.339 ± 0.006
0.435TrpGln: 0.435 ± 0.007
0.518TrpArg: 0.518 ± 0.007
0.761TrpSer: 0.761 ± 0.01
0.532TrpThr: 0.532 ± 0.008
0.472TrpVal: 0.472 ± 0.007
0.151TrpTrp: 0.151 ± 0.004
0.301TrpTyr: 0.301 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.303TyrAla: 1.303 ± 0.011
0.841TyrCys: 0.841 ± 0.01
1.573TyrAsp: 1.573 ± 0.012
1.684TyrGlu: 1.684 ± 0.017
1.509TyrPhe: 1.509 ± 0.013
1.652TyrGly: 1.652 ± 0.016
1.524TyrHis: 1.524 ± 0.022
2.085TyrIle: 2.085 ± 0.018
1.853TyrLys: 1.853 ± 0.017
2.885TyrLeu: 2.885 ± 0.023
0.747TyrMet: 0.747 ± 0.008
1.742TyrAsn: 1.742 ± 0.015
1.281TyrPro: 1.281 ± 0.012
1.399TyrGln: 1.399 ± 0.012
1.541TyrArg: 1.541 ± 0.014
2.549TyrSer: 2.549 ± 0.015
1.719TyrThr: 1.719 ± 0.013
1.944TyrVal: 1.944 ± 0.025
0.317TyrTrp: 0.317 ± 0.007
1.396TyrTyr: 1.396 ± 0.026
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.097XaaXaa: 0.097 ± 0.044
Statistics based on 36239 proteins (12291891 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski