Amino acid dipeptide frequency for Crassostrea gigas (Pacific oyster) (Crassostrea angulata)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
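As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping adjacent pairs within each protein and normalizes by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter sequence encoding, and the normalization are assumptions made for illustration; the exact procedure used by the database is not described here.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over a list of sequences.

    Assumption: dipeptides are overlapping windows of length 2 inside each
    protein; pairs spanning two different proteins are never counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences in one-letter code (hypothetical, not from the oyster proteome)
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

Counting per protein keeps the last residue of one sequence from being paired with the first residue of the next.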

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
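The arithmetic behind the 0.25% baseline can be sketched in Python as follows. The helper names `classify` and `to_fraction` are hypothetical, and comparing each value against a flat 2.5‰ threshold is only an assumption about how the red and blue marking might be decided. The example values are taken from the table.

```python
N_STANDARD = 20

# Uniform expectation over the 20 x 20 standard dipeptides
expected_fraction = 1 / N_STANDARD ** 2         # 0.0025, i.e. 0.25%
expected_per_mille = 1000 * expected_fraction   # 2.5 per mille

# With Xaa as a 21st symbol there are 21 x 21 = 441 combinations
expected_fraction_with_xaa = 1 / 21 ** 2        # about 0.227%


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille copied from the table
examples = {"LeuLeu": 7.554, "SerSer": 8.609, "TrpTrp": 0.168, "AlaArg": 2.51}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```

Under this rule LeuLeu and SerSer are overrepresented and TrpTrp is strongly underrepresented, while AlaArg sits almost exactly at the baseline.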

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.86 ± 0.03 | 1.198 ± 0.014 | 3.029 ± 0.019 | 3.576 ± 0.029 | 2.195 ± 0.015 | 3.352 ± 0.025 | 1.096 ± 0.01 | 3.105 ± 0.018 | 3.48 ± 0.02 | 4.756 ± 0.029 | 1.488 ± 0.013 | 2.371 ± 0.015 | 2.394 ± 0.02 | 2.002 ± 0.014 | 2.51 ± 0.017 | 4.34 ± 0.022 | 3.219 ± 0.02 | 4.032 ± 0.021 | 0.565 ± 0.007 | 1.573 ± 0.014 | 0.0 ± 0.0 |
| Cys | 1.102 ± 0.014 | 0.611 ± 0.012 | 1.57 ± 0.022 | 1.416 ± 0.017 | 0.893 ± 0.011 | 1.535 ± 0.019 | 0.681 ± 0.01 | 1.191 ± 0.014 | 1.464 ± 0.017 | 1.998 ± 0.017 | 0.505 ± 0.008 | 1.194 ± 0.017 | 1.365 ± 0.034 | 1.063 ± 0.017 | 1.213 ± 0.014 | 2.025 ± 0.024 | 1.407 ± 0.02 | 1.706 ± 0.015 | 0.22 ± 0.004 | 0.716 ± 0.008 | 0.0 ± 0.0 |
| Asp | 2.764 ± 0.016 | 1.27 ± 0.017 | 3.89 ± 0.03 | 4.125 ± 0.026 | 2.508 ± 0.016 | 3.693 ± 0.024 | 1.304 ± 0.012 | 3.925 ± 0.02 | 3.674 ± 0.024 | 5.129 ± 0.023 | 1.416 ± 0.012 | 2.886 ± 0.02 | 2.671 ± 0.02 | 2.108 ± 0.014 | 2.747 ± 0.02 | 4.689 ± 0.028 | 3.159 ± 0.021 | 3.989 ± 0.02 | 0.678 ± 0.009 | 1.859 ± 0.014 | 0.0 ± 0.0 |
| Glu | 3.675 ± 0.026 | 1.361 ± 0.019 | 4.422 ± 0.027 | 6.581 ± 0.065 | 2.353 ± 0.015 | 3.393 ± 0.024 | 1.4 ± 0.012 | 3.995 ± 0.029 | 5.572 ± 0.043 | 5.172 ± 0.031 | 1.851 ± 0.017 | 4.0 ± 0.025 | 2.217 ± 0.018 | 2.519 ± 0.023 | 3.365 ± 0.026 | 4.686 ± 0.03 | 3.775 ± 0.026 | 3.973 ± 0.027 | 0.666 ± 0.008 | 1.976 ± 0.015 | 0.0 ± 0.0 |
| Phe | 1.961 ± 0.015 | 0.978 ± 0.013 | 2.323 ± 0.016 | 2.208 ± 0.015 | 1.618 ± 0.015 | 2.527 ± 0.019 | 1.016 ± 0.011 | 2.192 ± 0.015 | 2.224 ± 0.013 | 3.662 ± 0.024 | 0.924 ± 0.01 | 1.861 ± 0.016 | 1.784 ± 0.016 | 1.6 ± 0.011 | 1.904 ± 0.014 | 3.353 ± 0.017 | 2.45 ± 0.018 | 2.626 ± 0.019 | 0.474 ± 0.008 | 1.44 ± 0.012 | 0.0 ± 0.0 |
| Gly | 3.093 ± 0.023 | 1.336 ± 0.019 | 3.44 ± 0.026 | 3.478 ± 0.021 | 2.538 ± 0.022 | 4.41 ± 0.065 | 1.626 ± 0.018 | 3.268 ± 0.023 | 3.99 ± 0.021 | 4.519 ± 0.024 | 1.471 ± 0.014 | 3.127 ± 0.022 | 2.308 ± 0.03 | 2.53 ± 0.022 | 3.029 ± 0.02 | 5.128 ± 0.037 | 3.52 ± 0.034 | 3.716 ± 0.022 | 0.78 ± 0.01 | 2.326 ± 0.022 | 0.0 ± 0.0 |
| His | 1.154 ± 0.01 | 0.69 ± 0.011 | 1.168 ± 0.011 | 1.241 ± 0.014 | 0.994 ± 0.011 | 1.542 ± 0.014 | 0.826 ± 0.009 | 1.44 ± 0.013 | 1.398 ± 0.013 | 2.255 ± 0.017 | 0.621 ± 0.009 | 1.09 ± 0.011 | 1.268 ± 0.011 | 1.062 ± 0.011 | 1.419 ± 0.014 | 1.97 ± 0.013 | 1.34 ± 0.013 | 1.528 ± 0.012 | 0.304 ± 0.006 | 0.834 ± 0.009 | 0.0 ± 0.0 |
| Ile | 3.131 ± 0.019 | 1.375 ± 0.015 | 3.185 ± 0.02 | 3.294 ± 0.017 | 2.149 ± 0.018 | 2.955 ± 0.022 | 1.485 ± 0.011 | 3.029 ± 0.022 | 3.596 ± 0.025 | 4.899 ± 0.026 | 1.221 ± 0.011 | 2.749 ± 0.018 | 2.956 ± 0.018 | 2.643 ± 0.017 | 2.691 ± 0.014 | 4.409 ± 0.021 | 3.429 ± 0.024 | 3.541 ± 0.021 | 0.583 ± 0.007 | 1.754 ± 0.014 | 0.0 ± 0.0 |
| Lys | 3.556 ± 0.021 | 1.508 ± 0.018 | 3.98 ± 0.026 | 5.349 ± 0.039 | 2.314 ± 0.014 | 3.366 ± 0.024 | 1.638 ± 0.016 | 3.716 ± 0.022 | 5.786 ± 0.046 | 5.537 ± 0.032 | 1.814 ± 0.014 | 3.279 ± 0.019 | 2.992 ± 0.03 | 2.907 ± 0.022 | 3.702 ± 0.024 | 5.256 ± 0.032 | 4.12 ± 0.024 | 3.941 ± 0.022 | 0.7 ± 0.008 | 2.235 ± 0.016 | 0.0 ± 0.0 |
| Leu | 4.65 ± 0.027 | 2.084 ± 0.018 | 4.714 ± 0.027 | 5.544 ± 0.038 | 3.234 ± 0.021 | 4.331 ± 0.026 | 2.211 ± 0.018 | 4.091 ± 0.024 | 5.821 ± 0.032 | 7.554 ± 0.043 | 2.081 ± 0.016 | 3.918 ± 0.022 | 4.196 ± 0.026 | 4.294 ± 0.028 | 4.353 ± 0.025 | 7.029 ± 0.034 | 5.068 ± 0.025 | 5.061 ± 0.025 | 0.882 ± 0.01 | 2.777 ± 0.02 | 0.0 ± 0.0 |
| Met | 1.765 ± 0.014 | 0.533 ± 0.008 | 1.574 ± 0.012 | 1.986 ± 0.014 | 1.036 ± 0.01 | 1.295 ± 0.014 | 0.466 ± 0.007 | 1.198 ± 0.01 | 1.858 ± 0.016 | 1.894 ± 0.015 | 0.768 ± 0.01 | 1.204 ± 0.012 | 1.008 ± 0.013 | 0.911 ± 0.01 | 1.134 ± 0.011 | 2.061 ± 0.015 | 1.526 ± 0.011 | 1.463 ± 0.012 | 0.256 ± 0.006 | 0.847 ± 0.01 | 0.0 ± 0.0 |
| Asn | 2.517 ± 0.016 | 1.289 ± 0.021 | 2.679 ± 0.018 | 2.964 ± 0.019 | 1.93 ± 0.013 | 3.471 ± 0.027 | 1.168 ± 0.01 | 3.139 ± 0.018 | 3.259 ± 0.02 | 4.2 ± 0.024 | 1.258 ± 0.012 | 2.724 ± 0.021 | 2.481 ± 0.02 | 2.134 ± 0.016 | 2.344 ± 0.015 | 3.985 ± 0.024 | 2.965 ± 0.019 | 3.206 ± 0.02 | 0.516 ± 0.007 | 1.582 ± 0.012 | 0.0 ± 0.0 |
| Pro | 2.633 ± 0.021 | 1.039 ± 0.018 | 2.701 ± 0.017 | 3.21 ± 0.026 | 1.707 ± 0.016 | 3.039 ± 0.039 | 1.075 ± 0.01 | 2.211 ± 0.017 | 2.953 ± 0.022 | 3.634 ± 0.024 | 1.01 ± 0.011 | 2.249 ± 0.018 | 3.624 ± 0.044 | 2.022 ± 0.017 | 2.315 ± 0.018 | 4.308 ± 0.027 | 2.94 ± 0.027 | 3.285 ± 0.021 | 0.509 ± 0.007 | 1.487 ± 0.016 | 0.001 ± 0.0 |
| Gln | 2.245 ± 0.017 | 1.062 ± 0.016 | 2.188 ± 0.015 | 3.037 ± 0.023 | 1.569 ± 0.013 | 2.386 ± 0.022 | 1.026 ± 0.012 | 2.288 ± 0.017 | 2.983 ± 0.021 | 3.427 ± 0.027 | 1.116 ± 0.012 | 2.327 ± 0.016 | 1.856 ± 0.017 | 2.332 ± 0.024 | 2.359 ± 0.017 | 3.314 ± 0.022 | 2.703 ± 0.016 | 2.298 ± 0.016 | 0.483 ± 0.007 | 1.412 ± 0.013 | 0.0 ± 0.0 |
| Arg | 2.43 ± 0.016 | 1.149 ± 0.015 | 2.825 ± 0.02 | 3.419 ± 0.025 | 1.95 ± 0.014 | 2.851 ± 0.021 | 1.327 ± 0.014 | 2.62 ± 0.016 | 3.911 ± 0.025 | 4.218 ± 0.022 | 1.247 ± 0.011 | 2.594 ± 0.018 | 2.293 ± 0.017 | 2.196 ± 0.017 | 3.429 ± 0.026 | 3.792 ± 0.026 | 2.704 ± 0.017 | 2.857 ± 0.018 | 0.558 ± 0.008 | 1.792 ± 0.013 | 0.0 ± 0.0 |
| Ser | 4.446 ± 0.022 | 1.843 ± 0.022 | 4.972 ± 0.027 | 5.132 ± 0.037 | 3.158 ± 0.021 | 5.412 ± 0.036 | 1.879 ± 0.014 | 4.014 ± 0.021 | 5.093 ± 0.03 | 6.962 ± 0.033 | 1.915 ± 0.014 | 3.854 ± 0.021 | 4.355 ± 0.034 | 3.4 ± 0.024 | 3.886 ± 0.025 | 8.609 ± 0.052 | 5.182 ± 0.032 | 5.312 ± 0.027 | 0.866 ± 0.009 | 2.448 ± 0.019 | 0.0 ± 0.0 |
| Thr | 3.586 ± 0.02 | 1.676 ± 0.026 | 3.566 ± 0.019 | 4.109 ± 0.031 | 2.347 ± 0.015 | 3.941 ± 0.032 | 1.262 ± 0.012 | 3.271 ± 0.02 | 3.683 ± 0.023 | 4.892 ± 0.024 | 1.38 ± 0.012 | 2.784 ± 0.019 | 3.312 ± 0.027 | 2.294 ± 0.017 | 2.479 ± 0.017 | 5.219 ± 0.031 | 4.365 ± 0.042 | 4.261 ± 0.029 | 0.685 ± 0.009 | 1.881 ± 0.018 | 0.0 ± 0.0 |
| Val | 3.53 ± 0.023 | 1.753 ± 0.018 | 3.755 ± 0.022 | 3.86 ± 0.026 | 2.794 ± 0.019 | 3.36 ± 0.022 | 1.5 ± 0.013 | 3.737 ± 0.022 | 4.101 ± 0.022 | 5.461 ± 0.028 | 1.58 ± 0.012 | 3.211 ± 0.022 | 2.993 ± 0.021 | 2.583 ± 0.019 | 2.888 ± 0.014 | 5.086 ± 0.024 | 4.338 ± 0.029 | 4.629 ± 0.027 | 0.732 ± 0.009 | 2.177 ± 0.017 | 0.0 ± 0.0 |
| Trp | 0.523 ± 0.007 | 0.244 ± 0.005 | 0.599 ± 0.008 | 0.62 ± 0.009 | 0.453 ± 0.007 | 0.654 ± 0.01 | 0.257 ± 0.006 | 0.673 ± 0.008 | 0.84 ± 0.01 | 0.903 ± 0.011 | 0.326 ± 0.006 | 0.612 ± 0.008 | 0.396 ± 0.006 | 0.415 ± 0.006 | 0.665 ± 0.008 | 0.855 ± 0.009 | 0.765 ± 0.013 | 0.656 ± 0.008 | 0.168 ± 0.004 | 0.392 ± 0.007 | 0.0 ± 0.0 |
| Tyr | 1.52 ± 0.013 | 0.874 ± 0.021 | 1.905 ± 0.014 | 1.803 ± 0.012 | 1.383 ± 0.013 | 2.118 ± 0.019 | 0.901 ± 0.01 | 1.973 ± 0.015 | 2.012 ± 0.014 | 2.819 ± 0.02 | 0.79 ± 0.009 | 1.776 ± 0.013 | 1.481 ± 0.015 | 1.398 ± 0.013 | 1.732 ± 0.013 | 2.626 ± 0.02 | 2.02 ± 0.019 | 1.986 ± 0.016 | 0.391 ± 0.007 | 1.198 ± 0.014 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.331 ± 0.082 |
Statistics based on 25998 proteins (11654340 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
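A protein-level bootstrap of this kind can be sketched in Python as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates serves as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions for illustration, not details confirmed by the source.

```python
import random
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Each round draws len(proteins) proteins, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    mean = sum(estimates) / n_rounds
    variance = sum((x - mean) ** 2 for x in estimates) / (n_rounds - 1)
    return mean, variance ** 0.5


# Toy sequences in one-letter code (hypothetical data)
toy = ["MKKLA", "AKKSS", "MSSLL", "LLKKA", "SSSAK"]
mean, err = bootstrap_error(toy, "KK")
print(f"KK: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one sequence together, so the estimate reflects variation between proteins.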

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski