Amino acid dipepetide frequency for Sinocyclocheilus rhinocerous

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.354AlaAla: 5.354 ± 0.015
1.313AlaCys: 1.313 ± 0.005
3.088AlaAsp: 3.088 ± 0.008
4.448AlaGlu: 4.448 ± 0.012
2.449AlaPhe: 2.449 ± 0.007
3.956AlaGly: 3.956 ± 0.013
1.51AlaHis: 1.51 ± 0.005
2.922AlaIle: 2.922 ± 0.007
3.407AlaLys: 3.407 ± 0.01
6.534AlaLeu: 6.534 ± 0.014
1.582AlaMet: 1.582 ± 0.006
2.197AlaAsn: 2.197 ± 0.007
3.034AlaPro: 3.034 ± 0.01
2.896AlaGln: 2.896 ± 0.009
2.996AlaArg: 2.996 ± 0.008
4.955AlaSer: 4.955 ± 0.013
3.247AlaThr: 3.247 ± 0.008
4.88AlaVal: 4.88 ± 0.01
0.638AlaTrp: 0.638 ± 0.004
1.59AlaTyr: 1.59 ± 0.006
0.0AlaXaa: 0.0 ± 0.0
Cys
1.235CysAla: 1.235 ± 0.006
0.651CysCys: 0.651 ± 0.004
1.162CysAsp: 1.162 ± 0.007
1.305CysGlu: 1.305 ± 0.006
1.015CysPhe: 1.015 ± 0.004
1.484CysGly: 1.484 ± 0.008
0.659CysHis: 0.659 ± 0.004
1.133CysIle: 1.133 ± 0.006
1.261CysLys: 1.261 ± 0.006
2.313CysLeu: 2.313 ± 0.009
0.524CysMet: 0.524 ± 0.003
0.875CysAsn: 0.875 ± 0.005
1.216CysPro: 1.216 ± 0.008
1.047CysGln: 1.047 ± 0.006
1.262CysArg: 1.262 ± 0.006
2.079CysSer: 2.079 ± 0.008
1.237CysThr: 1.237 ± 0.006
1.731CysVal: 1.731 ± 0.008
0.301CysTrp: 0.301 ± 0.002
0.651CysTyr: 0.651 ± 0.003
0.0CysXaa: 0.0 ± 0.0
Asp
3.009AspAla: 3.009 ± 0.008
1.147AspCys: 1.147 ± 0.007
3.063AspAsp: 3.063 ± 0.012
3.808AspGlu: 3.808 ± 0.011
2.249AspPhe: 2.249 ± 0.007
3.593AspGly: 3.593 ± 0.011
1.21AspHis: 1.21 ± 0.005
2.946AspIle: 2.946 ± 0.009
2.826AspLys: 2.826 ± 0.009
5.199AspLeu: 5.199 ± 0.009
1.339AspMet: 1.339 ± 0.005
1.968AspAsn: 1.968 ± 0.007
2.854AspPro: 2.854 ± 0.008
1.954AspGln: 1.954 ± 0.006
2.643AspArg: 2.643 ± 0.007
4.304AspSer: 4.304 ± 0.011
2.662AspThr: 2.662 ± 0.007
3.445AspVal: 3.445 ± 0.01
0.708AspTrp: 0.708 ± 0.004
1.641AspTyr: 1.641 ± 0.006
0.0AspXaa: 0.0 ± 0.0
Glu
4.488GluAla: 4.488 ± 0.014
1.315GluCys: 1.315 ± 0.007
4.353GluAsp: 4.353 ± 0.01
7.152GluGlu: 7.152 ± 0.025
2.163GluPhe: 2.163 ± 0.007
3.942GluGly: 3.942 ± 0.011
1.528GluHis: 1.528 ± 0.005
3.191GluIle: 3.191 ± 0.008
4.88GluLys: 4.88 ± 0.016
6.321GluLeu: 6.321 ± 0.015
1.87GluMet: 1.87 ± 0.006
2.941GluAsn: 2.941 ± 0.008
2.624GluPro: 2.624 ± 0.008
3.112GluGln: 3.112 ± 0.011
4.201GluArg: 4.201 ± 0.013
4.338GluSer: 4.338 ± 0.011
3.4GluThr: 3.4 ± 0.01
4.305GluVal: 4.305 ± 0.009
0.716GluTrp: 0.716 ± 0.004
1.762GluTyr: 1.762 ± 0.006
0.0GluXaa: 0.0 ± 0.0
Phe
2.043PheAla: 2.043 ± 0.007
1.034PheCys: 1.034 ± 0.005
1.923PheAsp: 1.923 ± 0.006
2.083PheGlu: 2.083 ± 0.007
1.837PhePhe: 1.837 ± 0.007
2.313PheGly: 2.313 ± 0.009
1.09PheHis: 1.09 ± 0.004
2.199PheIle: 2.199 ± 0.007
2.025PheLys: 2.025 ± 0.007
4.133PheLeu: 4.133 ± 0.011
0.895PheMet: 0.895 ± 0.004
1.63PheAsn: 1.63 ± 0.006
1.84PhePro: 1.84 ± 0.006
1.732PheGln: 1.732 ± 0.006
1.946PheArg: 1.946 ± 0.005
3.543PheSer: 3.543 ± 0.008
2.42PheThr: 2.42 ± 0.008
2.413PheVal: 2.413 ± 0.008
0.498PheTrp: 0.498 ± 0.003
1.334PheTyr: 1.334 ± 0.005
0.0PheXaa: 0.0 ± 0.0
Gly
3.671GlyAla: 3.671 ± 0.012
1.191GlyCys: 1.191 ± 0.005
3.142GlyAsp: 3.142 ± 0.01
3.881GlyGlu: 3.881 ± 0.011
2.501GlyPhe: 2.501 ± 0.008
4.401GlyGly: 4.401 ± 0.015
1.628GlyHis: 1.628 ± 0.007
2.789GlyIle: 2.789 ± 0.009
3.708GlyLys: 3.708 ± 0.009
5.33GlyLeu: 5.33 ± 0.013
1.54GlyMet: 1.54 ± 0.007
2.428GlyAsn: 2.428 ± 0.009
2.928GlyPro: 2.928 ± 0.015
2.66GlyGln: 2.66 ± 0.009
3.364GlyArg: 3.364 ± 0.011
5.192GlySer: 5.192 ± 0.012
3.322GlyThr: 3.322 ± 0.01
3.88GlyVal: 3.88 ± 0.011
0.757GlyTrp: 0.757 ± 0.004
1.892GlyTyr: 1.892 ± 0.008
0.0GlyXaa: 0.0 ± 0.0
His
1.359HisAla: 1.359 ± 0.004
0.782HisCys: 0.782 ± 0.004
1.058HisAsp: 1.058 ± 0.004
1.341HisGlu: 1.341 ± 0.005
1.151HisPhe: 1.151 ± 0.005
1.538HisGly: 1.538 ± 0.005
0.922HisHis: 0.922 ± 0.005
1.474HisIle: 1.474 ± 0.006
1.403HisLys: 1.403 ± 0.005
2.773HisLeu: 2.773 ± 0.008
0.707HisMet: 0.707 ± 0.005
1.093HisAsn: 1.093 ± 0.005
1.49HisPro: 1.49 ± 0.006
1.203HisGln: 1.203 ± 0.005
1.554HisArg: 1.554 ± 0.006
2.351HisSer: 2.351 ± 0.007
1.68HisThr: 1.68 ± 0.007
1.527HisVal: 1.527 ± 0.006
0.35HisTrp: 0.35 ± 0.002
0.916HisTyr: 0.916 ± 0.004
0.0HisXaa: 0.0 ± 0.0
Ile
2.773IleAla: 2.773 ± 0.008
1.197IleCys: 1.197 ± 0.005
2.368IleAsp: 2.368 ± 0.007
2.723IleGlu: 2.723 ± 0.009
2.067IlePhe: 2.067 ± 0.008
2.462IleGly: 2.462 ± 0.009
1.419IleHis: 1.419 ± 0.005
2.747IleIle: 2.747 ± 0.01
2.903IleLys: 2.903 ± 0.009
4.716IleLeu: 4.716 ± 0.011
1.23IleMet: 1.23 ± 0.005
2.21IleAsn: 2.21 ± 0.008
2.541IlePro: 2.541 ± 0.007
2.39IleGln: 2.39 ± 0.007
2.621IleArg: 2.621 ± 0.007
4.081IleSer: 4.081 ± 0.009
3.024IleThr: 3.024 ± 0.008
2.839IleVal: 2.839 ± 0.01
0.526IleTrp: 0.526 ± 0.003
1.631IleTyr: 1.631 ± 0.006
0.0IleXaa: 0.0 ± 0.0
Lys
3.764LysAla: 3.764 ± 0.012
1.146LysCys: 1.146 ± 0.005
3.418LysAsp: 3.418 ± 0.009
4.772LysGlu: 4.772 ± 0.016
1.81LysPhe: 1.81 ± 0.007
3.223LysGly: 3.223 ± 0.009
1.563LysHis: 1.563 ± 0.006
2.892LysIle: 2.892 ± 0.009
4.56LysLys: 4.56 ± 0.018
5.373LysLeu: 5.373 ± 0.013
1.561LysMet: 1.561 ± 0.006
2.553LysAsn: 2.553 ± 0.007
2.878LysPro: 2.878 ± 0.011
2.716LysGln: 2.716 ± 0.01
3.444LysArg: 3.444 ± 0.01
3.96LysSer: 3.96 ± 0.01
3.307LysThr: 3.307 ± 0.009
3.697LysVal: 3.697 ± 0.01
0.619LysTrp: 0.619 ± 0.003
1.688LysTyr: 1.688 ± 0.006
0.0LysXaa: 0.0 ± 0.0
Leu
6.007LeuAla: 6.007 ± 0.013
2.352LeuCys: 2.352 ± 0.008
4.978LeuAsp: 4.978 ± 0.009
6.631LeuGlu: 6.631 ± 0.019
3.776LeuPhe: 3.776 ± 0.011
5.027LeuGly: 5.027 ± 0.012
2.803LeuHis: 2.803 ± 0.008
4.282LeuIle: 4.282 ± 0.011
6.09LeuLys: 6.09 ± 0.012
10.13LeuLeu: 10.13 ± 0.023
2.266LeuMet: 2.266 ± 0.008
3.979LeuAsn: 3.979 ± 0.008
4.945LeuPro: 4.945 ± 0.012
5.545LeuGln: 5.545 ± 0.014
5.562LeuArg: 5.562 ± 0.01
8.095LeuSer: 8.095 ± 0.014
5.257LeuThr: 5.257 ± 0.012
5.485LeuVal: 5.485 ± 0.012
1.069LeuTrp: 1.069 ± 0.004
2.858LeuTyr: 2.858 ± 0.008
0.0LeuXaa: 0.0 ± 0.0
Met
1.891MetAla: 1.891 ± 0.006
0.539MetCys: 0.539 ± 0.003
1.488MetAsp: 1.488 ± 0.005
2.035MetGlu: 2.035 ± 0.007
0.953MetPhe: 0.953 ± 0.005
1.461MetGly: 1.461 ± 0.006
0.568MetHis: 0.568 ± 0.003
0.997MetIle: 0.997 ± 0.004
1.59MetLys: 1.59 ± 0.006
2.245MetLeu: 2.245 ± 0.007
0.733MetMet: 0.733 ± 0.004
1.029MetAsn: 1.029 ± 0.004
1.142MetPro: 1.142 ± 0.006
1.086MetGln: 1.086 ± 0.005
1.262MetArg: 1.262 ± 0.005
1.873MetSer: 1.873 ± 0.006
1.303MetThr: 1.303 ± 0.005
1.61MetVal: 1.61 ± 0.006
0.279MetTrp: 0.279 ± 0.002
0.698MetTyr: 0.698 ± 0.004
0.0MetXaa: 0.0 ± 0.0
Asn
2.321AsnAla: 2.321 ± 0.006
0.932AsnCys: 0.932 ± 0.005
1.825AsnAsp: 1.825 ± 0.007
2.342AsnGlu: 2.342 ± 0.008
1.532AsnPhe: 1.532 ± 0.005
2.817AsnGly: 2.817 ± 0.009
1.04AsnHis: 1.04 ± 0.005
2.453AsnIle: 2.453 ± 0.007
2.399AsnLys: 2.399 ± 0.008
3.812AsnLeu: 3.812 ± 0.01
1.12AsnMet: 1.12 ± 0.004
1.85AsnAsn: 1.85 ± 0.006
2.294AsnPro: 2.294 ± 0.007
1.768AsnGln: 1.768 ± 0.006
2.037AsnArg: 2.037 ± 0.006
3.279AsnSer: 3.279 ± 0.008
2.352AsnThr: 2.352 ± 0.007
2.466AsnVal: 2.466 ± 0.007
0.473AsnTrp: 0.473 ± 0.003
1.255AsnTyr: 1.255 ± 0.005
0.0AsnXaa: 0.0 ± 0.0
Pro
3.542ProAla: 3.542 ± 0.01
1.015ProCys: 1.015 ± 0.007
2.818ProAsp: 2.818 ± 0.007
3.662ProGlu: 3.662 ± 0.01
1.858ProPhe: 1.858 ± 0.008
3.527ProGly: 3.527 ± 0.02
1.422ProHis: 1.422 ± 0.006
1.964ProIle: 1.964 ± 0.008
2.562ProLys: 2.562 ± 0.01
4.574ProLeu: 4.574 ± 0.012
1.044ProMet: 1.044 ± 0.005
1.966ProAsn: 1.966 ± 0.007
4.421ProPro: 4.421 ± 0.021
2.433ProGln: 2.433 ± 0.009
2.439ProArg: 2.439 ± 0.008
4.902ProSer: 4.902 ± 0.014
2.826ProThr: 2.826 ± 0.009
3.607ProVal: 3.607 ± 0.009
0.526ProTrp: 0.526 ± 0.003
1.433ProTyr: 1.433 ± 0.006
0.0ProXaa: 0.0 ± 0.0
Gln
3.068GlnAla: 3.068 ± 0.009
1.064GlnCys: 1.064 ± 0.006
2.352GlnAsp: 2.352 ± 0.006
3.397GlnGlu: 3.397 ± 0.01
1.504GlnPhe: 1.504 ± 0.005
2.552GlnGly: 2.552 ± 0.008
1.362GlnHis: 1.362 ± 0.006
2.151GlnIle: 2.151 ± 0.006
2.772GlnLys: 2.772 ± 0.01
4.511GlnLeu: 4.511 ± 0.013
1.233GlnMet: 1.233 ± 0.005
1.921GlnAsn: 1.921 ± 0.006
2.291GlnPro: 2.291 ± 0.01
3.024GlnGln: 3.024 ± 0.018
2.903GlnArg: 2.903 ± 0.009
3.362GlnSer: 3.362 ± 0.011
2.581GlnThr: 2.581 ± 0.007
2.803GlnVal: 2.803 ± 0.008
0.568GlnTrp: 0.568 ± 0.003
1.359GlnTyr: 1.359 ± 0.005
0.0GlnXaa: 0.0 ± 0.0
Arg
3.29ArgAla: 3.29 ± 0.008
1.244ArgCys: 1.244 ± 0.006
2.914ArgAsp: 2.914 ± 0.008
3.81ArgGlu: 3.81 ± 0.012
2.055ArgPhe: 2.055 ± 0.007
3.14ArgGly: 3.14 ± 0.011
1.5ArgHis: 1.5 ± 0.005
2.591ArgIle: 2.591 ± 0.006
3.557ArgLys: 3.557 ± 0.01
5.248ArgLeu: 5.248 ± 0.012
1.346ArgMet: 1.346 ± 0.004
2.215ArgAsn: 2.215 ± 0.007
2.634ArgPro: 2.634 ± 0.007
2.572ArgGln: 2.572 ± 0.008
3.831ArgArg: 3.831 ± 0.012
4.136ArgSer: 4.136 ± 0.011
2.779ArgThr: 2.779 ± 0.008
3.308ArgVal: 3.308 ± 0.009
0.661ArgTrp: 0.661 ± 0.004
1.607ArgTyr: 1.607 ± 0.005
0.0ArgXaa: 0.0 ± 0.0
Ser
5.178SerAla: 5.178 ± 0.013
1.886SerCys: 1.886 ± 0.008
4.238SerAsp: 4.238 ± 0.01
4.896SerGlu: 4.896 ± 0.012
3.199SerPhe: 3.199 ± 0.008
5.285SerGly: 5.285 ± 0.011
2.145SerHis: 2.145 ± 0.007
3.596SerIle: 3.596 ± 0.009
4.114SerLys: 4.114 ± 0.01
8.012SerLeu: 8.012 ± 0.016
1.865SerMet: 1.865 ± 0.006
3.001SerAsn: 3.001 ± 0.008
5.1SerPro: 5.1 ± 0.016
3.67SerGln: 3.67 ± 0.011
4.19SerArg: 4.19 ± 0.011
8.993SerSer: 8.993 ± 0.026
4.602SerThr: 4.602 ± 0.012
5.432SerVal: 5.432 ± 0.011
0.96SerTrp: 0.96 ± 0.004
2.169SerTyr: 2.169 ± 0.007
0.001SerXaa: 0.001 ± 0.0
Thr
3.834ThrAla: 3.834 ± 0.009
1.395ThrCys: 1.395 ± 0.008
2.992ThrAsp: 2.992 ± 0.008
3.702ThrGlu: 3.702 ± 0.01
2.206ThrPhe: 2.206 ± 0.007
3.688ThrGly: 3.688 ± 0.01
1.531ThrHis: 1.531 ± 0.007
2.608ThrIle: 2.608 ± 0.007
2.686ThrLys: 2.686 ± 0.009
5.498ThrLeu: 5.498 ± 0.011
1.197ThrMet: 1.197 ± 0.005
2.012ThrAsn: 2.012 ± 0.007
3.362ThrPro: 3.362 ± 0.011
2.362ThrGln: 2.362 ± 0.008
2.436ThrArg: 2.436 ± 0.008
4.469ThrSer: 4.469 ± 0.013
3.046ThrThr: 3.046 ± 0.015
4.172ThrVal: 4.172 ± 0.011
0.644ThrTrp: 0.644 ± 0.004
1.534ThrTyr: 1.534 ± 0.006
0.0ThrXaa: 0.0 ± 0.0
Val
3.987ValAla: 3.987 ± 0.01
1.885ValCys: 1.885 ± 0.007
3.244ValAsp: 3.244 ± 0.008
4.108ValGlu: 4.108 ± 0.01
2.786ValPhe: 2.786 ± 0.008
3.407ValGly: 3.407 ± 0.01
1.645ValHis: 1.645 ± 0.005
3.245ValIle: 3.245 ± 0.01
3.799ValLys: 3.799 ± 0.009
6.403ValLeu: 6.403 ± 0.014
1.601ValMet: 1.601 ± 0.006
2.676ValAsn: 2.676 ± 0.008
3.19ValPro: 3.19 ± 0.009
2.839ValGln: 2.839 ± 0.007
3.326ValArg: 3.326 ± 0.009
5.287ValSer: 5.287 ± 0.011
3.877ValThr: 3.877 ± 0.011
4.239ValVal: 4.239 ± 0.01
0.787ValTrp: 0.787 ± 0.003
1.948ValTyr: 1.948 ± 0.006
0.0ValXaa: 0.0 ± 0.0
Trp
0.661TrpAla: 0.661 ± 0.004
0.273TrpCys: 0.273 ± 0.002
0.665TrpAsp: 0.665 ± 0.004
0.754TrpGlu: 0.754 ± 0.004
0.473TrpPhe: 0.473 ± 0.003
0.625TrpGly: 0.625 ± 0.004
0.297TrpHis: 0.297 ± 0.002
0.602TrpIle: 0.602 ± 0.004
0.76TrpLys: 0.76 ± 0.004
1.163TrpLeu: 1.163 ± 0.005
0.371TrpMet: 0.371 ± 0.002
0.531TrpAsn: 0.531 ± 0.003
0.438TrpPro: 0.438 ± 0.003
0.498TrpGln: 0.498 ± 0.003
0.695TrpArg: 0.695 ± 0.003
0.934TrpSer: 0.934 ± 0.005
0.695TrpThr: 0.695 ± 0.004
0.699TrpVal: 0.699 ± 0.004
0.199TrpTrp: 0.199 ± 0.002
0.352TrpTyr: 0.352 ± 0.003
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.518TyrAla: 1.518 ± 0.005
0.783TyrCys: 0.783 ± 0.004
1.479TyrAsp: 1.479 ± 0.005
1.752TyrGlu: 1.752 ± 0.006
1.347TyrPhe: 1.347 ± 0.005
1.767TyrGly: 1.767 ± 0.007
0.831TyrHis: 0.831 ± 0.004
1.685TyrIle: 1.685 ± 0.007
1.641TyrLys: 1.641 ± 0.007
2.849TyrLeu: 2.849 ± 0.008
0.756TyrMet: 0.756 ± 0.004
1.292TyrAsn: 1.292 ± 0.005
1.318TyrPro: 1.318 ± 0.005
1.273TyrGln: 1.273 ± 0.005
1.729TyrArg: 1.729 ± 0.005
2.397TyrSer: 2.397 ± 0.007
1.759TyrThr: 1.759 ± 0.006
1.74TyrVal: 1.74 ± 0.006
0.404TyrTrp: 0.404 ± 0.004
1.081TyrTyr: 1.081 ± 0.005
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.15XaaXaa: 0.15 ± 0.015
Statistics based on 97524 proteins (58701194 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski