Amino acid dipeptide frequency for Collichthys lucidus (Big head croaker) (Sciaena lucida)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
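As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, and the choice of overlapping windows that never span two proteins are assumptions made for this example; the page does not state how the database counts pairs.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: dipeptides are overlapping windows of length 2 and are
    counted within each protein only (no pair spans two proteins).
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAL", "AAL"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With real data, the sequences would come from the proteome's FASTA file, and the one-letter pairs would be mapped to the three-letter names used in the table.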

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
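The arithmetic behind the 0.25% baseline can be sketched as follows. The observed values are copied from the table on this page; the helper names and the rule "above the uniform expectation means over-represented" are assumptions for illustration, since the exact colouring rule used by the page is not spelled out.

```python
N_STANDARD = 20  # standard amino acids; 21 if Xaa is included

def uniform_expectation_per_mille(n_residues=N_STANDARD):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)

def classify(observed, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"

expected = uniform_expectation_per_mille()  # 1000 / 400

# Observed values in per mille, taken from the table on this page.
observed = {"LeuLeu": 9.978, "SerSer": 10.8, "TrpTrp": 0.18, "CysMet": 0.487}
labels = {pair: classify(value, expected) for pair, value in observed.items()}
print(expected, labels)
```

Note that a uniform baseline ignores the very different abundances of single amino acids (Leu and Ser are common, Trp and Cys are rare), which is one reason the observed values deviate so strongly from it.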

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
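A minimal sketch of the unit conversion, using the AlaAla value from the table as input; the function names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 6.569  # AlaAla, per mille, from the table
print(per_mille_to_fraction(ala_ala), per_mille_to_percent(ala_ala))
```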

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.569AlaAla: 6.569 ± 0.036
1.289AlaCys: 1.289 ± 0.01
3.231AlaAsp: 3.231 ± 0.017
4.745AlaGlu: 4.745 ± 0.027
2.297AlaPhe: 2.297 ± 0.013
4.343AlaGly: 4.343 ± 0.021
1.525AlaHis: 1.525 ± 0.012
2.634AlaIle: 2.634 ± 0.015
3.359AlaLys: 3.359 ± 0.02
6.451AlaLeu: 6.451 ± 0.03
1.604AlaMet: 1.604 ± 0.011
2.145AlaAsn: 2.145 ± 0.014
3.599AlaPro: 3.599 ± 0.024
2.981AlaGln: 2.981 ± 0.021
3.235AlaArg: 3.235 ± 0.016
5.579AlaSer: 5.579 ± 0.027
3.623AlaThr: 3.623 ± 0.025
4.964AlaVal: 4.964 ± 0.023
0.646AlaTrp: 0.646 ± 0.007
1.469AlaTyr: 1.469 ± 0.011
0.0AlaXaa: 0.0 ± 0.0
Cys
1.158CysAla: 1.158 ± 0.01
0.654CysCys: 0.654 ± 0.009
1.125CysAsp: 1.125 ± 0.013
1.212CysGlu: 1.212 ± 0.013
0.838CysPhe: 0.838 ± 0.009
1.513CysGly: 1.513 ± 0.016
0.661CysHis: 0.661 ± 0.008
0.911CysIle: 0.911 ± 0.009
1.085CysLys: 1.085 ± 0.012
2.142CysLeu: 2.142 ± 0.016
0.487CysMet: 0.487 ± 0.007
0.829CysAsn: 0.829 ± 0.013
1.285CysPro: 1.285 ± 0.014
1.041CysGln: 1.041 ± 0.012
1.308CysArg: 1.308 ± 0.011
2.114CysSer: 2.114 ± 0.016
1.182CysThr: 1.182 ± 0.011
1.5CysVal: 1.5 ± 0.015
0.295CysTrp: 0.295 ± 0.006
0.611CysTyr: 0.611 ± 0.007
0.0CysXaa: 0.0 ± 0.0
Asp
3.005AspAla: 3.005 ± 0.017
1.133AspCys: 1.133 ± 0.012
3.23AspAsp: 3.23 ± 0.024
3.866AspGlu: 3.866 ± 0.025
2.054AspPhe: 2.054 ± 0.012
3.636AspGly: 3.636 ± 0.021
1.223AspHis: 1.223 ± 0.01
2.651AspIle: 2.651 ± 0.033
2.742AspLys: 2.742 ± 0.02
4.877AspLeu: 4.877 ± 0.028
1.333AspMet: 1.333 ± 0.012
1.976AspAsn: 1.976 ± 0.028
2.873AspPro: 2.873 ± 0.018
2.021AspGln: 2.021 ± 0.019
2.888AspArg: 2.888 ± 0.02
4.652AspSer: 4.652 ± 0.03
2.792AspThr: 2.792 ± 0.018
3.353AspVal: 3.353 ± 0.025
0.667AspTrp: 0.667 ± 0.008
1.597AspTyr: 1.597 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
4.832GluAla: 4.832 ± 0.029
1.199GluCys: 1.199 ± 0.015
4.545GluAsp: 4.545 ± 0.027
8.449GluGlu: 8.449 ± 0.065
1.859GluPhe: 1.859 ± 0.015
4.14GluGly: 4.14 ± 0.026
1.478GluHis: 1.478 ± 0.012
2.762GluIle: 2.762 ± 0.021
4.67GluLys: 4.67 ± 0.035
6.079GluLeu: 6.079 ± 0.034
1.746GluMet: 1.746 ± 0.015
2.656GluAsn: 2.656 ± 0.02
2.916GluPro: 2.916 ± 0.023
3.222GluGln: 3.222 ± 0.04
4.488GluArg: 4.488 ± 0.034
4.455GluSer: 4.455 ± 0.025
3.567GluThr: 3.567 ± 0.025
4.38GluVal: 4.38 ± 0.024
0.69GluTrp: 0.69 ± 0.007
1.511GluTyr: 1.511 ± 0.014
0.0GluXaa: 0.0 ± 0.0
Phe
1.852PheAla: 1.852 ± 0.015
0.869PheCys: 0.869 ± 0.009
1.723PheAsp: 1.723 ± 0.013
1.787PheGlu: 1.787 ± 0.019
1.474PhePhe: 1.474 ± 0.013
2.077PheGly: 2.077 ± 0.017
0.964PheHis: 0.964 ± 0.008
1.796PheIle: 1.796 ± 0.016
1.723PheLys: 1.723 ± 0.021
3.638PheLeu: 3.638 ± 0.02
0.813PheMet: 0.813 ± 0.015
1.415PheAsn: 1.415 ± 0.01
1.734PhePro: 1.734 ± 0.014
1.523PheGln: 1.523 ± 0.01
1.804PheArg: 1.804 ± 0.014
3.269PheSer: 3.269 ± 0.02
2.234PheThr: 2.234 ± 0.015
2.125PheVal: 2.125 ± 0.016
0.434PheTrp: 0.434 ± 0.006
1.14PheTyr: 1.14 ± 0.01
0.0PheXaa: 0.0 ± 0.0
Gly
4.016GlyAla: 4.016 ± 0.023
1.275GlyCys: 1.275 ± 0.011
3.296GlyAsp: 3.296 ± 0.023
4.119GlyGlu: 4.119 ± 0.025
2.293GlyPhe: 2.293 ± 0.018
5.813GlyGly: 5.813 ± 0.05
1.697GlyHis: 1.697 ± 0.018
2.446GlyIle: 2.446 ± 0.017
3.423GlyLys: 3.423 ± 0.023
5.445GlyLeu: 5.445 ± 0.023
1.466GlyMet: 1.466 ± 0.015
2.384GlyAsn: 2.384 ± 0.017
3.292GlyPro: 3.292 ± 0.051
2.835GlyGln: 2.835 ± 0.023
3.784GlyArg: 3.784 ± 0.023
5.715GlySer: 5.715 ± 0.03
3.386GlyThr: 3.386 ± 0.023
4.006GlyVal: 4.006 ± 0.021
0.756GlyTrp: 0.756 ± 0.01
1.77GlyTyr: 1.77 ± 0.016
0.0GlyXaa: 0.0 ± 0.0
His
1.399HisAla: 1.399 ± 0.011
0.737HisCys: 0.737 ± 0.009
1.025HisAsp: 1.025 ± 0.011
1.197HisGlu: 1.197 ± 0.009
1.007HisPhe: 1.007 ± 0.008
1.645HisGly: 1.645 ± 0.015
1.201HisHis: 1.201 ± 0.018
1.268HisIle: 1.268 ± 0.01
1.311HisLys: 1.311 ± 0.015
2.706HisLeu: 2.706 ± 0.018
0.678HisMet: 0.678 ± 0.007
1.06HisAsn: 1.06 ± 0.016
1.643HisPro: 1.643 ± 0.014
1.349HisGln: 1.349 ± 0.014
1.769HisArg: 1.769 ± 0.013
2.508HisSer: 2.508 ± 0.017
1.647HisThr: 1.647 ± 0.013
1.471HisVal: 1.471 ± 0.012
0.337HisTrp: 0.337 ± 0.005
0.834HisTyr: 0.834 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
2.41IleAla: 2.41 ± 0.017
1.014IleCys: 1.014 ± 0.009
2.04IleAsp: 2.04 ± 0.015
2.305IleGlu: 2.305 ± 0.02
1.64IlePhe: 1.64 ± 0.014
2.212IleGly: 2.212 ± 0.02
1.235IleHis: 1.235 ± 0.013
2.294IleIle: 2.294 ± 0.021
2.434IleLys: 2.434 ± 0.019
4.105IleLeu: 4.105 ± 0.021
1.047IleMet: 1.047 ± 0.009
1.868IleAsn: 1.868 ± 0.012
2.384IlePro: 2.384 ± 0.015
2.153IleGln: 2.153 ± 0.017
2.373IleArg: 2.373 ± 0.015
3.685IleSer: 3.685 ± 0.02
2.749IleThr: 2.749 ± 0.025
2.47IleVal: 2.47 ± 0.02
0.452IleTrp: 0.452 ± 0.007
1.301IleTyr: 1.301 ± 0.011
0.0IleXaa: 0.0 ± 0.0
Lys
3.749LysAla: 3.749 ± 0.024
0.975LysCys: 0.975 ± 0.01
3.214LysAsp: 3.214 ± 0.022
4.777LysGlu: 4.777 ± 0.039
1.474LysPhe: 1.474 ± 0.014
3.131LysGly: 3.131 ± 0.033
1.399LysHis: 1.399 ± 0.011
2.338LysIle: 2.338 ± 0.018
4.502LysLys: 4.502 ± 0.04
4.886LysLeu: 4.886 ± 0.051
1.48LysMet: 1.48 ± 0.017
2.172LysAsn: 2.172 ± 0.014
2.941LysPro: 2.941 ± 0.023
2.551LysGln: 2.551 ± 0.021
3.533LysArg: 3.533 ± 0.02
3.793LysSer: 3.793 ± 0.024
3.24LysThr: 3.24 ± 0.024
3.431LysVal: 3.431 ± 0.022
0.546LysTrp: 0.546 ± 0.009
1.404LysTyr: 1.404 ± 0.017
0.0LysXaa: 0.0 ± 0.0
Leu
5.949LeuAla: 5.949 ± 0.03
2.236LeuCys: 2.236 ± 0.018
4.899LeuAsp: 4.899 ± 0.028
6.236LeuGlu: 6.236 ± 0.032
3.273LeuPhe: 3.273 ± 0.02
4.976LeuGly: 4.976 ± 0.024
2.756LeuHis: 2.756 ± 0.018
3.692LeuIle: 3.692 ± 0.019
5.504LeuLys: 5.504 ± 0.053
9.978LeuLeu: 9.978 ± 0.055
2.118LeuMet: 2.118 ± 0.014
3.545LeuAsn: 3.545 ± 0.02
5.306LeuPro: 5.306 ± 0.028
5.435LeuGln: 5.435 ± 0.035
5.723LeuArg: 5.723 ± 0.029
8.369LeuSer: 8.369 ± 0.042
5.265LeuThr: 5.265 ± 0.022
5.367LeuVal: 5.367 ± 0.025
1.053LeuTrp: 1.053 ± 0.011
2.532LeuTyr: 2.532 ± 0.019
0.0LeuXaa: 0.0 ± 0.0
Met
1.903MetAla: 1.903 ± 0.011
0.48MetCys: 0.48 ± 0.008
1.488MetAsp: 1.488 ± 0.015
2.021MetGlu: 2.021 ± 0.016
0.86MetPhe: 0.86 ± 0.011
1.433MetGly: 1.433 ± 0.014
0.496MetHis: 0.496 ± 0.007
0.867MetIle: 0.867 ± 0.009
1.487MetLys: 1.487 ± 0.012
2.093MetLeu: 2.093 ± 0.013
0.741MetMet: 0.741 ± 0.009
0.933MetAsn: 0.933 ± 0.023
1.124MetPro: 1.124 ± 0.019
0.994MetGln: 0.994 ± 0.009
1.2MetArg: 1.2 ± 0.008
1.963MetSer: 1.963 ± 0.015
1.34MetThr: 1.34 ± 0.011
1.529MetVal: 1.529 ± 0.012
0.268MetTrp: 0.268 ± 0.004
0.647MetTyr: 0.647 ± 0.013
0.0MetXaa: 0.0 ± 0.0
Asn
2.125AsnAla: 2.125 ± 0.015
0.837AsnCys: 0.837 ± 0.009
1.644AsnAsp: 1.644 ± 0.026
2.049AsnGlu: 2.049 ± 0.016
1.32AsnPhe: 1.32 ± 0.01
2.621AsnGly: 2.621 ± 0.018
1.023AsnHis: 1.023 ± 0.01
2.073AsnIle: 2.073 ± 0.015
2.196AsnLys: 2.196 ± 0.017
3.475AsnLeu: 3.475 ± 0.02
1.113AsnMet: 1.113 ± 0.022
1.886AsnAsn: 1.886 ± 0.025
2.188AsnPro: 2.188 ± 0.017
1.825AsnGln: 1.825 ± 0.026
2.013AsnArg: 2.013 ± 0.014
3.149AsnSer: 3.149 ± 0.019
2.379AsnThr: 2.379 ± 0.022
2.278AsnVal: 2.278 ± 0.019
0.423AsnTrp: 0.423 ± 0.005
1.097AsnTyr: 1.097 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
4.323ProAla: 4.323 ± 0.026
1.095ProCys: 1.095 ± 0.012
2.921ProAsp: 2.921 ± 0.02
3.833ProGlu: 3.833 ± 0.027
1.724ProPhe: 1.724 ± 0.014
4.008ProGly: 4.008 ± 0.045
1.551ProHis: 1.551 ± 0.014
1.861ProIle: 1.861 ± 0.013
2.559ProLys: 2.559 ± 0.031
4.901ProLeu: 4.901 ± 0.027
1.046ProMet: 1.046 ± 0.011
1.911ProAsn: 1.911 ± 0.018
5.772ProPro: 5.772 ± 0.058
2.755ProGln: 2.755 ± 0.022
2.879ProArg: 2.879 ± 0.022
5.723ProSer: 5.723 ± 0.036
3.325ProThr: 3.325 ± 0.029
3.858ProVal: 3.858 ± 0.024
0.547ProTrp: 0.547 ± 0.008
1.364ProTyr: 1.364 ± 0.013
0.0ProXaa: 0.0 ± 0.0
Gln
3.223GlnAla: 3.223 ± 0.023
0.974GlnCys: 0.974 ± 0.011
2.385GlnAsp: 2.385 ± 0.016
3.626GlnGlu: 3.626 ± 0.028
1.28GlnPhe: 1.28 ± 0.016
2.764GlnGly: 2.764 ± 0.019
1.441GlnHis: 1.441 ± 0.016
1.86GlnIle: 1.86 ± 0.016
2.534GlnLys: 2.534 ± 0.021
4.488GlnLeu: 4.488 ± 0.027
1.128GlnMet: 1.128 ± 0.01
1.762GlnAsn: 1.762 ± 0.016
2.74GlnPro: 2.74 ± 0.023
3.542GlnGln: 3.542 ± 0.047
3.272GlnArg: 3.272 ± 0.021
3.703GlnSer: 3.703 ± 0.032
2.786GlnThr: 2.786 ± 0.029
2.834GlnVal: 2.834 ± 0.02
0.542GlnTrp: 0.542 ± 0.008
1.245GlnTyr: 1.245 ± 0.025
0.0GlnXaa: 0.0 ± 0.0
Arg
3.592ArgAla: 3.592 ± 0.018
1.288ArgCys: 1.288 ± 0.014
3.092ArgAsp: 3.092 ± 0.024
4.09ArgGlu: 4.09 ± 0.033
1.902ArgPhe: 1.902 ± 0.012
3.753ArgGly: 3.753 ± 0.029
1.721ArgHis: 1.721 ± 0.016
2.29ArgIle: 2.29 ± 0.013
3.542ArgLys: 3.542 ± 0.018
5.404ArgLeu: 5.404 ± 0.028
1.309ArgMet: 1.309 ± 0.01
2.077ArgAsn: 2.077 ± 0.013
3.114ArgPro: 3.114 ± 0.026
2.805ArgGln: 2.805 ± 0.021
4.78ArgArg: 4.78 ± 0.033
4.675ArgSer: 4.675 ± 0.027
3.063ArgThr: 3.063 ± 0.019
3.36ArgVal: 3.36 ± 0.022
0.692ArgTrp: 0.692 ± 0.008
1.571ArgTyr: 1.571 ± 0.015
0.0ArgXaa: 0.0 ± 0.0
Ser
5.722SerAla: 5.722 ± 0.027
1.948SerCys: 1.948 ± 0.016
4.47SerAsp: 4.47 ± 0.025
5.022SerGlu: 5.022 ± 0.031
2.964SerPhe: 2.964 ± 0.018
5.617SerGly: 5.617 ± 0.028
2.317SerHis: 2.317 ± 0.015
3.273SerIle: 3.273 ± 0.019
3.998SerLys: 3.998 ± 0.022
8.305SerLeu: 8.305 ± 0.032
1.84SerMet: 1.84 ± 0.011
3.01SerAsn: 3.01 ± 0.022
6.124SerPro: 6.124 ± 0.046
4.007SerGln: 4.007 ± 0.027
4.67SerArg: 4.67 ± 0.026
10.8SerSer: 10.8 ± 0.067
5.126SerThr: 5.126 ± 0.034
5.55SerVal: 5.55 ± 0.028
0.978SerTrp: 0.978 ± 0.01
2.117SerTyr: 2.117 ± 0.016
0.0SerXaa: 0.0 ± 0.0
Thr
4.271ThrAla: 4.271 ± 0.022
1.336ThrCys: 1.336 ± 0.018
3.066ThrAsp: 3.066 ± 0.024
3.948ThrGlu: 3.948 ± 0.03
2.052ThrPhe: 2.052 ± 0.021
3.789ThrGly: 3.789 ± 0.021
1.462ThrHis: 1.462 ± 0.014
2.347ThrIle: 2.347 ± 0.019
2.745ThrLys: 2.745 ± 0.022
5.361ThrLeu: 5.361 ± 0.022
1.276ThrMet: 1.276 ± 0.011
2.048ThrAsn: 2.048 ± 0.026
3.879ThrPro: 3.879 ± 0.03
2.49ThrGln: 2.49 ± 0.021
2.693ThrArg: 2.693 ± 0.02
5.111ThrSer: 5.111 ± 0.03
3.865ThrThr: 3.865 ± 0.069
4.321ThrVal: 4.321 ± 0.037
0.64ThrTrp: 0.64 ± 0.009
1.406ThrTyr: 1.406 ± 0.012
0.0ThrXaa: 0.0 ± 0.0
Val
4.138ValAla: 4.138 ± 0.021
1.689ValCys: 1.689 ± 0.016
3.218ValAsp: 3.218 ± 0.025
4.127ValGlu: 4.127 ± 0.028
2.54ValPhe: 2.54 ± 0.016
3.48ValGly: 3.48 ± 0.023
1.599ValHis: 1.599 ± 0.013
2.901ValIle: 2.901 ± 0.019
3.613ValLys: 3.613 ± 0.026
6.212ValLeu: 6.212 ± 0.028
1.614ValMet: 1.614 ± 0.021
2.397ValAsn: 2.397 ± 0.015
3.299ValPro: 3.299 ± 0.019
2.9ValGln: 2.9 ± 0.022
3.299ValArg: 3.299 ± 0.017
5.379ValSer: 5.379 ± 0.026
4.16ValThr: 4.16 ± 0.042
4.444ValVal: 4.444 ± 0.028
0.761ValTrp: 0.761 ± 0.009
1.769ValTyr: 1.769 ± 0.012
0.0ValXaa: 0.0 ± 0.0
Trp
0.654TrpAla: 0.654 ± 0.008
0.246TrpCys: 0.246 ± 0.004
0.621TrpAsp: 0.621 ± 0.008
0.725TrpGlu: 0.725 ± 0.008
0.434TrpPhe: 0.434 ± 0.007
0.644TrpGly: 0.644 ± 0.009
0.266TrpHis: 0.266 ± 0.005
0.523TrpIle: 0.523 ± 0.006
0.696TrpLys: 0.696 ± 0.008
1.137TrpLeu: 1.137 ± 0.013
0.338TrpMet: 0.338 ± 0.007
0.47TrpAsn: 0.47 ± 0.006
0.439TrpPro: 0.439 ± 0.007
0.477TrpGln: 0.477 ± 0.006
0.755TrpArg: 0.755 ± 0.007
0.929TrpSer: 0.929 ± 0.01
0.728TrpThr: 0.728 ± 0.009
0.673TrpVal: 0.673 ± 0.008
0.18TrpTrp: 0.18 ± 0.004
0.331TrpTyr: 0.331 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.397TyrAla: 1.397 ± 0.012
0.677TyrCys: 0.677 ± 0.008
1.338TyrAsp: 1.338 ± 0.011
1.511TyrGlu: 1.511 ± 0.012
1.109TyrPhe: 1.109 ± 0.01
1.622TyrGly: 1.622 ± 0.014
0.78TyrHis: 0.78 ± 0.009
1.386TyrIle: 1.386 ± 0.025
1.414TyrLys: 1.414 ± 0.019
2.53TyrLeu: 2.53 ± 0.019
0.69TyrMet: 0.69 ± 0.014
1.17TyrAsn: 1.17 ± 0.011
1.307TyrPro: 1.307 ± 0.012
1.221TyrGln: 1.221 ± 0.011
1.656TyrArg: 1.656 ± 0.013
2.284TyrSer: 2.284 ± 0.017
1.644TyrThr: 1.644 ± 0.015
1.625TyrVal: 1.625 ± 0.018
0.358TyrTrp: 0.358 ± 0.006
0.951TyrTyr: 0.951 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.003XaaXaa: 0.003 ± 0.001
Statistics based on 27208 proteins (15347301 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
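One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of the results is reported. This is only an illustration under stated assumptions; the function names are hypothetical, and treating the reported ± value as the standard deviation over bootstrap rounds is an assumption, not something the page confirms.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_dipeptide(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency.

    Resampling is done at the protein level: each round draws
    len(sequences) proteins with replacement.
    """
    rng = random.Random(seed)
    # Precompute (hits, total pairs) per protein so each round is cheap.
    per_protein = []
    for seq in sequences:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAALSS", "AALLK", "SSLLAA", "MKLLSA"]
avg, err = bootstrap_dipeptide(toy, "LL")
print(f"LL: {avg:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects variation between proteins.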

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski