Amino acid dipepetide frequency for Carassius auratus (Goldfish)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.329AlaAla: 5.329 ± 0.02
1.154AlaCys: 1.154 ± 0.007
3.041AlaAsp: 3.041 ± 0.009
4.692AlaGlu: 4.692 ± 0.018
2.087AlaPhe: 2.087 ± 0.008
4.01AlaGly: 4.01 ± 0.017
1.421AlaHis: 1.421 ± 0.005
2.59AlaIle: 2.59 ± 0.008
3.393AlaLys: 3.393 ± 0.015
6.017AlaLeu: 6.017 ± 0.022
1.448AlaMet: 1.448 ± 0.008
2.098AlaAsn: 2.098 ± 0.008
3.552AlaPro: 3.552 ± 0.018
2.988AlaGln: 2.988 ± 0.014
3.039AlaArg: 3.039 ± 0.012
5.409AlaSer: 5.409 ± 0.016
3.324AlaThr: 3.324 ± 0.01
4.59AlaVal: 4.59 ± 0.011
0.581AlaTrp: 0.581 ± 0.004
1.358AlaTyr: 1.358 ± 0.006
0.0AlaXaa: 0.0 ± 0.0
Cys
1.117CysAla: 1.117 ± 0.005
0.56CysCys: 0.56 ± 0.005
1.134CysAsp: 1.134 ± 0.008
1.358CysGlu: 1.358 ± 0.01
0.8CysPhe: 0.8 ± 0.005
1.458CysGly: 1.458 ± 0.01
0.592CysHis: 0.592 ± 0.004
0.932CysIle: 0.932 ± 0.005
1.209CysLys: 1.209 ± 0.007
1.887CysLeu: 1.887 ± 0.008
0.441CysMet: 0.441 ± 0.003
0.807CysAsn: 0.807 ± 0.004
1.148CysPro: 1.148 ± 0.008
1.001CysGln: 1.001 ± 0.006
1.158CysArg: 1.158 ± 0.005
2.102CysSer: 2.102 ± 0.009
1.154CysThr: 1.154 ± 0.006
1.414CysVal: 1.414 ± 0.007
0.252CysTrp: 0.252 ± 0.002
0.532CysTyr: 0.532 ± 0.004
0.0CysXaa: 0.0 ± 0.0
Asp
3.021AspAla: 3.021 ± 0.009
1.117AspCys: 1.117 ± 0.006
3.158AspAsp: 3.158 ± 0.013
4.009AspGlu: 4.009 ± 0.013
1.997AspPhe: 1.997 ± 0.008
3.748AspGly: 3.748 ± 0.019
1.214AspHis: 1.214 ± 0.005
2.808AspIle: 2.808 ± 0.011
2.823AspLys: 2.823 ± 0.009
5.033AspLeu: 5.033 ± 0.013
1.276AspMet: 1.276 ± 0.006
1.854AspAsn: 1.854 ± 0.008
2.947AspPro: 2.947 ± 0.011
2.066AspGln: 2.066 ± 0.007
2.642AspArg: 2.642 ± 0.008
4.855AspSer: 4.855 ± 0.016
2.852AspThr: 2.852 ± 0.01
3.433AspVal: 3.433 ± 0.012
0.612AspTrp: 0.612 ± 0.004
1.455AspTyr: 1.455 ± 0.005
0.0AspXaa: 0.0 ± 0.0
Glu
4.634GluAla: 4.634 ± 0.017
1.262GluCys: 1.262 ± 0.007
4.672GluAsp: 4.672 ± 0.011
7.989GluGlu: 7.989 ± 0.029
1.975GluPhe: 1.975 ± 0.008
4.174GluGly: 4.174 ± 0.018
1.556GluHis: 1.556 ± 0.007
3.29GluIle: 3.29 ± 0.018
5.434GluLys: 5.434 ± 0.029
6.216GluLeu: 6.216 ± 0.023
1.882GluMet: 1.882 ± 0.008
3.068GluAsn: 3.068 ± 0.011
3.164GluPro: 3.164 ± 0.018
3.305GluGln: 3.305 ± 0.019
4.546GluArg: 4.546 ± 0.018
5.018GluSer: 5.018 ± 0.016
3.843GluThr: 3.843 ± 0.014
4.431GluVal: 4.431 ± 0.016
0.696GluTrp: 0.696 ± 0.005
1.752GluTyr: 1.752 ± 0.016
0.001GluXaa: 0.001 ± 0.0
Phe
1.711PheAla: 1.711 ± 0.007
0.811PheCys: 0.811 ± 0.006
1.68PheAsp: 1.68 ± 0.006
1.937PheGlu: 1.937 ± 0.008
1.356PhePhe: 1.356 ± 0.007
2.039PheGly: 2.039 ± 0.01
0.931PheHis: 0.931 ± 0.004
1.851PheIle: 1.851 ± 0.007
1.867PheLys: 1.867 ± 0.01
3.314PheLeu: 3.314 ± 0.015
0.757PheMet: 0.757 ± 0.004
1.38PheAsn: 1.38 ± 0.005
1.698PhePro: 1.698 ± 0.008
1.54PheGln: 1.54 ± 0.007
1.905PheArg: 1.905 ± 0.014
3.245PheSer: 3.245 ± 0.012
2.246PheThr: 2.246 ± 0.009
1.984PheVal: 1.984 ± 0.008
0.409PheTrp: 0.409 ± 0.003
1.033PheTyr: 1.033 ± 0.005
0.0PheXaa: 0.0 ± 0.0
Gly
3.625GlyAla: 3.625 ± 0.014
1.101GlyCys: 1.101 ± 0.005
3.145GlyAsp: 3.145 ± 0.012
4.169GlyGlu: 4.169 ± 0.02
2.205GlyPhe: 2.205 ± 0.009
4.571GlyGly: 4.571 ± 0.023
1.579GlyHis: 1.579 ± 0.007
2.571GlyIle: 2.571 ± 0.013
3.641GlyLys: 3.641 ± 0.016
5.108GlyLeu: 5.108 ± 0.013
1.396GlyMet: 1.396 ± 0.007
2.421GlyAsn: 2.421 ± 0.009
3.347GlyPro: 3.347 ± 0.025
3.082GlyGln: 3.082 ± 0.031
3.465GlyArg: 3.465 ± 0.013
6.147GlySer: 6.147 ± 0.036
3.367GlyThr: 3.367 ± 0.013
3.846GlyVal: 3.846 ± 0.015
0.678GlyTrp: 0.678 ± 0.004
1.817GlyTyr: 1.817 ± 0.01
0.001GlyXaa: 0.001 ± 0.0
His
1.273HisAla: 1.273 ± 0.006
0.745HisCys: 0.745 ± 0.005
1.002HisAsp: 1.002 ± 0.004
1.358HisGlu: 1.358 ± 0.006
0.994HisPhe: 0.994 ± 0.005
1.478HisGly: 1.478 ± 0.007
0.999HisHis: 0.999 ± 0.008
1.318HisIle: 1.318 ± 0.006
1.35HisLys: 1.35 ± 0.006
2.599HisLeu: 2.599 ± 0.009
0.704HisMet: 0.704 ± 0.005
1.014HisAsn: 1.014 ± 0.004
1.534HisPro: 1.534 ± 0.008
1.296HisGln: 1.296 ± 0.006
1.549HisArg: 1.549 ± 0.007
2.474HisSer: 2.474 ± 0.01
1.694HisThr: 1.694 ± 0.008
1.458HisVal: 1.458 ± 0.006
0.292HisTrp: 0.292 ± 0.002
0.82HisTyr: 0.82 ± 0.004
0.0HisXaa: 0.0 ± 0.0
Ile
2.503IleAla: 2.503 ± 0.007
1.042IleCys: 1.042 ± 0.005
2.163IleAsp: 2.163 ± 0.008
2.755IleGlu: 2.755 ± 0.017
1.663IlePhe: 1.663 ± 0.007
2.34IleGly: 2.34 ± 0.018
1.308IleHis: 1.308 ± 0.006
2.528IleIle: 2.528 ± 0.012
2.916IleLys: 2.916 ± 0.016
4.091IleLeu: 4.091 ± 0.01
1.044IleMet: 1.044 ± 0.005
2.033IleAsn: 2.033 ± 0.007
2.558IlePro: 2.558 ± 0.01
2.3IleGln: 2.3 ± 0.008
2.5IleArg: 2.5 ± 0.007
4.192IleSer: 4.192 ± 0.013
3.11IleThr: 3.11 ± 0.019
2.636IleVal: 2.636 ± 0.015
0.445IleTrp: 0.445 ± 0.003
1.287IleTyr: 1.287 ± 0.005
0.0IleXaa: 0.0 ± 0.0
Lys
3.834LysAla: 3.834 ± 0.017
1.127LysCys: 1.127 ± 0.007
3.638LysAsp: 3.638 ± 0.018
5.178LysGlu: 5.178 ± 0.029
1.55LysPhe: 1.55 ± 0.008
3.311LysGly: 3.311 ± 0.016
1.514LysHis: 1.514 ± 0.005
2.722LysIle: 2.722 ± 0.013
4.717LysLys: 4.717 ± 0.023
5.03LysLeu: 5.03 ± 0.014
1.479LysMet: 1.479 ± 0.007
2.478LysAsn: 2.478 ± 0.01
3.477LysPro: 3.477 ± 0.021
2.702LysGln: 2.702 ± 0.013
3.566LysArg: 3.566 ± 0.01
4.296LysSer: 4.296 ± 0.012
3.5LysThr: 3.5 ± 0.01
3.634LysVal: 3.634 ± 0.018
0.626LysTrp: 0.626 ± 0.007
1.523LysTyr: 1.523 ± 0.01
0.001LysXaa: 0.001 ± 0.0
Leu
5.407LeuAla: 5.407 ± 0.019
1.972LeuCys: 1.972 ± 0.009
4.79LeuAsp: 4.79 ± 0.013
6.677LeuGlu: 6.677 ± 0.022
2.993LeuPhe: 2.993 ± 0.011
4.505LeuGly: 4.505 ± 0.015
2.591LeuHis: 2.591 ± 0.009
3.724LeuIle: 3.724 ± 0.009
5.856LeuLys: 5.856 ± 0.016
8.989LeuLeu: 8.989 ± 0.034
2.011LeuMet: 2.011 ± 0.009
3.688LeuAsn: 3.688 ± 0.011
5.031LeuPro: 5.031 ± 0.014
5.52LeuGln: 5.52 ± 0.021
5.355LeuArg: 5.355 ± 0.016
8.165LeuSer: 8.165 ± 0.023
5.149LeuThr: 5.149 ± 0.012
4.806LeuVal: 4.806 ± 0.013
0.902LeuTrp: 0.902 ± 0.005
2.322LeuTyr: 2.322 ± 0.009
0.001LeuXaa: 0.001 ± 0.0
Met
1.744MetAla: 1.744 ± 0.007
0.458MetCys: 0.458 ± 0.004
1.419MetAsp: 1.419 ± 0.007
2.038MetGlu: 2.038 ± 0.009
0.802MetPhe: 0.802 ± 0.004
1.359MetGly: 1.359 ± 0.008
0.506MetHis: 0.506 ± 0.003
0.892MetIle: 0.892 ± 0.004
1.558MetLys: 1.558 ± 0.006
1.975MetLeu: 1.975 ± 0.008
0.718MetMet: 0.718 ± 0.005
0.964MetAsn: 0.964 ± 0.006
1.16MetPro: 1.16 ± 0.009
1.036MetGln: 1.036 ± 0.005
1.201MetArg: 1.201 ± 0.005
1.911MetSer: 1.911 ± 0.008
1.272MetThr: 1.272 ± 0.005
1.444MetVal: 1.444 ± 0.006
0.238MetTrp: 0.238 ± 0.003
0.601MetTyr: 0.601 ± 0.004
0.0MetXaa: 0.0 ± 0.0
Asn
2.194AsnAla: 2.194 ± 0.009
0.837AsnCys: 0.837 ± 0.005
1.756AsnAsp: 1.756 ± 0.007
2.322AsnGlu: 2.322 ± 0.008
1.311AsnPhe: 1.311 ± 0.006
2.82AsnGly: 2.82 ± 0.011
0.987AsnHis: 0.987 ± 0.004
2.246AsnIle: 2.246 ± 0.008
2.31AsnLys: 2.31 ± 0.008
3.569AsnLeu: 3.569 ± 0.01
1.03AsnMet: 1.03 ± 0.006
1.772AsnAsn: 1.772 ± 0.008
2.233AsnPro: 2.233 ± 0.009
1.848AsnGln: 1.848 ± 0.008
2.038AsnArg: 2.038 ± 0.007
3.412AsnSer: 3.412 ± 0.011
2.404AsnThr: 2.404 ± 0.008
2.345AsnVal: 2.345 ± 0.011
0.412AsnTrp: 0.412 ± 0.003
1.116AsnTyr: 1.116 ± 0.006
0.0AsnXaa: 0.0 ± 0.0
Pro
4.037ProAla: 4.037 ± 0.016
0.979ProCys: 0.979 ± 0.006
2.952ProAsp: 2.952 ± 0.012
4.12ProGlu: 4.12 ± 0.019
1.755ProPhe: 1.755 ± 0.007
4.13ProGly: 4.13 ± 0.03
1.477ProHis: 1.477 ± 0.007
2.1ProIle: 2.1 ± 0.013
2.903ProLys: 2.903 ± 0.018
4.71ProLeu: 4.71 ± 0.013
1.068ProMet: 1.068 ± 0.006
1.983ProAsn: 1.983 ± 0.008
5.628ProPro: 5.628 ± 0.03
2.822ProGln: 2.822 ± 0.014
2.654ProArg: 2.654 ± 0.01
5.96ProSer: 5.96 ± 0.02
3.249ProThr: 3.249 ± 0.014
4.106ProVal: 4.106 ± 0.016
0.456ProTrp: 0.456 ± 0.003
1.374ProTyr: 1.374 ± 0.008
0.001ProXaa: 0.001 ± 0.0
Gln
3.134GlnAla: 3.134 ± 0.014
1.039GlnCys: 1.039 ± 0.008
2.425GlnAsp: 2.425 ± 0.01
3.717GlnGlu: 3.717 ± 0.015
1.368GlnPhe: 1.368 ± 0.006
3.035GlnGly: 3.035 ± 0.039
1.406GlnHis: 1.406 ± 0.008
2.104GlnIle: 2.104 ± 0.007
2.842GlnLys: 2.842 ± 0.012
4.324GlnLeu: 4.324 ± 0.022
1.252GlnMet: 1.252 ± 0.007
1.949GlnAsn: 1.949 ± 0.007
2.619GlnPro: 2.619 ± 0.013
3.43GlnGln: 3.43 ± 0.032
3.028GlnArg: 3.028 ± 0.013
3.892GlnSer: 3.892 ± 0.018
2.873GlnThr: 2.873 ± 0.012
2.626GlnVal: 2.626 ± 0.008
0.561GlnTrp: 0.561 ± 0.004
1.273GlnTyr: 1.273 ± 0.005
0.0GlnXaa: 0.0 ± 0.0
Arg
3.269ArgAla: 3.269 ± 0.008
1.143ArgCys: 1.143 ± 0.007
2.963ArgAsp: 2.963 ± 0.009
4.07ArgGlu: 4.07 ± 0.016
1.915ArgPhe: 1.915 ± 0.007
3.231ArgGly: 3.231 ± 0.015
1.508ArgHis: 1.508 ± 0.007
2.483ArgIle: 2.483 ± 0.009
3.628ArgLys: 3.628 ± 0.008
5.017ArgLeu: 5.017 ± 0.015
1.29ArgMet: 1.29 ± 0.005
2.135ArgAsn: 2.135 ± 0.006
2.954ArgPro: 2.954 ± 0.014
2.619ArgGln: 2.619 ± 0.011
4.153ArgArg: 4.153 ± 0.015
4.718ArgSer: 4.718 ± 0.02
2.883ArgThr: 2.883 ± 0.008
3.39ArgVal: 3.39 ± 0.019
0.635ArgTrp: 0.635 ± 0.004
1.436ArgTyr: 1.436 ± 0.005
0.001ArgXaa: 0.001 ± 0.0
Ser
5.683SerAla: 5.683 ± 0.015
1.881SerCys: 1.881 ± 0.009
4.779SerAsp: 4.779 ± 0.012
5.653SerGlu: 5.653 ± 0.015
3.064SerPhe: 3.064 ± 0.01
5.945SerGly: 5.945 ± 0.027
2.289SerHis: 2.289 ± 0.009
3.664SerIle: 3.664 ± 0.012
4.355SerLys: 4.355 ± 0.01
8.089SerLeu: 8.089 ± 0.025
1.903SerMet: 1.903 ± 0.008
3.154SerAsn: 3.154 ± 0.01
6.362SerPro: 6.362 ± 0.03
4.169SerGln: 4.169 ± 0.018
4.703SerArg: 4.703 ± 0.015
11.216SerSer: 11.216 ± 0.044
5.328SerThr: 5.328 ± 0.025
5.91SerVal: 5.91 ± 0.015
0.987SerTrp: 0.987 ± 0.006
2.101SerTyr: 2.101 ± 0.007
0.001SerXaa: 0.001 ± 0.0
Thr
4.022ThrAla: 4.022 ± 0.013
1.358ThrCys: 1.358 ± 0.01
3.14ThrAsp: 3.14 ± 0.01
4.235ThrGlu: 4.235 ± 0.013
1.995ThrPhe: 1.995 ± 0.007
3.799ThrGly: 3.799 ± 0.012
1.474ThrHis: 1.474 ± 0.006
2.538ThrIle: 2.538 ± 0.011
2.881ThrLys: 2.881 ± 0.019
5.379ThrLeu: 5.379 ± 0.013
1.166ThrMet: 1.166 ± 0.004
2.047ThrAsn: 2.047 ± 0.011
4.016ThrPro: 4.016 ± 0.018
2.587ThrGln: 2.587 ± 0.011
2.524ThrArg: 2.524 ± 0.007
5.324ThrSer: 5.324 ± 0.02
3.748ThrThr: 3.748 ± 0.041
4.321ThrVal: 4.321 ± 0.017
0.677ThrTrp: 0.677 ± 0.007
1.386ThrTyr: 1.386 ± 0.006
0.001ThrXaa: 0.001 ± 0.0
Val
3.7ValAla: 3.7 ± 0.011
1.59ValCys: 1.59 ± 0.008
3.134ValAsp: 3.134 ± 0.01
4.263ValGlu: 4.263 ± 0.019
2.374ValPhe: 2.374 ± 0.009
3.208ValGly: 3.208 ± 0.014
1.575ValHis: 1.575 ± 0.006
3.035ValIle: 3.035 ± 0.014
4.01ValLys: 4.01 ± 0.022
5.814ValLeu: 5.814 ± 0.013
1.462ValMet: 1.462 ± 0.006
2.544ValAsn: 2.544 ± 0.01
3.419ValPro: 3.419 ± 0.012
2.916ValGln: 2.916 ± 0.009
3.148ValArg: 3.148 ± 0.009
5.679ValSer: 5.679 ± 0.015
4.219ValThr: 4.219 ± 0.028
4.167ValVal: 4.167 ± 0.022
0.692ValTrp: 0.692 ± 0.004
1.678ValTyr: 1.678 ± 0.006
0.001ValXaa: 0.001 ± 0.0
Trp
0.583TrpAla: 0.583 ± 0.003
0.216TrpCys: 0.216 ± 0.002
0.594TrpAsp: 0.594 ± 0.004
0.732TrpGlu: 0.732 ± 0.005
0.434TrpPhe: 0.434 ± 0.004
0.531TrpGly: 0.531 ± 0.004
0.269TrpHis: 0.269 ± 0.002
0.532TrpIle: 0.532 ± 0.003
0.699TrpLys: 0.699 ± 0.005
1.003TrpLeu: 1.003 ± 0.006
0.337TrpMet: 0.337 ± 0.003
0.469TrpAsn: 0.469 ± 0.003
0.416TrpPro: 0.416 ± 0.003
0.455TrpGln: 0.455 ± 0.003
0.67TrpArg: 0.67 ± 0.004
0.91TrpSer: 0.91 ± 0.005
0.698TrpThr: 0.698 ± 0.008
0.621TrpVal: 0.621 ± 0.004
0.155TrpTrp: 0.155 ± 0.002
0.333TrpTyr: 0.333 ± 0.004
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.302TyrAla: 1.302 ± 0.005
0.666TyrCys: 0.666 ± 0.004
1.335TyrAsp: 1.335 ± 0.006
1.636TyrGlu: 1.636 ± 0.006
1.053TyrPhe: 1.053 ± 0.005
1.59TyrGly: 1.59 ± 0.008
0.757TyrHis: 0.757 ± 0.004
1.485TyrIle: 1.485 ± 0.01
1.509TyrLys: 1.509 ± 0.009
2.287TyrLeu: 2.287 ± 0.008
0.632TyrMet: 0.632 ± 0.003
1.103TyrAsn: 1.103 ± 0.005
1.201TyrPro: 1.201 ± 0.007
1.214TyrGln: 1.214 ± 0.005
1.598TyrArg: 1.598 ± 0.006
2.311TyrSer: 2.311 ± 0.008
1.703TyrThr: 1.703 ± 0.011
1.464TyrVal: 1.464 ± 0.007
0.351TyrTrp: 0.351 ± 0.003
0.894TyrTyr: 0.894 ± 0.005
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82965 proteins (61822548 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski