Amino acid dipepetide frequency for Stentor coeruleus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.051AlaAla: 2.051 ± 0.016
0.931AlaCys: 0.931 ± 0.012
1.977AlaAsp: 1.977 ± 0.015
3.171AlaGlu: 3.171 ± 0.021
2.222AlaPhe: 2.222 ± 0.017
2.032AlaGly: 2.032 ± 0.016
0.795AlaHis: 0.795 ± 0.009
3.442AlaIle: 3.442 ± 0.019
3.85AlaLys: 3.85 ± 0.022
4.825AlaLeu: 4.825 ± 0.026
1.069AlaMet: 1.069 ± 0.009
2.318AlaAsn: 2.318 ± 0.014
1.307AlaPro: 1.307 ± 0.013
1.548AlaGln: 1.548 ± 0.011
1.763AlaArg: 1.763 ± 0.014
3.336AlaSer: 3.336 ± 0.017
1.88AlaThr: 1.88 ± 0.014
2.421AlaVal: 2.421 ± 0.015
0.418AlaTrp: 0.418 ± 0.005
1.857AlaTyr: 1.857 ± 0.014
0.004AlaXaa: 0.004 ± 0.001
Cys
0.78CysAla: 0.78 ± 0.009
0.473CysCys: 0.473 ± 0.01
0.974CysAsp: 0.974 ± 0.014
1.372CysGlu: 1.372 ± 0.015
1.009CysPhe: 1.009 ± 0.011
0.978CysGly: 0.978 ± 0.012
0.31CysHis: 0.31 ± 0.005
1.601CysIle: 1.601 ± 0.016
1.727CysLys: 1.727 ± 0.018
1.964CysLeu: 1.964 ± 0.018
0.441CysMet: 0.441 ± 0.006
1.08CysAsn: 1.08 ± 0.016
0.761CysPro: 0.761 ± 0.015
0.673CysGln: 0.673 ± 0.009
0.737CysArg: 0.737 ± 0.009
1.62CysSer: 1.62 ± 0.021
0.964CysThr: 0.964 ± 0.019
0.967CysVal: 0.967 ± 0.01
0.175CysTrp: 0.175 ± 0.004
0.785CysTyr: 0.785 ± 0.01
0.002CysXaa: 0.002 ± 0.0
Asp
1.769AspAla: 1.769 ± 0.015
0.953AspCys: 0.953 ± 0.012
2.789AspAsp: 2.789 ± 0.024
3.918AspGlu: 3.918 ± 0.025
3.184AspPhe: 3.184 ± 0.016
2.085AspGly: 2.085 ± 0.02
0.909AspHis: 0.909 ± 0.01
4.598AspIle: 4.598 ± 0.021
4.286AspLys: 4.286 ± 0.021
5.445AspLeu: 5.445 ± 0.026
1.321AspMet: 1.321 ± 0.012
2.866AspAsn: 2.866 ± 0.015
1.802AspPro: 1.802 ± 0.011
1.598AspGln: 1.598 ± 0.011
1.79AspArg: 1.79 ± 0.014
3.924AspSer: 3.924 ± 0.022
2.272AspThr: 2.272 ± 0.016
2.192AspVal: 2.192 ± 0.013
0.529AspTrp: 0.529 ± 0.007
2.413AspTyr: 2.413 ± 0.015
0.005AspXaa: 0.005 ± 0.001
Glu
3.329GluAla: 3.329 ± 0.02
1.223GluCys: 1.223 ± 0.015
4.189GluAsp: 4.189 ± 0.023
7.107GluGlu: 7.107 ± 0.052
3.373GluPhe: 3.373 ± 0.016
2.931GluGly: 2.931 ± 0.018
1.174GluHis: 1.174 ± 0.013
7.327GluIle: 7.327 ± 0.029
8.539GluLys: 8.539 ± 0.047
6.852GluLeu: 6.852 ± 0.037
1.795GluMet: 1.795 ± 0.012
6.155GluAsn: 6.155 ± 0.034
1.677GluPro: 1.677 ± 0.015
2.304GluGln: 2.304 ± 0.017
2.846GluArg: 2.846 ± 0.023
5.197GluSer: 5.197 ± 0.027
3.388GluThr: 3.388 ± 0.02
3.825GluVal: 3.825 ± 0.021
0.598GluTrp: 0.598 ± 0.007
2.755GluTyr: 2.755 ± 0.014
0.006GluXaa: 0.006 ± 0.001
Phe
2.103PheAla: 2.103 ± 0.015
0.996PheCys: 0.996 ± 0.01
2.628PheAsp: 2.628 ± 0.015
3.274PheGlu: 3.274 ± 0.017
2.415PhePhe: 2.415 ± 0.018
2.314PheGly: 2.314 ± 0.015
0.884PheHis: 0.884 ± 0.008
4.112PheIle: 4.112 ± 0.022
3.693PheLys: 3.693 ± 0.02
4.693PheLeu: 4.693 ± 0.024
1.162PheMet: 1.162 ± 0.01
2.728PheAsn: 2.728 ± 0.018
1.697PhePro: 1.697 ± 0.013
1.646PheGln: 1.646 ± 0.011
1.86PheArg: 1.86 ± 0.014
4.457PheSer: 4.457 ± 0.021
2.73PheThr: 2.73 ± 0.016
2.262PheVal: 2.262 ± 0.016
0.48PheTrp: 0.48 ± 0.006
2.193PheTyr: 2.193 ± 0.016
0.004PheXaa: 0.004 ± 0.001
Gly
1.696GlyAla: 1.696 ± 0.017
0.956GlyCys: 0.956 ± 0.013
2.152GlyAsp: 2.152 ± 0.017
2.753GlyGlu: 2.753 ± 0.017
2.423GlyPhe: 2.423 ± 0.017
2.231GlyGly: 2.231 ± 0.025
1.043GlyHis: 1.043 ± 0.012
3.728GlyIle: 3.728 ± 0.02
4.118GlyLys: 4.118 ± 0.023
3.922GlyLeu: 3.922 ± 0.019
0.998GlyMet: 0.998 ± 0.011
2.67GlyAsn: 2.67 ± 0.017
1.218GlyPro: 1.218 ± 0.015
1.468GlyGln: 1.468 ± 0.012
1.703GlyArg: 1.703 ± 0.014
3.382GlySer: 3.382 ± 0.021
2.237GlyThr: 2.237 ± 0.02
2.438GlyVal: 2.438 ± 0.017
0.414GlyTrp: 0.414 ± 0.007
1.949GlyTyr: 1.949 ± 0.02
0.004GlyXaa: 0.004 ± 0.001
His
0.708HisAla: 0.708 ± 0.008
0.393HisCys: 0.393 ± 0.006
0.841HisAsp: 0.841 ± 0.008
1.363HisGlu: 1.363 ± 0.015
0.941HisPhe: 0.941 ± 0.009
0.847HisGly: 0.847 ± 0.011
0.431HisHis: 0.431 ± 0.011
1.514HisIle: 1.514 ± 0.011
1.689HisLys: 1.689 ± 0.012
1.796HisLeu: 1.796 ± 0.012
0.43HisMet: 0.43 ± 0.006
1.087HisAsn: 1.087 ± 0.009
0.831HisPro: 0.831 ± 0.008
0.633HisGln: 0.633 ± 0.007
0.86HisArg: 0.86 ± 0.009
1.548HisSer: 1.548 ± 0.012
0.942HisThr: 0.942 ± 0.009
0.762HisVal: 0.762 ± 0.008
0.177HisTrp: 0.177 ± 0.004
0.742HisTyr: 0.742 ± 0.008
0.001HisXaa: 0.001 ± 0.0
Ile
3.569IleAla: 3.569 ± 0.019
1.712IleCys: 1.712 ± 0.015
4.353IleAsp: 4.353 ± 0.021
6.688IleGlu: 6.688 ± 0.028
4.063IlePhe: 4.063 ± 0.023
3.564IleGly: 3.564 ± 0.019
1.484IleHis: 1.484 ± 0.011
6.941IleIle: 6.941 ± 0.032
7.732IleLys: 7.732 ± 0.031
7.859IleLeu: 7.859 ± 0.037
1.764IleMet: 1.764 ± 0.013
4.907IleAsn: 4.907 ± 0.024
3.23IlePro: 3.23 ± 0.019
3.292IleGln: 3.292 ± 0.017
3.32IleArg: 3.32 ± 0.017
7.589IleSer: 7.589 ± 0.028
4.089IleThr: 4.089 ± 0.021
4.188IleVal: 4.188 ± 0.022
0.876IleTrp: 0.876 ± 0.011
3.174IleTyr: 3.174 ± 0.02
0.006IleXaa: 0.006 ± 0.001
Lys
4.213LysAla: 4.213 ± 0.023
1.658LysCys: 1.658 ± 0.019
4.854LysAsp: 4.854 ± 0.026
7.04LysGlu: 7.04 ± 0.041
3.72LysPhe: 3.72 ± 0.019
3.393LysGly: 3.393 ± 0.021
1.799LysHis: 1.799 ± 0.013
8.448LysIle: 8.448 ± 0.034
9.482LysLys: 9.482 ± 0.047
8.111LysLeu: 8.111 ± 0.034
1.906LysMet: 1.906 ± 0.013
6.801LysAsn: 6.801 ± 0.032
3.141LysPro: 3.141 ± 0.023
3.121LysGln: 3.121 ± 0.02
3.638LysArg: 3.638 ± 0.024
7.406LysSer: 7.406 ± 0.031
4.899LysThr: 4.899 ± 0.021
4.582LysVal: 4.582 ± 0.021
0.698LysTrp: 0.698 ± 0.009
3.535LysTyr: 3.535 ± 0.018
0.008LysXaa: 0.008 ± 0.001
Leu
4.704LeuAla: 4.704 ± 0.024
1.841LeuCys: 1.841 ± 0.015
4.703LeuAsp: 4.703 ± 0.024
7.799LeuGlu: 7.799 ± 0.035
3.911LeuPhe: 3.911 ± 0.021
4.291LeuGly: 4.291 ± 0.021
1.822LeuHis: 1.822 ± 0.013
7.295LeuIle: 7.295 ± 0.031
9.416LeuLys: 9.416 ± 0.037
8.56LeuLeu: 8.56 ± 0.035
2.1LeuMet: 2.1 ± 0.012
5.927LeuAsn: 5.927 ± 0.023
3.344LeuPro: 3.344 ± 0.017
3.98LeuGln: 3.98 ± 0.022
4.0LeuArg: 4.0 ± 0.021
8.208LeuSer: 8.208 ± 0.03
4.793LeuThr: 4.793 ± 0.022
4.513LeuVal: 4.513 ± 0.021
0.86LeuTrp: 0.86 ± 0.01
3.486LeuTyr: 3.486 ± 0.02
0.009LeuXaa: 0.009 ± 0.001
Met
1.057MetAla: 1.057 ± 0.009
0.389MetCys: 0.389 ± 0.006
1.078MetAsp: 1.078 ± 0.009
1.622MetGlu: 1.622 ± 0.013
0.99MetPhe: 0.99 ± 0.009
0.963MetGly: 0.963 ± 0.009
0.522MetHis: 0.522 ± 0.006
2.037MetIle: 2.037 ± 0.013
2.239MetLys: 2.239 ± 0.013
2.156MetLeu: 2.156 ± 0.016
0.613MetMet: 0.613 ± 0.009
1.515MetAsn: 1.515 ± 0.012
0.821MetPro: 0.821 ± 0.011
1.011MetGln: 1.011 ± 0.01
0.944MetArg: 0.944 ± 0.01
1.802MetSer: 1.802 ± 0.012
1.055MetThr: 1.055 ± 0.01
1.003MetVal: 1.003 ± 0.01
0.236MetTrp: 0.236 ± 0.004
0.712MetTyr: 0.712 ± 0.007
0.002MetXaa: 0.002 ± 0.0
Asn
2.365AsnAla: 2.365 ± 0.013
1.292AsnCys: 1.292 ± 0.017
3.153AsnAsp: 3.153 ± 0.016
4.662AsnGlu: 4.662 ± 0.025
3.407AsnPhe: 3.407 ± 0.016
2.195AsnGly: 2.195 ± 0.017
1.201AsnHis: 1.201 ± 0.011
5.419AsnIle: 5.419 ± 0.023
5.248AsnLys: 5.248 ± 0.023
6.357AsnLeu: 6.357 ± 0.024
1.493AsnMet: 1.493 ± 0.012
3.748AsnAsn: 3.748 ± 0.02
2.726AsnPro: 2.726 ± 0.022
2.365AsnGln: 2.365 ± 0.016
2.231AsnArg: 2.231 ± 0.014
5.64AsnSer: 5.64 ± 0.024
3.359AsnThr: 3.359 ± 0.018
2.407AsnVal: 2.407 ± 0.015
0.578AsnTrp: 0.578 ± 0.007
2.82AsnTyr: 2.82 ± 0.017
0.005AsnXaa: 0.005 ± 0.001
Pro
1.398ProAla: 1.398 ± 0.011
0.576ProCys: 0.576 ± 0.01
1.819ProAsp: 1.819 ± 0.013
3.019ProGlu: 3.019 ± 0.019
1.49ProPhe: 1.49 ± 0.013
1.78ProGly: 1.78 ± 0.022
0.6ProHis: 0.6 ± 0.007
2.773ProIle: 2.773 ± 0.016
3.158ProLys: 3.158 ± 0.022
3.069ProLeu: 3.069 ± 0.018
0.684ProMet: 0.684 ± 0.009
2.219ProAsn: 2.219 ± 0.016
1.652ProPro: 1.652 ± 0.032
1.481ProGln: 1.481 ± 0.013
1.422ProArg: 1.422 ± 0.013
3.098ProSer: 3.098 ± 0.022
1.685ProThr: 1.685 ± 0.014
1.858ProVal: 1.858 ± 0.012
0.345ProTrp: 0.345 ± 0.006
1.351ProTyr: 1.351 ± 0.011
0.002ProXaa: 0.002 ± 0.0
Gln
1.91GlnAla: 1.91 ± 0.012
0.629GlnCys: 0.629 ± 0.01
1.966GlnAsp: 1.966 ± 0.012
3.406GlnGlu: 3.406 ± 0.022
1.298GlnPhe: 1.298 ± 0.011
1.751GlnGly: 1.751 ± 0.014
0.597GlnHis: 0.597 ± 0.007
3.102GlnIle: 3.102 ± 0.016
3.635GlnLys: 3.635 ± 0.023
3.08GlnLeu: 3.08 ± 0.021
0.814GlnMet: 0.814 ± 0.009
2.627GlnAsn: 2.627 ± 0.017
0.987GlnPro: 0.987 ± 0.011
1.396GlnGln: 1.396 ± 0.013
1.509GlnArg: 1.509 ± 0.013
2.717GlnSer: 2.717 ± 0.014
1.777GlnThr: 1.777 ± 0.012
1.978GlnVal: 1.978 ± 0.014
0.291GlnTrp: 0.291 ± 0.005
1.201GlnTyr: 1.201 ± 0.01
0.003GlnXaa: 0.003 ± 0.0
Arg
1.802ArgAla: 1.802 ± 0.013
0.599ArgCys: 0.599 ± 0.008
1.965ArgAsp: 1.965 ± 0.015
2.983ArgGlu: 2.983 ± 0.021
1.774ArgPhe: 1.774 ± 0.013
1.688ArgGly: 1.688 ± 0.015
0.731ArgHis: 0.731 ± 0.009
3.404ArgIle: 3.404 ± 0.021
4.022ArgLys: 4.022 ± 0.026
3.57ArgLeu: 3.57 ± 0.02
0.934ArgMet: 0.934 ± 0.009
2.607ArgAsn: 2.607 ± 0.016
1.402ArgPro: 1.402 ± 0.013
1.392ArgGln: 1.392 ± 0.011
1.884ArgArg: 1.884 ± 0.016
2.925ArgSer: 2.925 ± 0.019
1.811ArgThr: 1.811 ± 0.015
2.15ArgVal: 2.15 ± 0.014
0.344ArgTrp: 0.344 ± 0.006
1.456ArgTyr: 1.456 ± 0.01
0.003ArgXaa: 0.003 ± 0.0
Ser
3.364SerAla: 3.364 ± 0.018
1.635SerCys: 1.635 ± 0.021
3.895SerAsp: 3.895 ± 0.019
5.87SerGlu: 5.87 ± 0.028
4.226SerPhe: 4.226 ± 0.02
3.745SerGly: 3.745 ± 0.027
1.519SerHis: 1.519 ± 0.01
6.824SerIle: 6.824 ± 0.029
6.989SerLys: 6.989 ± 0.027
8.755SerLeu: 8.755 ± 0.026
1.856SerMet: 1.856 ± 0.013
4.769SerAsn: 4.769 ± 0.02
3.399SerPro: 3.399 ± 0.024
3.409SerGln: 3.409 ± 0.016
3.252SerArg: 3.252 ± 0.02
7.92SerSer: 7.92 ± 0.048
3.972SerThr: 3.972 ± 0.025
3.891SerVal: 3.891 ± 0.018
0.691SerTrp: 0.691 ± 0.007
3.087SerTyr: 3.087 ± 0.019
0.006SerXaa: 0.006 ± 0.001
Thr
2.151ThrAla: 2.151 ± 0.016
1.055ThrCys: 1.055 ± 0.021
2.217ThrAsp: 2.217 ± 0.016
3.591ThrGlu: 3.591 ± 0.02
2.468ThrPhe: 2.468 ± 0.016
2.504ThrGly: 2.504 ± 0.02
0.92ThrHis: 0.92 ± 0.008
4.064ThrIle: 4.064 ± 0.023
3.933ThrLys: 3.933 ± 0.018
4.965ThrLeu: 4.965 ± 0.023
1.038ThrMet: 1.038 ± 0.01
2.711ThrAsn: 2.711 ± 0.016
2.224ThrPro: 2.224 ± 0.015
1.951ThrGln: 1.951 ± 0.014
1.846ThrArg: 1.846 ± 0.012
4.244ThrSer: 4.244 ± 0.026
2.455ThrThr: 2.455 ± 0.024
2.495ThrVal: 2.495 ± 0.018
0.485ThrTrp: 0.485 ± 0.007
1.843ThrTyr: 1.843 ± 0.015
0.004ThrXaa: 0.004 ± 0.001
Val
2.121ValAla: 2.121 ± 0.015
1.014ValCys: 1.014 ± 0.012
2.411ValAsp: 2.411 ± 0.017
3.32ValGlu: 3.32 ± 0.019
2.789ValPhe: 2.789 ± 0.017
2.079ValGly: 2.079 ± 0.015
0.922ValHis: 0.922 ± 0.009
3.925ValIle: 3.925 ± 0.021
4.428ValLys: 4.428 ± 0.02
4.93ValLeu: 4.93 ± 0.025
1.138ValMet: 1.138 ± 0.01
2.802ValAsn: 2.802 ± 0.015
1.717ValPro: 1.717 ± 0.011
1.707ValGln: 1.707 ± 0.012
1.933ValArg: 1.933 ± 0.014
3.937ValSer: 3.937 ± 0.023
2.296ValThr: 2.296 ± 0.017
2.755ValVal: 2.755 ± 0.019
0.592ValTrp: 0.592 ± 0.008
2.166ValTyr: 2.166 ± 0.014
0.004ValXaa: 0.004 ± 0.001
Trp
0.459TrpAla: 0.459 ± 0.007
0.161TrpCys: 0.161 ± 0.004
0.618TrpAsp: 0.618 ± 0.009
0.657TrpGlu: 0.657 ± 0.007
0.375TrpPhe: 0.375 ± 0.005
0.425TrpGly: 0.425 ± 0.007
0.168TrpHis: 0.168 ± 0.004
0.733TrpIle: 0.733 ± 0.007
0.923TrpLys: 0.923 ± 0.011
0.773TrpLeu: 0.773 ± 0.008
0.217TrpMet: 0.217 ± 0.004
0.694TrpAsn: 0.694 ± 0.01
0.266TrpPro: 0.266 ± 0.005
0.27TrpGln: 0.27 ± 0.004
0.378TrpArg: 0.378 ± 0.006
0.719TrpSer: 0.719 ± 0.008
0.459TrpThr: 0.459 ± 0.006
0.607TrpVal: 0.607 ± 0.006
0.087TrpTrp: 0.087 ± 0.003
0.301TrpTyr: 0.301 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.667TyrAla: 1.667 ± 0.012
0.905TyrCys: 0.905 ± 0.015
2.062TyrAsp: 2.062 ± 0.014
2.97TyrGlu: 2.97 ± 0.017
2.248TyrPhe: 2.248 ± 0.018
1.704TyrGly: 1.704 ± 0.013
0.702TyrHis: 0.702 ± 0.008
3.1TyrIle: 3.1 ± 0.017
3.338TyrLys: 3.338 ± 0.018
3.87TyrLeu: 3.87 ± 0.019
0.992TyrMet: 0.992 ± 0.008
2.435TyrAsn: 2.435 ± 0.018
1.274TyrPro: 1.274 ± 0.012
1.428TyrGln: 1.428 ± 0.011
1.527TyrArg: 1.527 ± 0.011
3.399TyrSer: 3.399 ± 0.019
2.098TyrThr: 2.098 ± 0.016
1.688TyrVal: 1.688 ± 0.013
0.414TyrTrp: 0.414 ± 0.006
1.914TyrTyr: 1.914 ± 0.02
0.003TyrXaa: 0.003 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.0
0.002XaaCys: 0.002 ± 0.0
0.003XaaAsp: 0.003 ± 0.0
0.004XaaGlu: 0.004 ± 0.001
0.004XaaPhe: 0.004 ± 0.001
0.003XaaGly: 0.003 ± 0.001
0.002XaaHis: 0.002 ± 0.0
0.009XaaIle: 0.009 ± 0.001
0.007XaaLys: 0.007 ± 0.001
0.009XaaLeu: 0.009 ± 0.001
0.004XaaMet: 0.004 ± 0.001
0.005XaaAsn: 0.005 ± 0.001
0.002XaaPro: 0.002 ± 0.0
0.002XaaGln: 0.002 ± 0.0
0.004XaaArg: 0.004 ± 0.001
0.006XaaSer: 0.006 ± 0.001
0.004XaaThr: 0.004 ± 0.001
0.004XaaVal: 0.004 ± 0.001
0.001XaaTrp: 0.001 ± 0.0
0.003XaaTyr: 0.003 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 30969 proteins (13122840 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski