Amino acid dipepetide frequency for Po-Circo-like virus 22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.61AlaAla: 3.61 ± 1.972
1.203AlaCys: 1.203 ± 1.146
2.407AlaAsp: 2.407 ± 1.662
3.61AlaGlu: 3.61 ± 1.467
1.203AlaPhe: 1.203 ± 1.126
7.22AlaGly: 7.22 ± 1.942
2.407AlaHis: 2.407 ± 0.977
7.22AlaIle: 7.22 ± 2.026
0.0AlaLys: 0.0 ± 0.0
6.017AlaLeu: 6.017 ± 2.033
1.203AlaMet: 1.203 ± 0.831
0.0AlaAsn: 0.0 ± 0.0
3.61AlaPro: 3.61 ± 2.304
3.61AlaGln: 3.61 ± 2.458
6.017AlaArg: 6.017 ± 0.238
3.61AlaSer: 3.61 ± 1.466
1.203AlaThr: 1.203 ± 0.831
6.017AlaVal: 6.017 ± 1.632
0.0AlaTrp: 0.0 ± 0.0
2.407AlaTyr: 2.407 ± 0.977
0.0AlaXaa: 0.0 ± 0.0
Cys
1.203CysAla: 1.203 ± 1.146
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.203CysPhe: 1.203 ± 0.831
0.0CysGly: 0.0 ± 0.0
1.203CysHis: 1.203 ± 1.126
0.0CysIle: 0.0 ± 0.0
4.813CysLys: 4.813 ± 1.33
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
2.407CysThr: 2.407 ± 1.24
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.203CysTyr: 1.203 ± 1.126
0.0CysXaa: 0.0 ± 0.0
Asp
6.017AspAla: 6.017 ± 1.475
1.203AspCys: 1.203 ± 0.831
1.203AspAsp: 1.203 ± 0.831
3.61AspGlu: 3.61 ± 2.227
2.407AspPhe: 2.407 ± 2.269
0.0AspGly: 0.0 ± 0.0
0.0AspHis: 0.0 ± 0.0
0.0AspIle: 0.0 ± 0.0
2.407AspLys: 2.407 ± 1.01
1.203AspLeu: 1.203 ± 1.146
1.203AspMet: 1.203 ± 0.831
3.61AspAsn: 3.61 ± 1.467
7.22AspPro: 7.22 ± 4.016
4.813AspGln: 4.813 ± 2.16
4.813AspArg: 4.813 ± 1.094
1.203AspSer: 1.203 ± 1.135
2.407AspThr: 2.407 ± 1.01
3.61AspVal: 3.61 ± 3.404
1.203AspTrp: 1.203 ± 0.831
1.203AspTyr: 1.203 ± 1.135
0.0AspXaa: 0.0 ± 0.0
Glu
8.424GluAla: 8.424 ± 3.462
0.0GluCys: 0.0 ± 0.0
3.61GluAsp: 3.61 ± 2.492
9.627GluGlu: 9.627 ± 5.34
1.203GluPhe: 1.203 ± 0.831
3.61GluGly: 3.61 ± 1.406
1.203GluHis: 1.203 ± 0.831
4.813GluIle: 4.813 ± 2.02
3.61GluLys: 3.61 ± 1.467
6.017GluLeu: 6.017 ± 2.033
2.407GluMet: 2.407 ± 1.548
2.407GluAsn: 2.407 ± 1.409
4.813GluPro: 4.813 ± 1.958
3.61GluGln: 3.61 ± 3.404
1.203GluArg: 1.203 ± 0.831
2.407GluSer: 2.407 ± 0.977
4.813GluThr: 4.813 ± 1.861
3.61GluVal: 3.61 ± 1.78
0.0GluTrp: 0.0 ± 0.0
1.203GluTyr: 1.203 ± 1.135
0.0GluXaa: 0.0 ± 0.0
Phe
2.407PheAla: 2.407 ± 1.409
2.407PheCys: 2.407 ± 1.274
4.813PheAsp: 4.813 ± 3.172
6.017PheGlu: 6.017 ± 1.586
1.203PhePhe: 1.203 ± 1.146
3.61PheGly: 3.61 ± 1.055
1.203PheHis: 1.203 ± 1.135
1.203PheIle: 1.203 ± 1.126
1.203PheLys: 1.203 ± 1.126
1.203PheLeu: 1.203 ± 1.135
2.407PheMet: 2.407 ± 1.34
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
2.407PheArg: 2.407 ± 1.24
6.017PheSer: 6.017 ± 1.918
6.017PheThr: 6.017 ± 4.154
3.61PheVal: 3.61 ± 1.089
0.0PheTrp: 0.0 ± 0.0
3.61PheTyr: 3.61 ± 1.972
0.0PheXaa: 0.0 ± 0.0
Gly
2.407GlyAla: 2.407 ± 1.552
0.0GlyCys: 0.0 ± 0.0
2.407GlyAsp: 2.407 ± 1.552
3.61GlyGlu: 3.61 ± 1.055
3.61GlyPhe: 3.61 ± 2.227
4.813GlyGly: 4.813 ± 2.16
1.203GlyHis: 1.203 ± 0.831
2.407GlyIle: 2.407 ± 2.253
3.61GlyLys: 3.61 ± 2.492
4.813GlyLeu: 4.813 ± 1.952
0.0GlyMet: 0.0 ± 0.0
6.017GlyAsn: 6.017 ± 2.033
2.407GlyPro: 2.407 ± 1.24
1.203GlyGln: 1.203 ± 1.126
2.407GlyArg: 2.407 ± 0.977
4.813GlySer: 4.813 ± 1.695
6.017GlyThr: 6.017 ± 1.694
4.813GlyVal: 4.813 ± 3.545
0.0GlyTrp: 0.0 ± 0.0
1.203GlyTyr: 1.203 ± 0.831
0.0GlyXaa: 0.0 ± 0.0
His
1.203HisAla: 1.203 ± 1.146
0.0HisCys: 0.0 ± 0.0
1.203HisAsp: 1.203 ± 1.135
1.203HisGlu: 1.203 ± 0.831
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
2.407HisIle: 2.407 ± 1.662
0.0HisLys: 0.0 ± 0.0
4.813HisLeu: 4.813 ± 2.16
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
1.203HisGln: 1.203 ± 0.831
2.407HisArg: 2.407 ± 1.01
1.203HisSer: 1.203 ± 1.135
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
1.203HisTrp: 1.203 ± 0.831
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.61IleAla: 3.61 ± 1.467
0.0IleCys: 0.0 ± 0.0
4.813IleAsp: 4.813 ± 2.16
6.017IleGlu: 6.017 ± 1.719
0.0IlePhe: 0.0 ± 0.0
3.61IleGly: 3.61 ± 1.055
2.407IleHis: 2.407 ± 1.662
3.61IleIle: 3.61 ± 1.467
1.203IleLys: 1.203 ± 0.831
2.407IleLeu: 2.407 ± 1.274
1.203IleMet: 1.203 ± 1.146
4.813IleAsn: 4.813 ± 1.094
3.61IlePro: 3.61 ± 2.121
1.203IleGln: 1.203 ± 0.831
1.203IleArg: 1.203 ± 1.135
10.83IleSer: 10.83 ± 6.582
4.813IleThr: 4.813 ± 1.695
2.407IleVal: 2.407 ± 0.977
2.407IleTrp: 2.407 ± 1.662
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.203LysAla: 1.203 ± 0.831
2.407LysCys: 2.407 ± 0.977
2.407LysAsp: 2.407 ± 1.662
2.407LysGlu: 2.407 ± 1.24
2.407LysPhe: 2.407 ± 1.01
4.813LysGly: 4.813 ± 2.02
0.0LysHis: 0.0 ± 0.0
2.407LysIle: 2.407 ± 1.01
2.407LysLys: 2.407 ± 0.977
3.61LysLeu: 3.61 ± 2.492
1.203LysMet: 1.203 ± 1.126
6.017LysAsn: 6.017 ± 3.029
0.0LysPro: 0.0 ± 0.0
1.203LysGln: 1.203 ± 0.831
2.407LysArg: 2.407 ± 1.662
2.407LysSer: 2.407 ± 1.662
3.61LysThr: 3.61 ± 2.304
1.203LysVal: 1.203 ± 1.126
2.407LysTrp: 2.407 ± 1.662
4.813LysTyr: 4.813 ± 1.094
0.0LysXaa: 0.0 ± 0.0
Leu
4.813LeuAla: 4.813 ± 1.958
0.0LeuCys: 0.0 ± 0.0
7.22LeuAsp: 7.22 ± 2.926
4.813LeuGlu: 4.813 ± 2.486
6.017LeuPhe: 6.017 ± 3.327
1.203LeuGly: 1.203 ± 0.831
0.0LeuHis: 0.0 ± 0.0
6.017LeuIle: 6.017 ± 0.238
6.017LeuLys: 6.017 ± 2.925
1.203LeuLeu: 1.203 ± 1.126
0.0LeuMet: 0.0 ± 0.0
4.813LeuAsn: 4.813 ± 1.955
2.407LeuPro: 2.407 ± 2.269
2.407LeuGln: 2.407 ± 1.274
4.813LeuArg: 4.813 ± 2.48
3.61LeuSer: 3.61 ± 2.485
4.813LeuThr: 4.813 ± 1.955
6.017LeuVal: 6.017 ± 1.694
1.203LeuTrp: 1.203 ± 0.831
4.813LeuTyr: 4.813 ± 1.955
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.203MetAsp: 1.203 ± 0.831
0.0MetGlu: 0.0 ± 0.0
3.61MetPhe: 3.61 ± 2.458
1.203MetGly: 1.203 ± 0.831
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
2.407MetLeu: 2.407 ± 1.24
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.61MetPro: 3.61 ± 1.089
2.407MetGln: 2.407 ± 1.24
1.203MetArg: 1.203 ± 0.831
2.407MetSer: 2.407 ± 1.409
1.203MetThr: 1.203 ± 1.126
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.203MetTyr: 1.203 ± 0.831
0.0MetXaa: 0.0 ± 0.0
Asn
3.61AsnAla: 3.61 ± 2.485
0.0AsnCys: 0.0 ± 0.0
1.203AsnAsp: 1.203 ± 1.126
3.61AsnGlu: 3.61 ± 1.406
3.61AsnPhe: 3.61 ± 1.406
4.813AsnGly: 4.813 ± 2.02
1.203AsnHis: 1.203 ± 1.135
4.813AsnIle: 4.813 ± 1.094
2.407AsnLys: 2.407 ± 1.01
4.813AsnLeu: 4.813 ± 1.955
3.61AsnMet: 3.61 ± 1.467
2.407AsnAsn: 2.407 ± 0.977
4.813AsnPro: 4.813 ± 1.33
3.61AsnGln: 3.61 ± 2.485
1.203AsnArg: 1.203 ± 1.135
2.407AsnSer: 2.407 ± 1.409
2.407AsnThr: 2.407 ± 2.253
1.203AsnVal: 1.203 ± 1.146
1.203AsnTrp: 1.203 ± 1.146
3.61AsnTyr: 3.61 ± 2.485
0.0AsnXaa: 0.0 ± 0.0
Pro
2.407ProAla: 2.407 ± 2.269
0.0ProCys: 0.0 ± 0.0
2.407ProAsp: 2.407 ± 1.409
2.407ProGlu: 2.407 ± 1.662
7.22ProPhe: 7.22 ± 2.178
2.407ProGly: 2.407 ± 1.662
2.407ProHis: 2.407 ± 1.01
2.407ProIle: 2.407 ± 2.253
1.203ProLys: 1.203 ± 0.831
1.203ProLeu: 1.203 ± 1.126
1.203ProMet: 1.203 ± 1.146
1.203ProAsn: 1.203 ± 1.146
3.61ProPro: 3.61 ± 1.972
4.813ProGln: 4.813 ± 1.861
1.203ProArg: 1.203 ± 1.146
8.424ProSer: 8.424 ± 1.389
3.61ProThr: 3.61 ± 3.404
4.813ProVal: 4.813 ± 3.148
0.0ProTrp: 0.0 ± 0.0
1.203ProTyr: 1.203 ± 1.135
0.0ProXaa: 0.0 ± 0.0
Gln
3.61GlnAla: 3.61 ± 1.972
2.407GlnCys: 2.407 ± 1.24
3.61GlnAsp: 3.61 ± 2.227
3.61GlnGlu: 3.61 ± 1.78
3.61GlnPhe: 3.61 ± 1.055
6.017GlnGly: 6.017 ± 2.377
1.203GlnHis: 1.203 ± 0.831
3.61GlnIle: 3.61 ± 1.089
1.203GlnLys: 1.203 ± 0.831
2.407GlnLeu: 2.407 ± 1.01
0.0GlnMet: 0.0 ± 0.0
6.017GlnAsn: 6.017 ± 3.029
1.203GlnPro: 1.203 ± 1.126
2.407GlnGln: 2.407 ± 1.24
1.203GlnArg: 1.203 ± 0.831
3.61GlnSer: 3.61 ± 2.134
0.0GlnThr: 0.0 ± 0.0
4.813GlnVal: 4.813 ± 0.642
0.0GlnTrp: 0.0 ± 0.0
1.203GlnTyr: 1.203 ± 1.135
0.0GlnXaa: 0.0 ± 0.0
Arg
6.017ArgAla: 6.017 ± 3.251
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
3.61ArgGlu: 3.61 ± 2.227
2.407ArgPhe: 2.407 ± 1.24
0.0ArgGly: 0.0 ± 0.0
1.203ArgHis: 1.203 ± 0.831
2.407ArgIle: 2.407 ± 0.977
2.407ArgLys: 2.407 ± 1.552
3.61ArgLeu: 3.61 ± 1.466
0.0ArgMet: 0.0 ± 0.0
2.407ArgAsn: 2.407 ± 0.977
0.0ArgPro: 0.0 ± 0.0
3.61ArgGln: 3.61 ± 2.492
8.424ArgArg: 8.424 ± 2.105
7.22ArgSer: 7.22 ± 2.595
4.813ArgThr: 4.813 ± 2.02
4.813ArgVal: 4.813 ± 1.094
1.203ArgTrp: 1.203 ± 0.831
4.813ArgTyr: 4.813 ± 1.958
0.0ArgXaa: 0.0 ± 0.0
Ser
2.407SerAla: 2.407 ± 1.409
0.0SerCys: 0.0 ± 0.0
4.813SerAsp: 4.813 ± 2.112
4.813SerGlu: 4.813 ± 1.33
3.61SerPhe: 3.61 ± 1.466
9.627SerGly: 9.627 ± 5.075
0.0SerHis: 0.0 ± 0.0
4.813SerIle: 4.813 ± 1.695
2.407SerLys: 2.407 ± 0.977
6.017SerLeu: 6.017 ± 1.731
1.203SerMet: 1.203 ± 1.135
6.017SerAsn: 6.017 ± 1.475
1.203SerPro: 1.203 ± 1.126
7.22SerGln: 7.22 ± 3.357
3.61SerArg: 3.61 ± 0.968
4.813SerSer: 4.813 ± 2.549
1.203SerThr: 1.203 ± 1.146
3.61SerVal: 3.61 ± 3.404
0.0SerTrp: 0.0 ± 0.0
6.017SerTyr: 6.017 ± 2.924
0.0SerXaa: 0.0 ± 0.0
Thr
4.813ThrAla: 4.813 ± 0.642
0.0ThrCys: 0.0 ± 0.0
2.407ThrAsp: 2.407 ± 2.269
2.407ThrGlu: 2.407 ± 0.977
0.0ThrPhe: 0.0 ± 0.0
1.203ThrGly: 1.203 ± 1.146
0.0ThrHis: 0.0 ± 0.0
3.61ThrIle: 3.61 ± 1.972
7.22ThrLys: 7.22 ± 3.719
8.424ThrLeu: 8.424 ± 2.285
0.0ThrMet: 0.0 ± 0.0
2.407ThrAsn: 2.407 ± 2.253
4.813ThrPro: 4.813 ± 1.861
2.407ThrGln: 2.407 ± 1.274
8.424ThrArg: 8.424 ± 4.281
2.407ThrSer: 2.407 ± 1.01
3.61ThrThr: 3.61 ± 1.972
4.813ThrVal: 4.813 ± 1.861
1.203ThrTrp: 1.203 ± 0.831
3.61ThrTyr: 3.61 ± 1.055
0.0ThrXaa: 0.0 ± 0.0
Val
1.203ValAla: 1.203 ± 1.146
1.203ValCys: 1.203 ± 1.126
2.407ValAsp: 2.407 ± 1.01
3.61ValGlu: 3.61 ± 1.78
4.813ValPhe: 4.813 ± 2.48
0.0ValGly: 0.0 ± 0.0
0.0ValHis: 0.0 ± 0.0
4.813ValIle: 4.813 ± 2.093
6.017ValLys: 6.017 ± 4.453
7.22ValLeu: 7.22 ± 1.942
2.407ValMet: 2.407 ± 1.18
3.61ValAsn: 3.61 ± 3.379
4.813ValPro: 4.813 ± 2.202
3.61ValGln: 3.61 ± 1.406
4.813ValArg: 4.813 ± 2.093
2.407ValSer: 2.407 ± 1.274
3.61ValThr: 3.61 ± 2.287
1.203ValVal: 1.203 ± 0.831
1.203ValTrp: 1.203 ± 1.146
2.407ValTyr: 2.407 ± 1.552
0.0ValXaa: 0.0 ± 0.0
Trp
1.203TrpAla: 1.203 ± 0.831
0.0TrpCys: 0.0 ± 0.0
1.203TrpAsp: 1.203 ± 0.831
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.203TrpGly: 1.203 ± 0.831
0.0TrpHis: 0.0 ± 0.0
1.203TrpIle: 1.203 ± 0.831
0.0TrpLys: 0.0 ± 0.0
1.203TrpLeu: 1.203 ± 0.831
0.0TrpMet: 0.0 ± 0.0
2.407TrpAsn: 2.407 ± 1.662
2.407TrpPro: 2.407 ± 0.977
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.203TrpSer: 1.203 ± 1.146
0.0TrpThr: 0.0 ± 0.0
1.203TrpVal: 1.203 ± 0.831
1.203TrpTrp: 1.203 ± 0.831
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.407TyrAla: 2.407 ± 2.292
1.203TyrCys: 1.203 ± 0.831
0.0TyrAsp: 0.0 ± 0.0
4.813TyrGlu: 4.813 ± 1.33
1.203TyrPhe: 1.203 ± 1.126
2.407TyrGly: 2.407 ± 1.274
1.203TyrHis: 1.203 ± 0.831
2.407TyrIle: 2.407 ± 1.274
1.203TyrLys: 1.203 ± 1.126
3.61TyrLeu: 3.61 ± 1.972
1.203TyrMet: 1.203 ± 0.831
2.407TyrAsn: 2.407 ± 0.977
3.61TyrPro: 3.61 ± 2.492
2.407TyrGln: 2.407 ± 1.24
1.203TyrArg: 1.203 ± 0.831
2.407TyrSer: 2.407 ± 2.292
7.22TyrThr: 7.22 ± 4.657
3.61TyrVal: 3.61 ± 2.458
0.0TyrTrp: 0.0 ± 0.0
2.407TyrTyr: 2.407 ± 2.253
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (832 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski