Amino acid dipepetide frequency for Beihai picorna-like virus 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.885AlaAla: 2.885 ± 0.415
0.721AlaCys: 0.721 ± 0.404
3.967AlaAsp: 3.967 ± 0.421
2.885AlaGlu: 2.885 ± 1.015
2.885AlaPhe: 2.885 ± 0.415
6.852AlaGly: 6.852 ± 0.961
1.803AlaHis: 1.803 ± 0.409
3.967AlaIle: 3.967 ± 0.421
3.246AlaLys: 3.246 ± 0.018
4.327AlaLeu: 4.327 ± 2.421
2.524AlaMet: 2.524 ± 0.72
3.967AlaAsn: 3.967 ± 0.178
3.967AlaPro: 3.967 ± 0.777
1.442AlaGln: 1.442 ± 0.208
3.967AlaArg: 3.967 ± 0.178
6.491AlaSer: 6.491 ± 4.16
5.049AlaThr: 5.049 ± 1.97
6.852AlaVal: 6.852 ± 0.237
0.721AlaTrp: 0.721 ± 0.196
1.442AlaTyr: 1.442 ± 0.807
0.0AlaXaa: 0.0 ± 0.0
Cys
1.082CysAla: 1.082 ± 0.605
0.0CysCys: 0.0 ± 0.0
2.164CysAsp: 2.164 ± 0.611
1.442CysGlu: 1.442 ± 0.807
0.361CysPhe: 0.361 ± 0.202
1.082CysGly: 1.082 ± 0.605
0.0CysHis: 0.0 ± 0.0
0.361CysIle: 0.361 ± 0.398
0.721CysLys: 0.721 ± 0.404
0.361CysLeu: 0.361 ± 0.202
0.361CysMet: 0.361 ± 0.202
1.803CysAsn: 1.803 ± 0.789
0.721CysPro: 0.721 ± 0.404
0.0CysGln: 0.0 ± 0.0
1.082CysArg: 1.082 ± 0.605
0.0CysSer: 0.0 ± 0.0
0.361CysThr: 0.361 ± 0.398
1.803CysVal: 1.803 ± 1.009
0.0CysTrp: 0.0 ± 0.0
0.721CysTyr: 0.721 ± 0.196
0.0CysXaa: 0.0 ± 0.0
Asp
3.967AspAla: 3.967 ± 0.178
0.721AspCys: 0.721 ± 0.404
5.77AspAsp: 5.77 ± 0.368
3.606AspGlu: 3.606 ± 0.22
3.246AspPhe: 3.246 ± 1.181
4.327AspGly: 4.327 ± 0.623
0.721AspHis: 0.721 ± 0.196
3.606AspIle: 3.606 ± 1.418
3.967AspLys: 3.967 ± 0.421
8.655AspLeu: 8.655 ± 2.445
2.164AspMet: 2.164 ± 0.012
2.164AspAsn: 2.164 ± 0.587
3.246AspPro: 3.246 ± 1.181
2.524AspGln: 2.524 ± 1.584
2.164AspArg: 2.164 ± 0.012
5.049AspSer: 5.049 ± 1.97
1.803AspThr: 1.803 ± 0.19
3.246AspVal: 3.246 ± 0.018
0.721AspTrp: 0.721 ± 0.196
3.246AspTyr: 3.246 ± 1.216
0.0AspXaa: 0.0 ± 0.0
Glu
3.606GluAla: 3.606 ± 0.979
0.721GluCys: 0.721 ± 0.404
4.327GluAsp: 4.327 ± 0.024
4.327GluGlu: 4.327 ± 0.576
1.803GluPhe: 1.803 ± 0.409
2.164GluGly: 2.164 ± 1.211
0.0GluHis: 0.0 ± 0.0
2.524GluIle: 2.524 ± 0.985
2.885GluLys: 2.885 ± 1.614
2.524GluLeu: 2.524 ± 0.985
1.442GluMet: 1.442 ± 0.392
1.442GluAsn: 1.442 ± 0.208
4.688GluPro: 4.688 ± 0.825
1.082GluGln: 1.082 ± 0.006
1.442GluArg: 1.442 ± 0.807
4.327GluSer: 4.327 ± 1.222
1.803GluThr: 1.803 ± 0.19
6.491GluVal: 6.491 ± 0.036
2.164GluTrp: 2.164 ± 1.211
1.442GluTyr: 1.442 ± 0.208
0.0GluXaa: 0.0 ± 0.0
Phe
5.409PheAla: 5.409 ± 0.03
1.442PheCys: 1.442 ± 0.807
2.524PheAsp: 2.524 ± 0.813
2.885PheGlu: 2.885 ± 1.015
3.246PhePhe: 3.246 ± 0.581
3.967PheGly: 3.967 ± 0.178
2.524PheHis: 2.524 ± 0.386
2.885PheIle: 2.885 ± 0.415
3.606PheLys: 3.606 ± 0.819
7.934PheLeu: 7.934 ± 0.356
1.442PheMet: 1.442 ± 0.208
2.885PheAsn: 2.885 ± 0.415
0.721PhePro: 0.721 ± 0.196
1.803PheGln: 1.803 ± 0.19
2.524PheArg: 2.524 ± 0.386
2.164PheSer: 2.164 ± 0.611
3.606PheThr: 3.606 ± 0.979
2.524PheVal: 2.524 ± 0.214
0.361PheTrp: 0.361 ± 0.398
1.442PheTyr: 1.442 ± 0.392
0.0PheXaa: 0.0 ± 0.0
Gly
4.327GlyAla: 4.327 ± 0.024
0.721GlyCys: 0.721 ± 0.196
6.131GlyAsp: 6.131 ± 1.365
2.524GlyGlu: 2.524 ± 1.584
5.049GlyPhe: 5.049 ± 0.427
4.688GlyGly: 4.688 ± 2.172
1.442GlyHis: 1.442 ± 0.807
3.606GlyIle: 3.606 ± 0.819
3.967GlyLys: 3.967 ± 0.421
4.688GlyLeu: 4.688 ± 0.374
1.082GlyMet: 1.082 ± 0.593
3.246GlyAsn: 3.246 ± 0.581
2.524GlyPro: 2.524 ± 1.584
2.524GlyGln: 2.524 ± 1.584
4.688GlyArg: 4.688 ± 0.374
6.491GlySer: 6.491 ± 0.635
3.246GlyThr: 3.246 ± 0.581
4.688GlyVal: 4.688 ± 0.226
1.082GlyTrp: 1.082 ± 1.193
2.164GlyTyr: 2.164 ± 0.012
0.0GlyXaa: 0.0 ± 0.0
His
1.082HisAla: 1.082 ± 0.006
0.361HisCys: 0.361 ± 0.202
0.361HisAsp: 0.361 ± 0.202
0.0HisGlu: 0.0 ± 0.0
1.082HisPhe: 1.082 ± 0.605
1.082HisGly: 1.082 ± 0.006
0.721HisHis: 0.721 ± 0.404
1.082HisIle: 1.082 ± 0.605
1.442HisLys: 1.442 ± 0.392
1.803HisLeu: 1.803 ± 1.009
2.524HisMet: 2.524 ± 0.214
0.0HisAsn: 0.0 ± 0.0
0.721HisPro: 0.721 ± 0.196
1.442HisGln: 1.442 ± 0.208
1.442HisArg: 1.442 ± 0.807
1.803HisSer: 1.803 ± 0.19
0.361HisThr: 0.361 ± 0.202
1.442HisVal: 1.442 ± 0.807
0.0HisTrp: 0.0 ± 0.0
1.442HisTyr: 1.442 ± 0.392
0.0HisXaa: 0.0 ± 0.0
Ile
3.967IleAla: 3.967 ± 0.421
0.361IleCys: 0.361 ± 0.202
4.688IleAsp: 4.688 ± 0.226
2.885IleGlu: 2.885 ± 0.783
3.967IlePhe: 3.967 ± 0.178
2.885IleGly: 2.885 ± 0.415
0.361IleHis: 0.361 ± 0.202
1.803IleIle: 1.803 ± 0.409
2.885IleLys: 2.885 ± 1.015
2.885IleLeu: 2.885 ± 0.415
0.721IleMet: 0.721 ± 0.404
1.442IleAsn: 1.442 ± 0.991
1.442IlePro: 1.442 ± 0.208
1.803IleGln: 1.803 ± 0.19
2.524IleArg: 2.524 ± 0.214
3.967IleSer: 3.967 ± 0.421
3.246IleThr: 3.246 ± 0.581
2.164IleVal: 2.164 ± 0.012
1.082IleTrp: 1.082 ± 0.605
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.246LysAla: 3.246 ± 0.018
1.082LysCys: 1.082 ± 0.605
1.442LysAsp: 1.442 ± 0.208
1.442LysGlu: 1.442 ± 0.807
2.524LysPhe: 2.524 ± 0.214
5.049LysGly: 5.049 ± 0.172
2.524LysHis: 2.524 ± 0.813
2.164LysIle: 2.164 ± 0.012
1.082LysLys: 1.082 ± 0.006
5.409LysLeu: 5.409 ± 0.629
1.082LysMet: 1.082 ± 0.593
1.803LysAsn: 1.803 ± 0.409
1.082LysPro: 1.082 ± 0.006
2.164LysGln: 2.164 ± 0.012
3.246LysArg: 3.246 ± 1.816
6.491LysSer: 6.491 ± 1.834
2.164LysThr: 2.164 ± 1.211
3.967LysVal: 3.967 ± 1.021
1.082LysTrp: 1.082 ± 0.593
3.606LysTyr: 3.606 ± 1.418
0.0LysXaa: 0.0 ± 0.0
Leu
5.77LeuAla: 5.77 ± 2.029
1.803LeuCys: 1.803 ± 1.009
5.049LeuAsp: 5.049 ± 0.427
4.327LeuGlu: 4.327 ± 0.623
4.327LeuPhe: 4.327 ± 0.024
5.77LeuGly: 5.77 ± 0.231
2.164LeuHis: 2.164 ± 0.587
2.164LeuIle: 2.164 ± 0.012
5.77LeuLys: 5.77 ± 2.629
5.049LeuLeu: 5.049 ± 0.427
0.721LeuMet: 0.721 ± 0.404
3.967LeuAsn: 3.967 ± 0.178
3.606LeuPro: 3.606 ± 0.38
1.803LeuGln: 1.803 ± 0.789
4.688LeuArg: 4.688 ± 0.825
8.655LeuSer: 8.655 ± 0.647
5.409LeuThr: 5.409 ± 1.169
4.327LeuVal: 4.327 ± 0.024
1.803LeuTrp: 1.803 ± 0.409
2.524LeuTyr: 2.524 ± 0.214
0.0LeuXaa: 0.0 ± 0.0
Met
1.803MetAla: 1.803 ± 0.409
0.0MetCys: 0.0 ± 0.0
1.803MetAsp: 1.803 ± 1.009
1.082MetGlu: 1.082 ± 0.006
0.361MetPhe: 0.361 ± 0.398
0.721MetGly: 0.721 ± 0.795
0.0MetHis: 0.0 ± 0.0
0.361MetIle: 0.361 ± 0.202
2.885MetLys: 2.885 ± 0.184
3.246MetLeu: 3.246 ± 0.617
1.803MetMet: 1.803 ± 0.409
0.721MetAsn: 0.721 ± 0.196
2.524MetPro: 2.524 ± 0.386
0.361MetGln: 0.361 ± 0.202
2.524MetArg: 2.524 ± 0.214
2.524MetSer: 2.524 ± 0.214
2.164MetThr: 2.164 ± 0.587
1.803MetVal: 1.803 ± 0.409
0.0MetTrp: 0.0 ± 0.0
1.082MetTyr: 1.082 ± 0.605
0.0MetXaa: 0.0 ± 0.0
Asn
3.246AsnAla: 3.246 ± 2.379
0.0AsnCys: 0.0 ± 0.0
1.082AsnAsp: 1.082 ± 0.006
1.442AsnGlu: 1.442 ± 0.208
1.803AsnPhe: 1.803 ± 0.789
3.967AsnGly: 3.967 ± 1.377
1.082AsnHis: 1.082 ± 0.605
1.803AsnIle: 1.803 ± 0.409
2.164AsnLys: 2.164 ± 0.587
4.688AsnLeu: 4.688 ± 0.973
1.442AsnMet: 1.442 ± 0.208
1.803AsnAsn: 1.803 ± 0.19
4.688AsnPro: 4.688 ± 0.973
1.442AsnGln: 1.442 ± 0.392
3.246AsnArg: 3.246 ± 1.216
5.049AsnSer: 5.049 ± 0.771
1.803AsnThr: 1.803 ± 0.409
3.606AsnVal: 3.606 ± 0.38
0.361AsnTrp: 0.361 ± 0.202
3.246AsnTyr: 3.246 ± 1.181
0.0AsnXaa: 0.0 ± 0.0
Pro
2.164ProAla: 2.164 ± 0.611
1.442ProCys: 1.442 ± 0.991
3.246ProAsp: 3.246 ± 0.581
2.524ProGlu: 2.524 ± 0.985
2.164ProPhe: 2.164 ± 1.786
2.164ProGly: 2.164 ± 1.187
0.361ProHis: 0.361 ± 0.202
1.442ProIle: 1.442 ± 0.208
2.164ProLys: 2.164 ± 1.211
4.327ProLeu: 4.327 ± 0.576
0.721ProMet: 0.721 ± 0.196
2.885ProAsn: 2.885 ± 1.383
1.803ProPro: 1.803 ± 1.988
3.606ProGln: 3.606 ± 0.38
0.721ProArg: 0.721 ± 0.196
3.967ProSer: 3.967 ± 0.178
2.885ProThr: 2.885 ± 1.383
5.409ProVal: 5.409 ± 0.57
2.164ProTrp: 2.164 ± 0.611
0.721ProTyr: 0.721 ± 0.795
0.0ProXaa: 0.0 ± 0.0
Gln
4.327GlnAla: 4.327 ± 1.175
0.361GlnCys: 0.361 ± 0.202
2.524GlnAsp: 2.524 ± 0.214
1.803GlnGlu: 1.803 ± 0.19
1.803GlnPhe: 1.803 ± 1.009
1.803GlnGly: 1.803 ± 0.789
0.0GlnHis: 0.0 ± 0.0
2.524GlnIle: 2.524 ± 0.985
1.082GlnLys: 1.082 ± 0.593
2.524GlnLeu: 2.524 ± 0.214
2.164GlnMet: 2.164 ± 0.012
2.164GlnAsn: 2.164 ± 0.587
0.721GlnPro: 0.721 ± 0.196
0.721GlnGln: 0.721 ± 0.196
1.442GlnArg: 1.442 ± 0.991
4.327GlnSer: 4.327 ± 0.024
2.164GlnThr: 2.164 ± 1.187
1.803GlnVal: 1.803 ± 0.19
0.361GlnTrp: 0.361 ± 0.202
0.361GlnTyr: 0.361 ± 0.398
0.0GlnXaa: 0.0 ± 0.0
Arg
3.606ArgAla: 3.606 ± 0.819
1.082ArgCys: 1.082 ± 0.593
3.246ArgAsp: 3.246 ± 0.617
4.327ArgGlu: 4.327 ± 1.222
3.967ArgPhe: 3.967 ± 0.421
1.803ArgGly: 1.803 ± 0.409
0.361ArgHis: 0.361 ± 0.398
1.442ArgIle: 1.442 ± 0.392
3.246ArgLys: 3.246 ± 1.216
3.246ArgLeu: 3.246 ± 0.018
1.082ArgMet: 1.082 ± 0.605
1.803ArgAsn: 1.803 ± 0.19
3.967ArgPro: 3.967 ± 1.976
0.721ArgGln: 0.721 ± 0.404
2.164ArgArg: 2.164 ± 0.611
2.885ArgSer: 2.885 ± 1.015
1.803ArgThr: 1.803 ± 0.789
3.967ArgVal: 3.967 ± 0.421
1.803ArgTrp: 1.803 ± 0.409
2.164ArgTyr: 2.164 ± 0.012
0.0ArgXaa: 0.0 ± 0.0
Ser
6.131SerAla: 6.131 ± 1.365
1.442SerCys: 1.442 ± 0.208
6.852SerAsp: 6.852 ± 0.362
2.164SerGlu: 2.164 ± 0.611
6.131SerPhe: 6.131 ± 1.033
8.294SerGly: 8.294 ± 2.551
0.721SerHis: 0.721 ± 0.196
6.491SerIle: 6.491 ± 0.635
5.049SerLys: 5.049 ± 1.027
6.852SerLeu: 6.852 ± 0.237
1.442SerMet: 1.442 ± 0.622
4.327SerAsn: 4.327 ± 0.576
3.246SerPro: 3.246 ± 0.018
3.606SerGln: 3.606 ± 0.38
2.524SerArg: 2.524 ± 0.985
5.77SerSer: 5.77 ± 0.831
4.688SerThr: 4.688 ± 0.374
6.852SerVal: 6.852 ± 1.436
0.721SerTrp: 0.721 ± 0.196
3.967SerTyr: 3.967 ± 0.777
0.0SerXaa: 0.0 ± 0.0
Thr
1.442ThrAla: 1.442 ± 0.392
0.721ThrCys: 0.721 ± 0.404
2.524ThrAsp: 2.524 ± 0.386
3.246ThrGlu: 3.246 ± 0.581
3.606ThrPhe: 3.606 ± 0.819
3.967ThrGly: 3.967 ± 0.777
1.442ThrHis: 1.442 ± 0.208
1.442ThrIle: 1.442 ± 0.392
1.803ThrLys: 1.803 ± 0.19
3.967ThrLeu: 3.967 ± 0.178
1.082ThrMet: 1.082 ± 0.593
3.967ThrAsn: 3.967 ± 0.777
3.246ThrPro: 3.246 ± 1.181
1.803ThrGln: 1.803 ± 0.409
3.606ThrArg: 3.606 ± 1.578
4.688ThrSer: 4.688 ± 2.172
4.327ThrThr: 4.327 ± 2.373
4.327ThrVal: 4.327 ± 0.024
0.721ThrTrp: 0.721 ± 0.404
2.164ThrTyr: 2.164 ± 0.587
0.0ThrXaa: 0.0 ± 0.0
Val
7.212ValAla: 7.212 ± 0.16
0.721ValCys: 0.721 ± 0.404
4.327ValAsp: 4.327 ± 1.774
4.688ValGlu: 4.688 ± 1.424
4.327ValPhe: 4.327 ± 0.623
4.327ValGly: 4.327 ± 0.024
1.803ValHis: 1.803 ± 1.009
3.967ValIle: 3.967 ± 0.178
2.164ValLys: 2.164 ± 0.587
3.246ValLeu: 3.246 ± 0.018
2.164ValMet: 2.164 ± 1.211
5.049ValAsn: 5.049 ± 0.427
3.606ValPro: 3.606 ± 0.979
3.606ValGln: 3.606 ± 0.22
3.606ValArg: 3.606 ± 1.418
8.655ValSer: 8.655 ± 1.246
4.688ValThr: 4.688 ± 0.825
6.852ValVal: 6.852 ± 0.237
1.082ValTrp: 1.082 ± 0.006
1.442ValTyr: 1.442 ± 0.392
0.0ValXaa: 0.0 ± 0.0
Trp
1.442TrpAla: 1.442 ± 0.208
0.721TrpCys: 0.721 ± 0.404
0.361TrpAsp: 0.361 ± 0.398
0.721TrpGlu: 0.721 ± 0.404
1.082TrpPhe: 1.082 ± 0.605
1.082TrpGly: 1.082 ± 0.593
1.082TrpHis: 1.082 ± 0.605
0.361TrpIle: 0.361 ± 0.202
1.442TrpLys: 1.442 ± 0.807
0.721TrpLeu: 0.721 ± 0.404
1.082TrpMet: 1.082 ± 0.593
0.721TrpAsn: 0.721 ± 0.196
0.0TrpPro: 0.0 ± 0.0
0.721TrpGln: 0.721 ± 0.795
1.082TrpArg: 1.082 ± 0.006
0.721TrpSer: 0.721 ± 0.196
1.803TrpThr: 1.803 ± 0.409
0.361TrpVal: 0.361 ± 0.398
0.0TrpTrp: 0.0 ± 0.0
0.721TrpTyr: 0.721 ± 0.404
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.246TyrAla: 3.246 ± 0.617
0.0TyrCys: 0.0 ± 0.0
2.524TyrAsp: 2.524 ± 0.386
2.885TyrGlu: 2.885 ± 0.415
2.164TyrPhe: 2.164 ± 0.587
2.885TyrGly: 2.885 ± 0.783
1.082TyrHis: 1.082 ± 0.605
1.082TyrIle: 1.082 ± 0.006
1.082TyrLys: 1.082 ± 0.605
3.246TyrLeu: 3.246 ± 1.216
0.361TyrMet: 0.361 ± 0.398
1.803TyrAsn: 1.803 ± 0.789
0.361TyrPro: 0.361 ± 0.202
1.803TyrGln: 1.803 ± 0.789
0.0TyrArg: 0.0 ± 0.0
3.606TyrSer: 3.606 ± 0.38
0.721TyrThr: 0.721 ± 0.196
5.049TyrVal: 5.049 ± 0.427
0.0TyrTrp: 0.0 ± 0.0
1.803TyrTyr: 1.803 ± 0.19
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2774 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski