Amino acid dipeptide frequency for Gumbo Limbo virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
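A minimal Python sketch of how such a frequency could be computed is given here for illustration only. It assumes one-letter sequences, counts overlapping adjacent pairs within each protein, and divides by the total number of pairs. The function name `dipeptide_frequencies` and the normalisation are assumptions, not necessarily the procedure used to build the table on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Hypothetical helper: pairs are counted inside each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every symbol outside the 20 standard letters to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }
```

Calling `dipeptide_frequencies(["MKLLA", "MKT"])` returns a dictionary with 441 entries whose values sum to 1000 ‰.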

There are 400 possible dipeptides (441 if we treat Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
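The arithmetic above can be sketched in a few lines of Python. The names `EXPECTED_PER_MILLE`, `per_mille_to_fraction` and `classify` are hypothetical, and the simple greater-than or less-than comparison is an assumption: the page does not state whether the error estimate is taken into account when colouring a cell.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"
```

Under this rule the LeuLeu value from the table (11.592 ‰) is classified as over-represented and the AlaAla value (0.773 ‰) as under-represented.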

For more information, see the sequence space article on Wikipedia.

Columns (second amino acid): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.773 ± 0.324
AlaCys: 0.386 ± 0.162
AlaAsp: 1.546 ± 2.262
AlaGlu: 1.932 ± 0.773
AlaPhe: 2.705 ± 1.134
AlaGly: 0.386 ± 1.269
AlaHis: 0.773 ± 0.324
AlaIle: 5.796 ± 1.317
AlaLys: 5.41 ± 5.126
AlaLeu: 4.637 ± 1.943
AlaMet: 0.386 ± 0.162
AlaAsn: 4.637 ± 3.998
AlaPro: 0.386 ± 0.162
AlaGln: 2.705 ± 1.875
AlaArg: 1.932 ± 0.773
AlaSer: 1.546 ± 0.879
AlaThr: 1.546 ± 0.648
AlaVal: 1.159 ± 1.0
AlaTrp: 0.773 ± 1.131
AlaTyr: 1.546 ± 0.879
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.773 ± 0.324
CysCys: 0.773 ± 1.626
CysAsp: 1.159 ± 0.486
CysGlu: 0.0 ± 0.0
CysPhe: 0.773 ± 0.324
CysGly: 1.546 ± 1.486
CysHis: 0.0 ± 0.0
CysIle: 1.159 ± 0.486
CysLys: 0.773 ± 0.324
CysLeu: 2.318 ± 1.407
CysMet: 0.773 ± 0.324
CysAsn: 0.386 ± 0.162
CysPro: 0.773 ± 0.324
CysGln: 0.773 ± 1.626
CysArg: 0.0 ± 0.0
CysSer: 1.932 ± 0.81
CysThr: 1.159 ± 0.486
CysVal: 0.773 ± 0.324
CysTrp: 0.386 ± 0.162
CysTyr: 0.386 ± 0.162
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.159 ± 1.0
AspCys: 1.159 ± 0.486
AspAsp: 3.091 ± 0.623
AspGlu: 3.091 ± 1.296
AspPhe: 3.478 ± 1.458
AspGly: 2.318 ± 0.689
AspHis: 0.773 ± 1.131
AspIle: 6.569 ± 2.753
AspLys: 5.023 ± 0.838
AspLeu: 5.796 ± 2.429
AspMet: 1.546 ± 0.648
AspAsn: 2.705 ± 1.134
AspPro: 2.705 ± 1.875
AspGln: 2.705 ± 1.134
AspArg: 3.864 ± 1.62
AspSer: 1.932 ± 0.773
AspThr: 4.637 ± 1.943
AspVal: 2.318 ± 0.972
AspTrp: 0.0 ± 0.0
AspTyr: 2.705 ± 1.134
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.318 ± 0.689
GluCys: 0.386 ± 0.162
GluAsp: 3.091 ± 1.296
GluGlu: 6.182 ± 2.226
GluPhe: 6.569 ± 3.402
GluGly: 1.159 ± 0.486
GluHis: 0.773 ± 0.324
GluIle: 8.501 ± 2.377
GluLys: 5.796 ± 2.429
GluLeu: 6.955 ± 3.294
GluMet: 3.864 ± 1.545
GluAsn: 2.318 ± 1.999
GluPro: 1.546 ± 0.648
GluGln: 1.546 ± 0.648
GluArg: 5.023 ± 1.317
GluSer: 4.637 ± 0.913
GluThr: 4.25 ± 1.526
GluVal: 2.318 ± 1.407
GluTrp: 0.386 ± 0.162
GluTyr: 1.932 ± 0.81
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.773 ± 0.324
PheCys: 3.478 ± 1.458
PheAsp: 2.705 ± 0.636
PheGlu: 4.637 ± 1.378
PhePhe: 3.478 ± 1.458
PheGly: 1.932 ± 2.128
PheHis: 1.546 ± 0.648
PheIle: 3.478 ± 1.647
PheLys: 6.182 ± 2.591
PheLeu: 7.728 ± 3.091
PheMet: 1.159 ± 0.459
PheAsn: 2.705 ± 1.134
PhePro: 0.773 ± 0.324
PheGln: 1.932 ± 1.438
PheArg: 2.318 ± 0.972
PheSer: 5.41 ± 1.177
PheThr: 2.318 ± 0.972
PheVal: 1.932 ± 1.438
PheTrp: 0.386 ± 0.162
PheTyr: 1.159 ± 0.486
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.159 ± 1.0
GlyCys: 0.773 ± 0.324
GlyAsp: 3.478 ± 1.426
GlyGlu: 5.41 ± 1.273
GlyPhe: 1.932 ± 0.773
GlyGly: 0.0 ± 0.0
GlyHis: 0.0 ± 0.0
GlyIle: 2.705 ± 1.875
GlyLys: 1.546 ± 0.648
GlyLeu: 3.478 ± 1.241
GlyMet: 0.773 ± 0.324
GlyAsn: 3.478 ± 1.458
GlyPro: 1.159 ± 0.486
GlyGln: 1.159 ± 1.0
GlyArg: 2.318 ± 4.879
GlySer: 1.159 ± 1.0
GlyThr: 1.159 ± 2.399
GlyVal: 1.546 ± 0.879
GlyTrp: 0.773 ± 0.324
GlyTyr: 1.159 ± 0.486
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.773 ± 0.324
HisCys: 0.773 ± 0.324
HisAsp: 0.386 ± 0.162
HisGlu: 0.386 ± 1.269
HisPhe: 1.932 ± 0.81
HisGly: 1.159 ± 0.486
HisHis: 0.386 ± 0.162
HisIle: 1.546 ± 0.648
HisLys: 1.159 ± 0.486
HisLeu: 1.932 ± 1.438
HisMet: 0.386 ± 0.162
HisAsn: 3.091 ± 1.296
HisPro: 0.0 ± 0.0
HisGln: 0.386 ± 0.162
HisArg: 1.159 ± 3.215
HisSer: 1.159 ± 0.486
HisThr: 0.0 ± 0.0
HisVal: 0.773 ± 0.324
HisTrp: 1.159 ± 1.549
HisTyr: 0.773 ± 1.131
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.41 ± 1.273
IleCys: 0.386 ± 0.162
IleAsp: 6.569 ± 1.263
IleGlu: 5.41 ± 1.177
IlePhe: 1.932 ± 0.81
IleGly: 4.25 ± 1.781
IleHis: 1.932 ± 0.81
IleIle: 7.728 ± 3.239
IleLys: 11.206 ± 4.697
IleLeu: 9.274 ± 3.198
IleMet: 2.318 ± 0.972
IleAsn: 4.637 ± 1.943
IlePro: 3.091 ± 1.296
IleGln: 3.864 ± 1.119
IleArg: 4.637 ± 0.918
IleSer: 6.569 ± 2.142
IleThr: 6.182 ± 4.277
IleVal: 4.25 ± 0.806
IleTrp: 0.386 ± 0.162
IleTyr: 3.091 ± 1.757
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.478 ± 1.458
LysCys: 1.546 ± 0.648
LysAsp: 6.182 ± 2.591
LysGlu: 6.955 ± 2.067
LysPhe: 4.637 ± 0.918
LysGly: 4.637 ± 0.918
LysHis: 2.318 ± 0.972
LysIle: 5.41 ± 2.267
LysLys: 5.796 ± 1.317
LysLeu: 7.342 ± 1.356
LysMet: 3.478 ± 0.783
LysAsn: 5.023 ± 1.043
LysPro: 2.318 ± 0.689
LysGln: 2.318 ± 1.999
LysArg: 3.478 ± 1.458
LysSer: 5.023 ± 1.043
LysThr: 3.864 ± 1.62
LysVal: 5.023 ± 4.271
LysTrp: 1.159 ± 1.0
LysTyr: 2.318 ± 0.689
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.091 ± 1.757
LeuCys: 1.159 ± 1.549
LeuAsp: 6.569 ± 2.753
LeuGlu: 6.569 ± 0.956
LeuPhe: 5.41 ± 1.177
LeuGly: 1.932 ± 3.172
LeuHis: 2.705 ± 1.875
LeuIle: 7.342 ± 2.227
LeuLys: 6.182 ± 2.226
LeuLeu: 11.592 ± 6.974
LeuMet: 1.546 ± 1.486
LeuAsn: 8.501 ± 3.053
LeuPro: 3.478 ± 1.647
LeuGln: 3.091 ± 1.296
LeuArg: 3.478 ± 2.999
LeuSer: 10.433 ± 5.164
LeuThr: 6.569 ± 0.956
LeuVal: 4.25 ± 0.806
LeuTrp: 0.773 ± 0.324
LeuTyr: 3.478 ± 0.65
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.932 ± 0.81
MetCys: 0.386 ± 0.162
MetAsp: 1.546 ± 0.648
MetGlu: 1.546 ± 0.648
MetPhe: 1.546 ± 0.879
MetGly: 1.159 ± 0.486
MetHis: 0.386 ± 0.162
MetIle: 1.932 ± 0.773
MetLys: 3.864 ± 1.545
MetLeu: 3.091 ± 0.623
MetMet: 1.932 ± 0.81
MetAsn: 1.932 ± 0.81
MetPro: 1.546 ± 0.648
MetGln: 0.0 ± 0.0
MetArg: 2.318 ± 1.407
MetSer: 2.318 ± 1.649
MetThr: 1.932 ± 1.438
MetVal: 1.546 ± 1.486
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.478 ± 1.458
AsnCys: 0.0 ± 0.0
AsnAsp: 6.182 ± 1.462
AsnGlu: 5.796 ± 1.317
AsnPhe: 2.705 ± 0.636
AsnGly: 1.546 ± 1.486
AsnHis: 2.705 ± 0.636
AsnIle: 7.728 ± 2.067
AsnLys: 2.318 ± 0.972
AsnLeu: 4.637 ± 0.918
AsnMet: 3.091 ± 1.184
AsnAsn: 5.41 ± 1.177
AsnPro: 2.705 ± 0.636
AsnGln: 2.318 ± 0.972
AsnArg: 2.705 ± 3.276
AsnSer: 3.864 ± 1.119
AsnThr: 3.864 ± 1.119
AsnVal: 0.773 ± 0.324
AsnTrp: 1.546 ± 0.648
AsnTyr: 1.932 ± 0.81
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.318 ± 0.689
ProCys: 0.386 ± 0.162
ProAsp: 2.705 ± 0.636
ProGlu: 3.091 ± 0.623
ProPhe: 0.773 ± 0.324
ProGly: 2.318 ± 1.999
ProHis: 0.0 ± 0.0
ProIle: 3.091 ± 0.623
ProLys: 3.091 ± 0.623
ProLeu: 2.705 ± 3.258
ProMet: 0.386 ± 0.162
ProAsn: 1.546 ± 0.648
ProPro: 0.386 ± 0.162
ProGln: 0.386 ± 0.162
ProArg: 0.773 ± 0.324
ProSer: 3.091 ± 1.296
ProThr: 1.546 ± 0.648
ProVal: 1.159 ± 0.486
ProTrp: 0.773 ± 0.324
ProTyr: 0.773 ± 0.324
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.478 ± 1.647
GlnCys: 0.386 ± 0.162
GlnAsp: 1.932 ± 0.81
GlnGlu: 1.546 ± 0.648
GlnPhe: 2.318 ± 0.972
GlnGly: 0.773 ± 1.131
GlnHis: 0.0 ± 0.0
GlnIle: 4.25 ± 1.526
GlnLys: 2.318 ± 1.999
GlnLeu: 2.318 ± 1.407
GlnMet: 1.546 ± 0.648
GlnAsn: 1.159 ± 0.486
GlnPro: 0.386 ± 0.162
GlnGln: 1.932 ± 1.794
GlnArg: 2.318 ± 1.407
GlnSer: 3.091 ± 1.372
GlnThr: 2.705 ± 1.395
GlnVal: 1.159 ± 0.486
GlnTrp: 0.386 ± 1.715
GlnTyr: 1.932 ± 0.773
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.773 ± 2.538
ArgCys: 0.773 ± 0.324
ArgAsp: 2.705 ± 1.134
ArgGlu: 3.091 ± 0.623
ArgPhe: 1.546 ± 0.648
ArgGly: 1.546 ± 2.262
ArgHis: 0.386 ± 0.162
ArgIle: 5.41 ± 1.438
ArgLys: 3.478 ± 1.647
ArgLeu: 5.796 ± 1.885
ArgMet: 1.159 ± 1.549
ArgAsn: 4.25 ± 2.841
ArgPro: 0.773 ± 1.131
ArgGln: 2.705 ± 1.508
ArgArg: 1.546 ± 1.486
ArgSer: 3.864 ± 4.579
ArgThr: 2.705 ± 3.031
ArgVal: 1.932 ± 1.794
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.318 ± 0.972
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.091 ± 4.523
SerCys: 1.546 ± 1.486
SerAsp: 3.091 ± 1.296
SerGlu: 5.41 ± 1.78
SerPhe: 3.864 ± 2.929
SerGly: 2.705 ± 0.636
SerHis: 2.318 ± 1.407
SerIle: 6.955 ± 2.241
SerLys: 7.342 ± 1.913
SerLeu: 8.114 ± 5.556
SerMet: 1.932 ± 1.794
SerAsn: 2.705 ± 0.636
SerPro: 2.318 ± 0.689
SerGln: 1.546 ± 0.648
SerArg: 2.705 ± 0.636
SerSer: 5.41 ± 4.115
SerThr: 5.41 ± 2.418
SerVal: 3.864 ± 0.714
SerTrp: 1.159 ± 1.0
SerTyr: 2.318 ± 0.972
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.705 ± 1.134
ThrCys: 0.773 ± 1.626
ThrAsp: 1.159 ± 0.486
ThrGlu: 3.478 ± 1.241
ThrPhe: 3.864 ± 0.714
ThrGly: 1.932 ± 1.438
ThrHis: 0.773 ± 1.626
ThrIle: 5.41 ± 1.78
ThrLys: 5.41 ± 2.267
ThrLeu: 3.864 ± 3.588
ThrMet: 0.773 ± 0.324
ThrAsn: 3.478 ± 1.458
ThrPro: 3.091 ± 0.623
ThrGln: 2.705 ± 3.031
ThrArg: 1.932 ± 0.81
ThrSer: 5.796 ± 1.317
ThrThr: 3.478 ± 1.426
ThrVal: 4.25 ± 1.008
ThrTrp: 1.159 ± 1.0
ThrTyr: 2.705 ± 1.134
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.932 ± 3.529
ValCys: 1.159 ± 0.486
ValAsp: 1.932 ± 0.81
ValGlu: 1.546 ± 0.879
ValPhe: 3.864 ± 0.714
ValGly: 1.546 ± 0.648
ValHis: 1.159 ± 1.549
ValIle: 1.932 ± 0.81
ValLys: 1.932 ± 0.81
ValLeu: 3.864 ± 1.119
ValMet: 1.546 ± 0.879
ValAsn: 4.637 ± 3.299
ValPro: 2.318 ± 0.972
ValGln: 1.546 ± 0.648
ValArg: 1.546 ± 3.253
ValSer: 3.478 ± 0.65
ValThr: 2.705 ± 1.134
ValVal: 1.932 ± 0.773
ValTrp: 0.0 ± 0.0
ValTyr: 1.932 ± 0.773
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.386 ± 0.162
TrpCys: 0.0 ± 0.0
TrpAsp: 0.773 ± 0.324
TrpGlu: 0.773 ± 0.324
TrpPhe: 0.773 ± 0.324
TrpGly: 0.773 ± 1.131
TrpHis: 0.0 ± 0.0
TrpIle: 0.773 ± 0.324
TrpLys: 0.386 ± 1.269
TrpLeu: 0.773 ± 0.324
TrpMet: 0.386 ± 1.269
TrpAsn: 0.773 ± 0.324
TrpPro: 0.0 ± 0.0
TrpGln: 1.159 ± 0.486
TrpArg: 1.159 ± 3.339
TrpSer: 1.932 ± 0.773
TrpThr: 0.0 ± 0.0
TrpVal: 0.773 ± 0.324
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.386 ± 0.162
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.159 ± 2.399
TyrCys: 0.386 ± 0.162
TyrAsp: 0.773 ± 0.324
TyrGlu: 2.705 ± 1.134
TyrPhe: 1.932 ± 0.81
TyrGly: 1.546 ± 0.648
TyrHis: 0.386 ± 0.162
TyrIle: 5.41 ± 1.177
TyrLys: 3.091 ± 0.623
TyrLeu: 1.932 ± 2.128
TyrMet: 1.546 ± 0.648
TyrAsn: 2.318 ± 0.972
TyrPro: 1.546 ± 0.879
TyrGln: 1.159 ± 0.486
TyrArg: 1.546 ± 0.879
TyrSer: 1.159 ± 0.486
TyrThr: 2.705 ± 1.134
TyrVal: 0.773 ± 0.324
TyrTrp: 0.773 ± 0.324
TyrTyr: 1.159 ± 0.486
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2589 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
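A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the sample standard deviation over 100 resamples is reported as the error. The names `bootstrap_error` and `frequency_per_mille`, the fixed seed and the use of the standard deviation are all hypothetical choices.

```python
import random
from collections import Counter
from statistics import stdev


def frequency_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_per_mille(sample, dipeptide))
    return stdev(estimates)
```

With only 3 proteins there are few distinct resamples, so an error estimated this way can be comparable to, or larger than, the frequency itself.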

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski