Amino acid dipepetide frequency for Culex negev-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.085AlaAla: 6.085 ± 2.085
0.913AlaCys: 0.913 ± 0.439
3.651AlaAsp: 3.651 ± 1.87
2.738AlaGlu: 2.738 ± 1.317
1.825AlaPhe: 1.825 ± 0.841
2.13AlaGly: 2.13 ± 0.466
2.13AlaHis: 2.13 ± 0.466
4.868AlaIle: 4.868 ± 1.437
3.042AlaLys: 3.042 ± 2.831
8.214AlaLeu: 8.214 ± 2.043
3.042AlaMet: 3.042 ± 1.252
2.13AlaAsn: 2.13 ± 0.657
3.042AlaPro: 3.042 ± 2.143
1.825AlaGln: 1.825 ± 0.878
2.738AlaArg: 2.738 ± 1.317
1.825AlaSer: 1.825 ± 0.841
4.868AlaThr: 4.868 ± 0.814
5.476AlaVal: 5.476 ± 0.95
0.304AlaTrp: 0.304 ± 0.146
2.434AlaTyr: 2.434 ± 0.527
0.0AlaXaa: 0.0 ± 0.0
Cys
0.913CysAla: 0.913 ± 0.635
0.0CysCys: 0.0 ± 0.0
1.521CysAsp: 1.521 ± 0.474
0.913CysGlu: 0.913 ± 0.439
1.521CysPhe: 1.521 ± 0.474
0.304CysGly: 0.304 ± 0.146
0.913CysHis: 0.913 ± 0.635
0.913CysIle: 0.913 ± 0.635
0.608CysLys: 0.608 ± 0.293
1.521CysLeu: 1.521 ± 0.732
0.608CysMet: 0.608 ± 0.746
1.521CysAsn: 1.521 ± 0.474
1.521CysPro: 1.521 ± 0.474
0.608CysGln: 0.608 ± 0.293
0.913CysArg: 0.913 ± 0.635
2.434CysSer: 2.434 ± 1.082
1.521CysThr: 1.521 ± 0.474
1.825CysVal: 1.825 ± 0.446
0.0CysTrp: 0.0 ± 0.0
1.217CysTyr: 1.217 ± 0.66
0.0CysXaa: 0.0 ± 0.0
Asp
4.868AspAla: 4.868 ± 0.814
2.13AspCys: 2.13 ± 0.466
4.259AspAsp: 4.259 ± 1.397
5.476AspGlu: 5.476 ± 0.275
2.434AspPhe: 2.434 ± 1.171
2.738AspGly: 2.738 ± 0.802
0.913AspHis: 0.913 ± 0.439
4.868AspIle: 4.868 ± 1.518
6.693AspLys: 6.693 ± 1.066
4.868AspLeu: 4.868 ± 1.518
1.521AspMet: 1.521 ± 0.732
2.738AspAsn: 2.738 ± 0.422
2.738AspPro: 2.738 ± 0.422
2.13AspGln: 2.13 ± 1.171
3.347AspArg: 3.347 ± 1.61
6.389AspSer: 6.389 ± 0.826
7.301AspThr: 7.301 ± 1.448
3.651AspVal: 3.651 ± 1.756
0.304AspTrp: 0.304 ± 0.146
3.042AspTyr: 3.042 ± 0.726
0.0AspXaa: 0.0 ± 0.0
Glu
2.434GluAla: 2.434 ± 0.527
0.913GluCys: 0.913 ± 0.439
2.738GluAsp: 2.738 ± 1.317
1.521GluGlu: 1.521 ± 0.732
1.217GluPhe: 1.217 ± 1.493
1.217GluGly: 1.217 ± 0.585
2.13GluHis: 2.13 ± 0.657
3.347GluIle: 3.347 ± 1.013
2.738GluLys: 2.738 ± 1.317
3.651GluLeu: 3.651 ± 0.974
2.434GluMet: 2.434 ± 0.719
1.825GluAsn: 1.825 ± 0.878
2.13GluPro: 2.13 ± 0.657
2.13GluGln: 2.13 ± 1.025
2.434GluArg: 2.434 ± 0.719
1.825GluSer: 1.825 ± 0.625
1.217GluThr: 1.217 ± 0.585
1.521GluVal: 1.521 ± 0.985
0.0GluTrp: 0.0 ± 0.0
3.651GluTyr: 3.651 ± 1.756
0.0GluXaa: 0.0 ± 0.0
Phe
3.651PheAla: 3.651 ± 3.129
1.521PheCys: 1.521 ± 0.732
5.172PheAsp: 5.172 ± 1.138
1.217PheGlu: 1.217 ± 0.585
2.13PhePhe: 2.13 ± 2.698
1.217PheGly: 1.217 ± 0.585
0.913PheHis: 0.913 ± 1.613
2.13PheIle: 2.13 ± 1.686
4.563PheLys: 4.563 ± 2.196
4.259PheLeu: 4.259 ± 0.607
0.608PheMet: 0.608 ± 0.262
1.825PheAsn: 1.825 ± 0.878
2.434PhePro: 2.434 ± 2.012
0.913PheGln: 0.913 ± 0.723
3.042PheArg: 3.042 ± 1.804
5.172PheSer: 5.172 ± 1.807
4.868PheThr: 4.868 ± 2.81
3.042PheVal: 3.042 ± 2.143
0.304PheTrp: 0.304 ± 0.146
1.521PheTyr: 1.521 ± 0.985
0.0PheXaa: 0.0 ± 0.0
Gly
2.13GlyAla: 2.13 ± 0.466
0.913GlyCys: 0.913 ± 0.439
3.955GlyAsp: 3.955 ± 1.257
0.913GlyGlu: 0.913 ± 0.439
1.825GlyPhe: 1.825 ± 0.878
2.738GlyGly: 2.738 ± 0.802
0.0GlyHis: 0.0 ± 0.0
2.434GlyIle: 2.434 ± 1.171
3.347GlyLys: 3.347 ± 1.61
1.825GlyLeu: 1.825 ± 0.625
0.608GlyMet: 0.608 ± 0.293
3.042GlyAsn: 3.042 ± 1.464
0.608GlyPro: 0.608 ± 0.808
0.913GlyGln: 0.913 ± 0.723
2.13GlyArg: 2.13 ± 1.171
3.042GlySer: 3.042 ± 0.902
1.521GlyThr: 1.521 ± 0.626
3.042GlyVal: 3.042 ± 0.295
0.304GlyTrp: 0.304 ± 0.146
2.738GlyTyr: 2.738 ± 1.006
0.0GlyXaa: 0.0 ± 0.0
His
2.13HisAla: 2.13 ± 0.466
0.913HisCys: 0.913 ± 0.439
1.521HisAsp: 1.521 ± 0.474
0.608HisGlu: 0.608 ± 0.293
0.913HisPhe: 0.913 ± 0.439
1.825HisGly: 1.825 ± 0.625
0.608HisHis: 0.608 ± 0.808
1.521HisIle: 1.521 ± 0.732
1.521HisLys: 1.521 ± 0.732
1.217HisLeu: 1.217 ± 0.66
0.913HisMet: 0.913 ± 0.635
1.825HisAsn: 1.825 ± 1.27
1.521HisPro: 1.521 ± 0.985
0.913HisGln: 0.913 ± 0.439
0.0HisArg: 0.0 ± 0.0
1.825HisSer: 1.825 ± 0.878
2.434HisThr: 2.434 ± 0.719
3.042HisVal: 3.042 ± 1.16
0.0HisTrp: 0.0 ± 0.0
1.521HisTyr: 1.521 ± 0.732
0.0HisXaa: 0.0 ± 0.0
Ile
5.476IleAla: 5.476 ± 1.575
0.304IleCys: 0.304 ± 0.146
7.301IleAsp: 7.301 ± 2.265
2.434IleGlu: 2.434 ± 1.171
4.259IlePhe: 4.259 ± 2.913
3.651IleGly: 3.651 ± 0.974
1.825IleHis: 1.825 ± 0.446
5.172IleIle: 5.172 ± 0.797
3.042IleLys: 3.042 ± 0.902
5.172IleLeu: 5.172 ± 0.807
1.521IleMet: 1.521 ± 0.626
2.434IleAsn: 2.434 ± 1.171
3.042IlePro: 3.042 ± 0.948
0.608IleGln: 0.608 ± 0.293
3.347IleArg: 3.347 ± 2.006
3.347IleSer: 3.347 ± 1.202
5.476IleThr: 5.476 ± 2.528
4.259IleVal: 4.259 ± 2.6
0.608IleTrp: 0.608 ± 0.293
2.13IleTyr: 2.13 ± 1.171
0.0IleXaa: 0.0 ± 0.0
Lys
3.955LysAla: 3.955 ± 1.106
1.521LysCys: 1.521 ± 0.732
6.085LysAsp: 6.085 ± 0.431
2.13LysGlu: 2.13 ± 0.657
4.868LysPhe: 4.868 ± 1.348
2.738LysGly: 2.738 ± 1.317
0.913LysHis: 0.913 ± 0.439
4.868LysIle: 4.868 ± 1.653
3.955LysLys: 3.955 ± 1.903
6.085LysLeu: 6.085 ± 2.207
1.217LysMet: 1.217 ± 0.541
3.042LysAsn: 3.042 ± 0.295
3.347LysPro: 3.347 ± 1.243
3.042LysGln: 3.042 ± 1.464
2.13LysArg: 2.13 ± 1.025
4.259LysSer: 4.259 ± 2.049
5.78LysThr: 5.78 ± 2.067
3.347LysVal: 3.347 ± 0.909
0.0LysTrp: 0.0 ± 0.0
3.042LysTyr: 3.042 ± 1.315
0.0LysXaa: 0.0 ± 0.0
Leu
6.085LeuAla: 6.085 ± 1.471
1.825LeuCys: 1.825 ± 1.27
5.172LeuAsp: 5.172 ± 0.29
4.868LeuGlu: 4.868 ± 0.814
4.563LeuPhe: 4.563 ± 1.256
3.042LeuGly: 3.042 ± 0.902
2.434LeuHis: 2.434 ± 0.719
3.955LeuIle: 3.955 ± 3.019
7.91LeuLys: 7.91 ± 2.211
5.78LeuLeu: 5.78 ± 0.713
2.738LeuMet: 2.738 ± 1.317
4.563LeuAsn: 4.563 ± 0.984
3.042LeuPro: 3.042 ± 1.804
2.738LeuGln: 2.738 ± 1.006
3.955LeuArg: 3.955 ± 1.002
4.563LeuSer: 4.563 ± 0.856
3.347LeuThr: 3.347 ± 1.013
5.172LeuVal: 5.172 ± 2.488
0.608LeuTrp: 0.608 ± 0.293
3.955LeuTyr: 3.955 ± 1.545
0.0LeuXaa: 0.0 ± 0.0
Met
1.217MetAla: 1.217 ± 0.66
0.304MetCys: 0.304 ± 0.146
1.217MetAsp: 1.217 ± 0.541
2.13MetGlu: 2.13 ± 1.025
3.347MetPhe: 3.347 ± 1.825
0.0MetGly: 0.0 ± 0.0
1.521MetHis: 1.521 ± 0.732
1.521MetIle: 1.521 ± 0.985
1.217MetLys: 1.217 ± 0.585
3.651MetLeu: 3.651 ± 1.132
1.521MetMet: 1.521 ± 0.732
1.217MetAsn: 1.217 ± 0.541
0.304MetPro: 0.304 ± 0.146
0.304MetGln: 0.304 ± 0.146
0.913MetArg: 0.913 ± 0.635
2.13MetSer: 2.13 ± 1.025
2.13MetThr: 2.13 ± 1.171
2.13MetVal: 2.13 ± 0.657
0.304MetTrp: 0.304 ± 0.146
0.608MetTyr: 0.608 ± 1.818
0.0MetXaa: 0.0 ± 0.0
Asn
2.738AsnAla: 2.738 ± 0.617
0.608AsnCys: 0.608 ± 0.746
3.042AsnAsp: 3.042 ± 0.902
0.913AsnGlu: 0.913 ± 0.439
3.651AsnPhe: 3.651 ± 0.184
1.521AsnGly: 1.521 ± 0.474
0.608AsnHis: 0.608 ± 0.293
3.955AsnIle: 3.955 ± 2.005
2.738AsnLys: 2.738 ± 1.317
4.259AsnLeu: 4.259 ± 0.932
0.913AsnMet: 0.913 ± 1.613
1.521AsnAsn: 1.521 ± 0.985
2.434AsnPro: 2.434 ± 1.447
0.608AsnGln: 0.608 ± 0.746
4.259AsnArg: 4.259 ± 0.39
3.651AsnSer: 3.651 ± 2.541
3.347AsnThr: 3.347 ± 1.61
3.347AsnVal: 3.347 ± 0.909
0.304AsnTrp: 0.304 ± 0.146
3.347AsnTyr: 3.347 ± 0.846
0.0AsnXaa: 0.0 ± 0.0
Pro
1.825ProAla: 1.825 ± 1.27
0.913ProCys: 0.913 ± 2.605
2.738ProAsp: 2.738 ± 0.802
1.521ProGlu: 1.521 ± 0.626
1.521ProPhe: 1.521 ± 0.732
1.521ProGly: 1.521 ± 0.732
0.304ProHis: 0.304 ± 0.146
4.563ProIle: 4.563 ± 2.238
3.651ProLys: 3.651 ± 0.974
3.042ProLeu: 3.042 ± 3.135
0.608ProMet: 0.608 ± 0.585
1.521ProAsn: 1.521 ± 1.378
3.347ProPro: 3.347 ± 3.615
2.434ProGln: 2.434 ± 1.082
0.913ProArg: 0.913 ± 0.439
3.955ProSer: 3.955 ± 1.539
4.868ProThr: 4.868 ± 1.722
4.259ProVal: 4.259 ± 1.477
0.0ProTrp: 0.0 ± 0.0
1.825ProTyr: 1.825 ± 0.446
0.0ProXaa: 0.0 ± 0.0
Gln
1.521GlnAla: 1.521 ± 0.985
0.0GlnCys: 0.0 ± 0.0
2.13GlnAsp: 2.13 ± 1.025
0.913GlnGlu: 0.913 ± 0.439
1.825GlnPhe: 1.825 ± 1.27
0.913GlnGly: 0.913 ± 1.274
1.521GlnHis: 1.521 ± 0.732
2.13GlnIle: 2.13 ± 3.104
1.521GlnLys: 1.521 ± 0.732
2.434GlnLeu: 2.434 ± 0.719
0.913GlnMet: 0.913 ± 0.635
1.217GlnAsn: 1.217 ± 0.585
0.913GlnPro: 0.913 ± 0.439
2.738GlnGln: 2.738 ± 1.006
2.434GlnArg: 2.434 ± 1.171
2.434GlnSer: 2.434 ± 0.527
2.738GlnThr: 2.738 ± 1.435
2.13GlnVal: 2.13 ± 1.025
0.0GlnTrp: 0.0 ± 0.0
1.825GlnTyr: 1.825 ± 0.446
0.0GlnXaa: 0.0 ± 0.0
Arg
2.13ArgAla: 2.13 ± 1.025
1.217ArgCys: 1.217 ± 0.541
3.042ArgAsp: 3.042 ± 0.726
3.955ArgGlu: 3.955 ± 1.257
2.434ArgPhe: 2.434 ± 1.082
2.13ArgGly: 2.13 ± 0.699
1.825ArgHis: 1.825 ± 0.446
2.738ArgIle: 2.738 ± 0.617
3.955ArgLys: 3.955 ± 0.741
3.955ArgLeu: 3.955 ± 1.274
1.825ArgMet: 1.825 ± 0.878
4.259ArgAsn: 4.259 ± 1.397
1.521ArgPro: 1.521 ± 0.732
1.521ArgGln: 1.521 ± 0.474
1.825ArgArg: 1.825 ± 1.447
2.738ArgSer: 2.738 ± 1.303
2.738ArgThr: 2.738 ± 1.317
1.825ArgVal: 1.825 ± 0.878
2.13ArgTrp: 2.13 ± 0.466
2.13ArgTyr: 2.13 ± 1.025
0.0ArgXaa: 0.0 ± 0.0
Ser
3.651SerAla: 3.651 ± 0.974
1.521SerCys: 1.521 ± 1.88
4.563SerAsp: 4.563 ± 0.48
3.042SerGlu: 3.042 ± 0.902
1.521SerPhe: 1.521 ± 1.527
1.521SerGly: 1.521 ± 0.732
1.825SerHis: 1.825 ± 0.841
5.172SerIle: 5.172 ± 1.658
4.563SerLys: 4.563 ± 0.525
7.301SerLeu: 7.301 ± 1.757
1.217SerMet: 1.217 ± 0.585
3.042SerAsn: 3.042 ± 3.732
3.042SerPro: 3.042 ± 0.295
1.825SerGln: 1.825 ± 0.625
3.651SerArg: 3.651 ± 1.132
5.172SerSer: 5.172 ± 1.352
6.085SerThr: 6.085 ± 0.589
6.693SerVal: 6.693 ± 0.394
0.0SerTrp: 0.0 ± 0.0
2.738SerTyr: 2.738 ± 1.317
0.0SerXaa: 0.0 ± 0.0
Thr
3.347ThrAla: 3.347 ± 1.61
1.825ThrCys: 1.825 ± 0.446
5.476ThrAsp: 5.476 ± 1.928
2.434ThrGlu: 2.434 ± 0.527
3.042ThrPhe: 3.042 ± 0.726
1.825ThrGly: 1.825 ± 0.878
1.521ThrHis: 1.521 ± 0.626
4.563ThrIle: 4.563 ± 0.525
3.347ThrLys: 3.347 ± 1.013
3.955ThrLeu: 3.955 ± 2.005
2.13ThrMet: 2.13 ± 1.025
2.434ThrAsn: 2.434 ± 0.719
4.259ThrPro: 4.259 ± 2.246
4.563ThrGln: 4.563 ± 1.256
5.172ThrArg: 5.172 ± 0.29
4.563ThrSer: 4.563 ± 2.238
8.518ThrThr: 8.518 ± 5.582
4.868ThrVal: 4.868 ± 2.712
0.304ThrTrp: 0.304 ± 0.146
6.389ThrTyr: 6.389 ± 0.48
0.0ThrXaa: 0.0 ± 0.0
Val
4.563ValAla: 4.563 ± 1.474
1.825ValCys: 1.825 ± 0.446
3.651ValAsp: 3.651 ± 0.893
2.13ValGlu: 2.13 ± 0.466
3.651ValPhe: 3.651 ± 3.47
4.259ValGly: 4.259 ± 1.315
1.825ValHis: 1.825 ± 0.878
3.651ValIle: 3.651 ± 1.097
4.259ValLys: 4.259 ± 0.39
3.347ValLeu: 3.347 ± 0.846
1.217ValMet: 1.217 ± 1.25
4.563ValAsn: 4.563 ± 0.48
4.259ValPro: 4.259 ± 0.932
0.913ValGln: 0.913 ± 0.635
4.259ValArg: 4.259 ± 1.241
6.389ValSer: 6.389 ± 3.074
3.347ValThr: 3.347 ± 0.197
5.78ValVal: 5.78 ± 1.942
0.913ValTrp: 0.913 ± 0.439
4.868ValTyr: 4.868 ± 0.369
0.0ValXaa: 0.0 ± 0.0
Trp
0.608TrpAla: 0.608 ± 0.293
0.0TrpCys: 0.0 ± 0.0
0.608TrpAsp: 0.608 ± 0.293
0.0TrpGlu: 0.0 ± 0.0
0.913TrpPhe: 0.913 ± 0.439
0.913TrpGly: 0.913 ± 0.439
0.304TrpHis: 0.304 ± 0.146
0.0TrpIle: 0.0 ± 0.0
0.913TrpLys: 0.913 ± 0.439
0.913TrpLeu: 0.913 ± 0.439
0.0TrpMet: 0.0 ± 0.0
0.304TrpAsn: 0.304 ± 0.146
0.304TrpPro: 0.304 ± 0.146
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.304TrpSer: 0.304 ± 0.146
0.304TrpThr: 0.304 ± 0.146
0.304TrpVal: 0.304 ± 0.146
0.304TrpTrp: 0.304 ± 0.146
0.304TrpTyr: 0.304 ± 0.868
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.955TyrAla: 3.955 ± 2.392
2.13TyrCys: 2.13 ± 0.466
4.563TyrAsp: 4.563 ± 0.984
1.217TyrGlu: 1.217 ± 0.585
3.042TyrPhe: 3.042 ± 0.948
1.825TyrGly: 1.825 ± 0.446
2.738TyrHis: 2.738 ± 0.422
3.042TyrIle: 3.042 ± 0.948
2.738TyrLys: 2.738 ± 1.279
4.868TyrLeu: 4.868 ± 0.369
1.825TyrMet: 1.825 ± 1.816
2.738TyrAsn: 2.738 ± 0.617
1.521TyrPro: 1.521 ± 1.378
1.521TyrGln: 1.521 ± 0.474
2.434TyrArg: 2.434 ± 0.719
2.13TyrSer: 2.13 ± 1.025
2.13TyrThr: 2.13 ± 1.025
4.259TyrVal: 4.259 ± 2.049
0.608TyrTrp: 0.608 ± 0.293
2.738TyrTyr: 2.738 ± 1.317
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3288 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski