Amino acid dipeptide frequency for Drosophila melanogaster totivirus SW-2009a

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
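The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names, the one-letter alphabet, the choice to count overlapping pairs within each protein, and the use of the uniform 2.5‰ expectation as the over/under threshold are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted within each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the standard alphabet to X
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked blue
    return "expected"
```

For example, `classify(7.475)` labels the AlaAla value from the table as overrepresented, while `classify(0.534)` labels AlaHis as underrepresented.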

All values are presented in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
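As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical and chosen only for this illustration.

```python
UNIFORM_PER_MILLE = 2.5  # 1/400 expressed in per mille


def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


def fold_over_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


# AlaAla from the table: 7.475 per mille
ala_ala = 7.475
print(per_mille_to_fraction(ala_ala))  # about 0.007475
print(per_mille_to_percent(ala_ala))   # about 0.7475 (percent)
print(fold_over_uniform(ala_ala))      # about 2.99 times the uniform expectation
```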

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.475AlaAla: 7.475 ± 2.139
1.068AlaCys: 1.068 ± 0.537
4.805AlaAsp: 4.805 ± 0.796
3.203AlaGlu: 3.203 ± 1.612
2.136AlaPhe: 2.136 ± 0.264
3.737AlaGly: 3.737 ± 1.069
0.534AlaHis: 0.534 ± 0.269
3.203AlaIle: 3.203 ± 0.821
4.805AlaLys: 4.805 ± 2.447
6.941AlaLeu: 6.941 ± 0.249
1.068AlaMet: 1.068 ± 0.537
7.475AlaAsn: 7.475 ± 1.328
3.203AlaPro: 3.203 ± 1.612
1.068AlaGln: 1.068 ± 0.537
3.737AlaArg: 3.737 ± 0.259
3.737AlaSer: 3.737 ± 1.069
3.203AlaThr: 3.203 ± 0.801
6.407AlaVal: 6.407 ± 0.02
2.67AlaTrp: 2.67 ± 1.343
3.737AlaTyr: 3.737 ± 0.552
0.0AlaXaa: 0.0 ± 0.0
Cys
0.534CysAla: 0.534 ± 0.269
0.534CysCys: 0.534 ± 0.542
0.0CysAsp: 0.0 ± 0.0
1.602CysGlu: 1.602 ± 1.627
0.534CysPhe: 0.534 ± 0.542
0.534CysGly: 0.534 ± 0.269
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.534CysLys: 0.534 ± 0.542
1.602CysLeu: 1.602 ± 0.816
0.0CysMet: 0.0 ± 0.0
0.534CysAsn: 0.534 ± 0.269
0.534CysPro: 0.534 ± 0.269
1.068CysGln: 1.068 ± 0.537
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.534CysVal: 0.534 ± 0.269
0.534CysTrp: 0.534 ± 0.269
1.068CysTyr: 1.068 ± 0.274
0.0CysXaa: 0.0 ± 0.0
Asp
1.602AspAla: 1.602 ± 0.005
0.534AspCys: 0.534 ± 0.269
2.67AspAsp: 2.67 ± 0.279
6.407AspGlu: 6.407 ± 0.02
2.136AspPhe: 2.136 ± 0.547
3.737AspGly: 3.737 ± 0.552
0.534AspHis: 0.534 ± 0.269
5.339AspIle: 5.339 ± 0.254
2.136AspLys: 2.136 ± 1.074
4.271AspLeu: 4.271 ± 0.527
0.534AspMet: 0.534 ± 0.269
3.203AspAsn: 3.203 ± 0.01
1.068AspPro: 1.068 ± 0.537
4.805AspGln: 4.805 ± 0.796
3.203AspArg: 3.203 ± 0.01
3.203AspSer: 3.203 ± 1.612
1.068AspThr: 1.068 ± 0.274
2.67AspVal: 2.67 ± 0.532
0.534AspTrp: 0.534 ± 0.542
1.068AspTyr: 1.068 ± 0.537
0.0AspXaa: 0.0 ± 0.0
Glu
5.873GluAla: 5.873 ± 0.522
0.534GluCys: 0.534 ± 0.542
1.068GluAsp: 1.068 ± 0.537
3.203GluGlu: 3.203 ± 0.801
2.136GluPhe: 2.136 ± 1.358
2.67GluGly: 2.67 ± 1.089
0.534GluHis: 0.534 ± 0.542
3.737GluIle: 3.737 ± 0.552
5.873GluLys: 5.873 ± 0.522
4.271GluLeu: 4.271 ± 1.338
0.534GluMet: 0.534 ± 0.269
3.737GluAsn: 3.737 ± 2.174
4.271GluPro: 4.271 ± 1.338
1.602GluGln: 1.602 ± 0.806
3.737GluArg: 3.737 ± 0.259
2.136GluSer: 2.136 ± 1.074
5.339GluThr: 5.339 ± 1.368
4.805GluVal: 4.805 ± 1.607
2.136GluTrp: 2.136 ± 0.547
3.203GluTyr: 3.203 ± 0.801
0.0GluXaa: 0.0 ± 0.0
Phe
1.602PheAla: 1.602 ± 0.005
0.534PheCys: 0.534 ± 0.269
3.737PheAsp: 3.737 ± 0.552
2.67PheGlu: 2.67 ± 0.532
0.0PhePhe: 0.0 ± 0.0
4.271PheGly: 4.271 ± 0.284
0.534PheHis: 0.534 ± 0.269
2.136PheIle: 2.136 ± 1.358
2.136PheLys: 2.136 ± 1.358
2.136PheLeu: 2.136 ± 0.264
0.534PheMet: 0.534 ± 0.837
0.534PheAsn: 0.534 ± 0.269
2.136PhePro: 2.136 ± 1.074
0.534PheGln: 0.534 ± 0.269
2.136PheArg: 2.136 ± 0.264
3.737PheSer: 3.737 ± 1.363
1.602PheThr: 1.602 ± 0.005
2.67PheVal: 2.67 ± 1.343
0.534PheTrp: 0.534 ± 0.542
1.068PheTyr: 1.068 ± 1.084
0.0PheXaa: 0.0 ± 0.0
Gly
2.136GlyAla: 2.136 ± 1.074
0.534GlyCys: 0.534 ± 0.542
4.805GlyAsp: 4.805 ± 0.796
3.737GlyGlu: 3.737 ± 0.259
4.805GlyPhe: 4.805 ± 0.826
3.737GlyGly: 3.737 ± 0.259
0.534GlyHis: 0.534 ± 0.542
4.805GlyIle: 4.805 ± 1.637
3.737GlyLys: 3.737 ± 2.174
4.805GlyLeu: 4.805 ± 1.607
0.534GlyMet: 0.534 ± 0.542
4.805GlyAsn: 4.805 ± 0.796
3.737GlyPro: 3.737 ± 0.552
2.136GlyGln: 2.136 ± 0.264
2.136GlyArg: 2.136 ± 0.264
2.67GlySer: 2.67 ± 0.532
2.136GlyThr: 2.136 ± 1.074
2.67GlyVal: 2.67 ± 0.532
2.136GlyTrp: 2.136 ± 1.358
1.068GlyTyr: 1.068 ± 1.084
0.0GlyXaa: 0.0 ± 0.0
His
1.068HisAla: 1.068 ± 0.537
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.602HisGlu: 1.602 ± 0.005
0.534HisPhe: 0.534 ± 0.269
1.602HisGly: 1.602 ± 0.005
0.0HisHis: 0.0 ± 0.0
1.068HisIle: 1.068 ± 0.274
0.534HisLys: 0.534 ± 0.542
3.203HisLeu: 3.203 ± 1.632
0.534HisMet: 0.534 ± 0.269
1.068HisAsn: 1.068 ± 0.537
0.0HisPro: 0.0 ± 0.0
2.136HisGln: 2.136 ± 1.358
0.534HisArg: 0.534 ± 0.542
1.068HisSer: 1.068 ± 0.537
1.068HisThr: 1.068 ± 0.537
1.602HisVal: 1.602 ± 0.806
0.534HisTrp: 0.534 ± 0.542
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.407IleAla: 6.407 ± 2.452
0.0IleCys: 0.0 ± 0.0
1.068IleAsp: 1.068 ± 0.274
4.271IleGlu: 4.271 ± 0.527
3.203IlePhe: 3.203 ± 0.01
3.737IleGly: 3.737 ± 0.259
0.0IleHis: 0.0 ± 0.0
0.534IleIle: 0.534 ± 0.269
3.737IleLys: 3.737 ± 2.174
4.271IleLeu: 4.271 ± 1.094
2.136IleMet: 2.136 ± 0.264
3.737IleAsn: 3.737 ± 0.552
6.407IlePro: 6.407 ± 0.831
2.67IleGln: 2.67 ± 0.279
3.737IleArg: 3.737 ± 0.552
3.737IleSer: 3.737 ± 2.174
4.805IleThr: 4.805 ± 0.826
2.67IleVal: 2.67 ± 0.532
1.602IleTrp: 1.602 ± 0.816
2.136IleTyr: 2.136 ± 1.358
0.0IleXaa: 0.0 ± 0.0
Lys
2.136LysAla: 2.136 ± 1.358
0.0LysCys: 0.0 ± 0.0
3.737LysAsp: 3.737 ± 1.363
4.271LysGlu: 4.271 ± 0.284
1.602LysPhe: 1.602 ± 0.816
3.203LysGly: 3.203 ± 1.632
1.068LysHis: 1.068 ± 0.274
3.737LysIle: 3.737 ± 2.985
5.339LysLys: 5.339 ± 0.557
3.203LysLeu: 3.203 ± 0.821
3.203LysMet: 3.203 ± 0.801
3.737LysAsn: 3.737 ± 1.363
3.737LysPro: 3.737 ± 0.259
4.271LysGln: 4.271 ± 1.905
2.67LysArg: 2.67 ± 1.089
1.602LysSer: 1.602 ± 0.005
6.941LysThr: 6.941 ± 0.562
4.271LysVal: 4.271 ± 0.527
1.602LysTrp: 1.602 ± 0.816
3.737LysTyr: 3.737 ± 2.174
0.0LysXaa: 0.0 ± 0.0
Leu
8.542LeuAla: 8.542 ± 0.244
1.068LeuCys: 1.068 ± 0.537
5.873LeuAsp: 5.873 ± 0.522
4.805LeuGlu: 4.805 ± 0.015
2.67LeuPhe: 2.67 ± 1.089
4.271LeuGly: 4.271 ± 0.284
2.67LeuHis: 2.67 ± 1.089
4.805LeuIle: 4.805 ± 0.796
4.805LeuLys: 4.805 ± 0.826
8.009LeuLeu: 8.009 ± 2.407
0.534LeuMet: 0.534 ± 0.269
3.203LeuAsn: 3.203 ± 0.01
6.941LeuPro: 6.941 ± 1.059
4.271LeuGln: 4.271 ± 2.149
2.67LeuArg: 2.67 ± 1.089
3.737LeuSer: 3.737 ± 1.88
6.941LeuThr: 6.941 ± 0.249
2.136LeuVal: 2.136 ± 0.547
2.67LeuTrp: 2.67 ± 1.343
1.602LeuTyr: 1.602 ± 0.816
0.0LeuXaa: 0.0 ± 0.0
Met
3.203MetAla: 3.203 ± 0.801
0.534MetCys: 0.534 ± 0.269
0.0MetAsp: 0.0 ± 0.0
1.068MetGlu: 1.068 ± 0.537
0.534MetPhe: 0.534 ± 0.269
0.0MetGly: 0.0 ± 0.0
1.602MetHis: 1.602 ± 0.806
1.602MetIle: 1.602 ± 0.816
2.67MetLys: 2.67 ± 0.532
1.068MetLeu: 1.068 ± 0.274
0.0MetMet: 0.0 ± 0.0
0.534MetAsn: 0.534 ± 0.269
1.602MetPro: 1.602 ± 0.005
1.068MetGln: 1.068 ± 0.274
0.534MetArg: 0.534 ± 0.269
2.136MetSer: 2.136 ± 1.074
3.203MetThr: 3.203 ± 1.632
0.534MetVal: 0.534 ± 0.269
0.0MetTrp: 0.0 ± 0.0
1.602MetTyr: 1.602 ± 0.005
0.0MetXaa: 0.0 ± 0.0
Asn
4.271AsnAla: 4.271 ± 1.338
1.068AsnCys: 1.068 ± 1.084
2.136AsnAsp: 2.136 ± 1.074
1.068AsnGlu: 1.068 ± 0.274
0.534AsnPhe: 0.534 ± 0.269
2.67AsnGly: 2.67 ± 0.279
1.602AsnHis: 1.602 ± 0.806
6.407AsnIle: 6.407 ± 3.263
2.67AsnLys: 2.67 ± 0.532
4.805AsnLeu: 4.805 ± 0.796
3.203AsnMet: 3.203 ± 0.243
4.805AsnAsn: 4.805 ± 0.015
4.271AsnPro: 4.271 ± 0.527
3.737AsnGln: 3.737 ± 0.259
3.737AsnArg: 3.737 ± 0.259
3.737AsnSer: 3.737 ± 1.069
2.67AsnThr: 2.67 ± 1.343
3.737AsnVal: 3.737 ± 1.069
1.602AsnTrp: 1.602 ± 0.816
1.602AsnTyr: 1.602 ± 1.627
0.0AsnXaa: 0.0 ± 0.0
Pro
5.339ProAla: 5.339 ± 2.686
0.0ProCys: 0.0 ± 0.0
2.136ProAsp: 2.136 ± 0.264
2.136ProGlu: 2.136 ± 0.264
3.203ProPhe: 3.203 ± 0.801
3.737ProGly: 3.737 ± 0.552
0.534ProHis: 0.534 ± 0.269
3.203ProIle: 3.203 ± 0.01
2.136ProLys: 2.136 ± 0.264
5.873ProLeu: 5.873 ± 0.522
2.67ProMet: 2.67 ± 1.343
3.203ProAsn: 3.203 ± 0.801
2.136ProPro: 2.136 ± 0.264
3.203ProGln: 3.203 ± 0.821
3.737ProArg: 3.737 ± 1.069
4.271ProSer: 4.271 ± 0.284
5.873ProThr: 5.873 ± 1.91
2.67ProVal: 2.67 ± 0.532
1.602ProTrp: 1.602 ± 1.627
1.068ProTyr: 1.068 ± 0.537
0.0ProXaa: 0.0 ± 0.0
Gln
5.873GlnAla: 5.873 ± 0.522
0.0GlnCys: 0.0 ± 0.0
2.67GlnAsp: 2.67 ± 0.532
2.67GlnGlu: 2.67 ± 0.532
0.534GlnPhe: 0.534 ± 0.269
2.67GlnGly: 2.67 ± 0.279
0.534GlnHis: 0.534 ± 0.542
3.203GlnIle: 3.203 ± 0.821
3.203GlnLys: 3.203 ± 0.801
3.737GlnLeu: 3.737 ± 1.069
1.068GlnMet: 1.068 ± 0.274
1.602GlnAsn: 1.602 ± 0.816
3.737GlnPro: 3.737 ± 0.259
2.67GlnGln: 2.67 ± 0.279
1.068GlnArg: 1.068 ± 0.274
3.737GlnSer: 3.737 ± 0.259
2.136GlnThr: 2.136 ± 0.547
3.737GlnVal: 3.737 ± 0.259
0.0GlnTrp: 0.0 ± 0.0
3.203GlnTyr: 3.203 ± 0.801
0.0GlnXaa: 0.0 ± 0.0
Arg
2.67ArgAla: 2.67 ± 0.532
0.0ArgCys: 0.0 ± 0.0
3.203ArgAsp: 3.203 ± 0.801
1.602ArgGlu: 1.602 ± 0.005
1.602ArgPhe: 1.602 ± 0.005
3.737ArgGly: 3.737 ± 0.259
0.534ArgHis: 0.534 ± 0.542
1.602ArgIle: 1.602 ± 0.806
3.203ArgLys: 3.203 ± 1.632
4.271ArgLeu: 4.271 ± 0.284
1.068ArgMet: 1.068 ± 0.274
2.136ArgAsn: 2.136 ± 0.264
1.068ArgPro: 1.068 ± 0.274
2.136ArgGln: 2.136 ± 0.547
4.271ArgArg: 4.271 ± 0.284
3.203ArgSer: 3.203 ± 0.801
2.136ArgThr: 2.136 ± 0.547
3.737ArgVal: 3.737 ± 0.259
0.534ArgTrp: 0.534 ± 0.269
2.67ArgTyr: 2.67 ± 0.532
0.0ArgXaa: 0.0 ± 0.0
Ser
4.271SerAla: 4.271 ± 2.149
1.068SerCys: 1.068 ± 0.274
2.136SerAsp: 2.136 ± 1.074
3.737SerGlu: 3.737 ± 0.259
3.737SerPhe: 3.737 ± 0.552
4.805SerGly: 4.805 ± 0.015
2.136SerHis: 2.136 ± 0.264
2.136SerIle: 2.136 ± 0.264
4.271SerLys: 4.271 ± 1.905
2.136SerLeu: 2.136 ± 0.547
2.67SerMet: 2.67 ± 0.532
3.203SerAsn: 3.203 ± 0.01
1.068SerPro: 1.068 ± 0.274
1.068SerGln: 1.068 ± 0.537
3.737SerArg: 3.737 ± 1.88
3.203SerSer: 3.203 ± 0.821
5.873SerThr: 5.873 ± 2.144
3.737SerVal: 3.737 ± 0.259
0.0SerTrp: 0.0 ± 0.0
2.67SerTyr: 2.67 ± 1.089
0.0SerXaa: 0.0 ± 0.0
Thr
3.737ThrAla: 3.737 ± 0.552
1.068ThrCys: 1.068 ± 1.084
4.805ThrAsp: 4.805 ± 0.015
5.339ThrGlu: 5.339 ± 0.254
1.602ThrPhe: 1.602 ± 0.005
3.737ThrGly: 3.737 ± 0.552
2.67ThrHis: 2.67 ± 0.279
5.339ThrIle: 5.339 ± 1.064
4.805ThrLys: 4.805 ± 1.637
3.737ThrLeu: 3.737 ± 1.069
0.534ThrMet: 0.534 ± 0.542
5.339ThrAsn: 5.339 ± 1.064
2.136ThrPro: 2.136 ± 1.358
3.737ThrGln: 3.737 ± 0.259
1.068ThrArg: 1.068 ± 0.537
6.941ThrSer: 6.941 ± 0.562
5.873ThrThr: 5.873 ± 0.522
4.271ThrVal: 4.271 ± 1.338
1.602ThrTrp: 1.602 ± 0.005
2.136ThrTyr: 2.136 ± 0.264
0.0ThrXaa: 0.0 ± 0.0
Val
3.737ValAla: 3.737 ± 1.069
1.068ValCys: 1.068 ± 0.274
3.203ValAsp: 3.203 ± 1.612
4.805ValGlu: 4.805 ± 0.015
3.203ValPhe: 3.203 ± 0.01
3.737ValGly: 3.737 ± 1.069
0.534ValHis: 0.534 ± 0.269
2.136ValIle: 2.136 ± 0.264
3.203ValLys: 3.203 ± 0.821
5.873ValLeu: 5.873 ± 2.955
1.068ValMet: 1.068 ± 0.537
3.737ValAsn: 3.737 ± 1.069
6.941ValPro: 6.941 ± 0.249
3.203ValGln: 3.203 ± 0.801
1.068ValArg: 1.068 ± 0.274
1.068ValSer: 1.068 ± 0.537
5.339ValThr: 5.339 ± 1.875
5.339ValVal: 5.339 ± 1.064
1.602ValTrp: 1.602 ± 0.816
1.068ValTyr: 1.068 ± 0.537
0.0ValXaa: 0.0 ± 0.0
Trp
1.602TrpAla: 1.602 ± 0.806
0.0TrpCys: 0.0 ± 0.0
1.068TrpAsp: 1.068 ± 0.274
1.068TrpGlu: 1.068 ± 1.084
0.0TrpPhe: 0.0 ± 0.0
0.534TrpGly: 0.534 ± 0.269
0.534TrpHis: 0.534 ± 0.542
1.068TrpIle: 1.068 ± 1.084
2.136TrpLys: 2.136 ± 2.169
3.737TrpLeu: 3.737 ± 0.259
0.534TrpMet: 0.534 ± 0.269
2.67TrpAsn: 2.67 ± 0.279
1.068TrpPro: 1.068 ± 0.274
1.602TrpGln: 1.602 ± 0.806
1.068TrpArg: 1.068 ± 0.274
1.068TrpSer: 1.068 ± 1.084
1.068TrpThr: 1.068 ± 1.084
1.068TrpVal: 1.068 ± 0.537
0.0TrpTrp: 0.0 ± 0.0
0.534TrpTyr: 0.534 ± 0.542
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.136TyrAla: 2.136 ± 0.264
0.534TyrCys: 0.534 ± 0.269
1.602TyrAsp: 1.602 ± 0.816
2.136TyrGlu: 2.136 ± 0.547
1.068TyrPhe: 1.068 ± 0.537
0.534TyrGly: 0.534 ± 0.269
1.068TyrHis: 1.068 ± 0.274
4.805TyrIle: 4.805 ± 1.637
2.136TyrLys: 2.136 ± 1.358
4.271TyrLeu: 4.271 ± 1.905
0.534TyrMet: 0.534 ± 0.542
1.068TyrAsn: 1.068 ± 0.537
2.67TyrPro: 2.67 ± 0.532
1.068TyrGln: 1.068 ± 1.084
0.534TyrArg: 0.534 ± 0.542
2.67TyrSer: 2.67 ± 0.279
3.203TyrThr: 3.203 ± 0.01
2.67TyrVal: 2.67 ± 0.532
0.534TyrTrp: 0.534 ± 0.542
1.602TyrTyr: 1.602 ± 0.806
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1874 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
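A protein-level bootstrap of this kind could look roughly like the sketch below. The exact procedure used by Proteome-pI is not described on this page, so the details here are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names and the fixed seed are hypothetical.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # seeded so that the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.pstdev(estimates)
```

With only two proteins, as in this proteome, each resample can take one of very few compositions, so the resulting error estimates should be read with caution.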

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski