Software Engineer, Blockchain / Web3, Data Engineer
Location: New YorkAllium makes blockchain data accurate, simple and fastBlockchain data is hard, messy, and chaoticWhen we started out in late 2021 our thesis was simple – blockchain data, despite it being public and free, was difficult to understand, clunky to access and troublesome to maintain. Answering a simple question like “Who are the biggest Ethereum token holders over time?” requires an engineering team to run their own RPC nodes, ingest the full history of the blockchain, clean the data, transform the data and finally summon a wizard to cast a complex SQL query.Accessing data is hard because blockchains are optimized for Writes and not ReadsBlockchains have historically been optimized for Writes (getting data onto the blockchain) and less for Reads (getting data out of the blockchain). This focus on transaction throughput and fault‑tolerant consensus has made it hard to get data out efficiently and reliably at scale. Parsing and interpreting blockchain data requires both deep domain expertise and data manipulationBlockchains are virtual computers, not databases.They support general computations, and anyone can write and deploy their own smart contract for their own use case. The resulting fragmentation of data schemas requires deep domain expertise to turn esoteric outputs into clear information for concepts like tokens, NFTs, stable coins and DEXs. Allium abstracts the complexity with a simple way to query blockchain dataAllium tames the chaos by ingesting, sanitizing, and standardizing all this data. As of this post, the data we’ve archived across 40+ blockchains is in the petabytes and growing exponentially.Google and Bloomberg had to organize the world’s public financial and webpage data – Allium is on a mission to do the same for blockchain dataWe index a giant public dataset that is sorely needed by everyone – similar to what Bloomberg did for financial data and what Google organized for public webpages. With this indexed data we support trailblazers in industry trends such as NFTs, stable coins and decentralized exchanges. About our customersWe serve two groups of customers with the same data but different platforms:Analysts who need to answer data questions (BI focus) and Engineers who need highly reliable, near‑real‑time queryable data (application backends).Our customers include Visa, Stripe, Grayscale, Phantom, Uniswap, and other major institutions and crypto companies. About the RoleWe love engineers who solve new problems every single day. Responsibilities• Data egress – How to transport hundreds of terabytes of data worldwide without breaking the piggybank. • Handle high traffic – Support the biggest applications and handle 100,000 QPS at peak traffic without downtime. • Botnets – Detect botnets based on behavioral patterns in the early days of the industry.• Fraud (Sybil) detection – Transfer fraud detection heuristics into the blockchain world. • Who is real? – Define meaningful and organic transactions on the blockchain. • BringYour Own Transformation – Let customers design their own APIs and transform their own real‑time data streams. • Data governance – Ensure data consistency across every copy and every region 24/7. • AI and LLMs – Design the LLM and AI experience on top of our data to lower the barrier of entry to crypto data. • Data transformation holy grail – Unify streaming and batch transformation logic into a single code base.If any of those bore you, we have many more problems to solve. Allium sizzle reelGiant infrastructure budget per headYou will make mistakes – costly mistakes – but at Allium’s expense. We have an internal leader board of the costliest infrastructure mistakes made, and we learn from them. We provide a huge infrastructure budget to help you refine your craft. We leverage every tool (no prerequisites) because we meet our enterprise customers where they are at:• Every OLAP:Snowflake, Databricks, Bigquery, Clickhouse*• Every OLTP:Postgres, Aurora• Every event bus:Kafka, SNS, Pub Sub• Every cloud provider: AWS, GCP, Azure (one day)• A copy of data in every region: US East, Central, West, Europe, Asia• Every data transformation and orchestration tool:Apache Beam, Materialize, Tiny Bird, DBT, SQLMesh, Temporal• Data governance tools:Data FoldWe invite people of all backgrounds.Engineers who started coding late, who learned on the side, who are still in school, who went to top schools, are all welcome if you bring a curious mind and an infectious work ethic. Administrative BenefitsMedical, Dental, Vision, Life and AD&D insurance – US folks get 100% coverage for Gold plans, 80% for dependents. Note:The sun never sets on Allium – we hire from any geographic location as long as you can overlap two hours of NYC morning overlap Mon‑Thurs from 10am‑12pm ET. We have people based in New York, Seattle, Singapore, and Australia.#J-18808-Ljbffr Apply tot his job