Docker Usage and Configuration
Translated by GPT4o from my original chinese page. Come try one-click foolproof distribution and become a Docker expert! In software development, whenever you encounter CI/CD concepts, Docker is indispensable. Docker Basic Concepts1. Docker IntroductionDocker is an open-source containerization technology that can package applications and all their dependencies into a portable container. Through Docker, you can ensure application consistency across different environments (development,...
Commonly Used Commands
When I first started studying computers, I was resistant to using Linux and Git, but once I got used to them, I found them extremely useful. This document is a record of frequently used commands so I don’t have to look them up or ask GPT each time. Common Linux CommandsSystem Informationuname -a # Display system information hostname # Show or set the hostname whoami # Current logged-in user uptime # System uptime and load date ...
AWS EC2 Free Tier and Dify 0.12.1 Experience
Translated by GPT4o from my original chinese page. The Dify official site already has detailed deployment tutorials, and you can find guides on Bilibili or YouTube too, so I won’t repeat those here. For source code deployment (the main method in this article), once you’ve cloned the repo, the key is to follow the steps in the README.md files under the dify/api and dify/web folders step by step. This post focuses on deploying Dify on AWS and some pitfalls I encountered. Tools used: AWS EC2...
Devops, MLOps to LLMOps
I can’t live without Large Language Models now 😢. Begin from CI/CDThere are many definitions of CI/CD (Continuous Integration/Continuous Delivery) online, but everything related to XXXOps is based on CI/CD. In my understanding, CI/CD is a development and operations practice that automates building, testing, and deployment processes to improve software delivery efficiency and quality. Just imagine how you use GitHub and Docker. You commit code to a repository, and...
Internship as NLP Engineer...
Back from a long gap… I enjoyed my Spring Break during crafting last post, but I just realized I still didn’t have an intern after a wonderful road trip (This is how PEER PRESSURE looks like!!!). Ok so, started from March, I put a lot of effort on hunting an intern. Unfortunately, most application process of intern in US had already passed deadline (another reason was: I gave up on finding intern under current suck market…), but fortunately, most summer interns just start recruitment in...
Copula III — Applications
In this post, we may quickly go through the applications for Copulas. Firstly, let’s talk about quantile regression based on Copula, and then move to a real-world case about anomalies detection. 1, Quantile RegressionSynthetic DataUnfortunately, I do not find a good way to synthetic data in Python, but R does provide some great functions for Copulas. I’ll use R to generate artificial data, and use Python to do the quantile regression. (However, I highly recommend to use R to do everything...
Copula II — Definition
The word Copula derives from the Latin noun for a “link” or “tie” that connects two different things. When I talk about Copula in this post, actually I mean the Sklar’s Theorem. Here is what the Sklar’s Theorem tells us: we always can find a Copula function \(C\) that every multivariate (a.k.a joint) cumulative distribution function(CDF) can be expressed by its marginal CDFs. In other words, we can use copula function to analyze the dependence between each random variable, and Sklar’s...
Copulas I — Probability Integral Transform
When I wrote this article, I was enjoying my spring break after winter quarter (Greetings from Atlanta😃). Initially, my intention was to delve into the intricacies of Copulas. However, I realized the paramount importance of laying a solid foundation by elucidating the basic concept of the Probability Integral Transform before advancing further. Definition & Proof Definition: For a random variable $X$, the function $F$ defined by $$F_{X}(x) = P(X \leq x),\quad x \in \mathbb{R}$$,...
Simple NLP — Sentiment Analysis
Translated by ChatGPT A few days ago, during an interview, the interviewer suddenly asked me, “Why are you interested in pursuing a major related to data science?” I was momentarily at a loss for words. Saying it was purely out of love seemed somewhat insincere—it’s all about making a living, after all. There aren’t always grand narratives or beautiful visions. However, I answered the interviewer like this: I said that perhaps, without us knowing, some simple data analyses could bring us a...
Advanced Hexo Setup Tips, 2024 Edition
Translated by ChatGPT In the previous article, we built a very basic website. This time, let’s take a look at some personal configurations I plan to make in 2024. In fact, the original documentation and blogger lijunliang‘s articles have already provided very detailed instructions, which are likely clearer and easier to understand than what an outsider like me could write. However, these articles might be a bit dated, and some additional configurations may be necessary in 2024. This...
Hexo + Github Personal Website Buildup
Translated by ChatGPT As the first post, let’s talk about how this website was made. I’ve wanted to create a personal website for a long time, but I gave up on the idea due to not knowing too much about it. Otherwise, I would have to start learning some strange JavaScript and CSS related stuff from scratch, which would be too torturous for me. However, while chatting yesterday, a friend mentioned, “Don’t you guys have something like a personal portfolio?” I realized I didn’t, but then I...